| |
Last updated on April 28, 2025. This conference program is tentative and subject to change
Technical Program for Thursday July 10, 2025
To show or hide the keywords and abstract (text summary) of a paper (if available), click on the paper title
Open all abstracts
Close all abstracts
Presentation  In person  On-line  No presentation  No information
|
ThA01 |
Plaza AB |
RI - Model Predictive Control II |
RI Session |
Chair: Hosseinzadeh, Mehdi | Washington State University |
Co-Chair: Paredes Salazar, Juan Augusto | University of Maryland, Baltimore Couunty |
|
10:00-10:03, Paper ThA01.1 | |
Model Predictive Control for Systems with Partially Unknown Dynamics under Signal Temporal Logic Specifications |
|
Dai, Zhao Feng | University of Waterloo |
Pant, Yash Vardhan | University of Waterloo |
Smith, Stephen L. | University of Waterloo |
Keywords: Predictive control for nonlinear systems, Stochastic optimal control, Autonomous systems
Abstract: In this work, we design a model predictive controller (MPC) for systems to satisfy Signal Temporal Logic (STL) specifications when the system dynamics are partially unknown, and only a nominal model and past runtime data are available. Our approach uses Gaussian process regression to learn a stochastic, data-driven model of the unknown dynamics, and manages uncertainty in the STL specification resulting from the stochastic model using Probabilistic Signal Temporal Logic (PrSTL). The learned model and PrSTL specification are then used to formulate a chance-constrained MPC. For systems with high control rates, we discuss a modification for improving the solution speed of the control optimization. In simulation case studies, our controller increases the frequency of satisfying the STL specification compared to controllers that use only the nominal dynamics model.
|
|
10:03-10:06, Paper ThA01.2 | |
An Optimized Behavioral Intervention for Managing Gestational Weight Gain Using Semi-Physical Modeling and Hybrid Model Predictive Control |
|
Khan, Owais | Arizona State University |
Campregher, Francesco | University of Brescia |
Rivera, Daniel E. | Arizona State Univ |
Visioli, Antonio | University of Brescia |
Pauley, Abigail | Pennsylvania State University |
Downs, Danielle | Penn State University |
Keywords: Biomedical, Hybrid systems, Predictive control for linear systems
Abstract: This paper describes an optimized behavioral intervention Healthy Mom Zone (HMZ) for managing gestational weight gain featuring sequential decision-making using Hybrid Model Predictive Control (HMPC). Dynamical models incorporating both behavioral and physiological aspects of the problem are presented and estimated from HMZ participant data via constrained semi-physical modeling. Daily measurements are provided to a controller that ultimately makes judicious (though infrequent) augmentations on categorical dosages of healthy eating and physical activity intervention components. Consequently, an HMPC algorithm is required which must follow a logical sequence of control actions conforming to practical requirements. A case study shows the benefits relative to a conventional "IF-THEN" approach. The computational framework (both modeling and control) serves as the basis for the Healthy Mom Zone 2.0 intervention currently being evaluated in a randomized clinical trial (NIH R01DK134863, NCT05807594) at Penn State University.
|
|
10:06-10:09, Paper ThA01.3 | |
REVISE: Robust Probabilistic Motion Planning in a Gaussian Random Field |
|
Rose, Alex | Massachusetts Institute of Technology |
Aggarwal, Naman | Massachusetts Institute of Technology |
Jewison, Christopher | Massachusetts Institute of Technology |
How, Jonathan P. | MIT |
Keywords: Stochastic optimal control, Randomized algorithms, Optimization
Abstract: This paper presents Robust samplE-based coVarIance StEering (REVISE), a multi-query algorithm that generates robust belief roadmaps for dynamic systems navigating through spatially dependent disturbances modeled as a Gaussian random field. Our proposed method develops a novel robust sample-based covariance steering edge controller to safely steer a robot between state distributions, satisfying state constraints along the trajectory. Our proposed approach also incorporates an edge rewiring step into the belief roadmap construction process, which provably improves the coverage of the belief roadmap. When compared to state-of-the-art methods [1], [2], REVISE improves median plan accuracy (as measured by Wasserstein distance between the actual and planned final state distribution) by 10x in multi-query planning and reduces median plan cost (as measured by the largest eigenvalue of the planned state covariance at the goal) by 2.5x in single-query planning for a 6DoF system. Our code is available at https://acl.mit.edu/REVISE/.
|
|
10:09-10:12, Paper ThA01.4 | |
Adaptive Kinetic Monte Carlo-Based Model Predictive Control for Mitigating Catalyst Deactivation |
|
Nagpal, Satchit | Texas A&M University |
Kim, Youngjo | Hanwha Solutions Chemical Division |
Kwon, Joseph | Texas A&M University |
Keywords: Stochastic optimal control, Modeling, Manufacturing systems
Abstract: Dry methane reforming (DRM) offers a promising avenue to convert the greenhouse gases such as methane (CH4) and carbon dioxide (CO2) into syngas for synthesizing long-chain alkanes. However, catalyst deactivation remains a significant obstacle, primarily due to the deep cracking of CH4 and CO2 and the metal-catalyzed formation of carbon whiskers on the catalyst surface. To tackle this challenge, we introduce a comprehensive multiscale model that analyzes DRM reactions on nickel surfaces and predicts carbon whisker formation. This model integrates microscopic kinetic Monte Carlo (kMC) simulations for surface reactions, mesoscopic pelletscale modeling for whisker growth prediction, and macroscopic modeling to account for packed bed porosity and reactor pressure drop. Moreover, by applying time-steppers using the gap-tooth scheme, we significantly enhance computational efficiency, reducing computational time from 17 days to 6 hours with minimal loss in accuracy for whisker length prediction. Controlling catalyst deactivation is crucial for maintaining the efficiency and economic viability of the DRM process. Implementing optimal operation for reactor’s jacket temperature enables effectively mitigating catalyst deactivation and prolonging catalyst lifespan. Our multiscale model not only aids in understanding the factors affecting catalyst deactivation and surface reaction kinetics but also provides a robust framework for implementing MPC strategies to optimize and control the DRM process.
|
|
10:12-10:15, Paper ThA01.5 | |
Multirate Model Predictive Control of Inner-Outer Loops |
|
Islam, Syed Aseem Ul | University of Michigan |
Paredes Salazar, Juan Augusto | University of Maryland, Baltimore Couunty |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Control system architecture, Sampled-data control, Predictive control for nonlinear systems
Abstract: Inner-outer-loop control is widely used for controlling mechanical systems with time-scale separation. Model Predictive Control (MPC) is a popular technique for systems that require command following with state and control constraints. We present a conversion algorithm for MPC-based inner- and outer-loop control by accounting for timing intricacies. For uncertain systems, we apply inner-outer-loop control based on predictive cost adaptive control (PCAC) to flight-control examples, with and without the conversion algorithm. Numerical examples show that inner-outer-loop PCAC improves command following and constraint satisfaction when the conversion algorithm is used. The investigation in this paper is numerical, and thus the contribution is technological rather than theoretical.
|
|
10:15-10:18, Paper ThA01.6 | |
Model Predictive Path Integral Control of I2RIS Robot Using RBF Identifier and Extended Kalman Filter |
|
Esfandiari, Mojtaba | Johns Hopkins University |
Du, Pengyuan | Johns Hopkins University |
Wei, Haochen | Johns Hopkins University |
Gehlbach, Peter | Johns Hopkins University |
Munawar, Adnan | Johns Hopkins University |
Kazanzides, Peter | Johns Hopkins University |
Iordachita, Iulian | Johns Hopkins University |
Keywords: Mechanical systems/robotics, Stochastic optimal control, Kalman filtering
Abstract: Modeling and controlling cable-driven snake robots is a challenging problem due to nonlinear mechanical properties such as hysteresis, variable stiffness, and unknown friction between the actuation cables and the robot body. This challenge is more significant for snake robots in ophthalmic surgery applications, such as the Improved Integrated Robotic Intraocular Snake (I2RIS), given its small size and lack of embedded sensory feedback. Data-driven models take advantage of global function approximations, reducing complicated analytical models' challenge and computational costs. However, their performance might deteriorate in case of new data unseen in the training phase. Therefore, adding an adaptation mechanism might improve these models' performance during snake robots' interactions with unknown environments. In this work, we applied a model predictive path integral (MPPI) controller on a data-driven model of the I2RIS based on the Gaussian mixture model (GMM) and Gaussian mixture regression (GMR). To analyze the performance of the MPPI in unseen robot-tissue interaction situations, unknown external disturbances and environmental loads are simulated and added to the GMM-GMR model. These uncertainties of the robot model are then identified online using a radial basis function (RBF) whose weights are updated using an extended Kalman filter (EKF). Simulation results demonstrated the robustness of the optimal control solutions of the MPPI algorithm and its computational superiority over a conventional model predictive control (MPC) algorithm.
|
|
10:18-10:21, Paper ThA01.7 | |
Closed-Loop Analysis of ADMM-Based Suboptimal Linear Model Predictive Control |
|
Srikanthan, Anusha | University of Pennsylvania |
Karapetyan, Aren | ETH Zürich |
Kumar, Vijay | University of Pennsylvania |
Matni, Nikolai | University of Pennsylvania |
Keywords: Optimal control, Optimization algorithms, Constrained control
Abstract: Many practical applications of optimal control are subject to real-time computational constraints. When applying model predictive control (MPC) in these settings, respecting timing constraints is achieved by limiting the number of iterations of the optimization algorithm used to compute control actions at each time step, resulting in so-called suboptimal MPC. This paper proposes a suboptimal MPC scheme based on the alternating direction method of multipliers (ADMM). With a focus on the linear quadratic regulator problem with state and input constraints, we show how ADMM can be used to split the MPC problem into iterative updates of an unconstrained optimal control problem (with an analytical solution), and a dynamics-free feasibility step. We show that using a warm-start approach combined with enough iterations per time-step, yields an ADMM-based suboptimal MPC scheme which asymptotically stabilizes the system and maintains recursive feasibility.
|
|
10:21-10:24, Paper ThA01.8 | |
Ultrasound-Informed Recursive Koopman Model Predictive Control for Ankle Assistance |
|
Singh, Mayank | North Carolina State Univeristy |
Lambeth, Krysten | North Carolina State University |
Hakam, Noor | North Carolina State University |
Sharma, Nitin | North Carolina State University |
Keywords: Predictive control for linear systems, Human-in-the-loop control, Nonlinear systems identification
Abstract: This paper presents an ultrasound-informed recursive Koopman Model Predictive Control (RK-MPC) framework for enhancing gait assistance in individuals with impaired locomotion. By leveraging muscle activation and fatigue estimates derived from ultrasound imaging, the proposed approach improves the performance of predictive control strategies for functional electrical stimulation (FES) systems. The Koopman operator is used to linearize the nonlinear gait dynamics in a higher-dimensional space. We develop the framework for phase-specific dynamics for the stance and swing phases. We incorporate ultrasound-derived muscle fatigue measurements to design update laws for the Koopman operator to improve prediction accuracy. We present our simulation results for ankle motion control to track a desired gait trajectory while monitoring a prescribed fatigue level.
|
|
10:24-10:27, Paper ThA01.9 | |
Improved Offline Design for Robust MPC for Polytopic Time-Varying Systems |
|
Aravena, Marcelo S. | Unicamp |
Muñoz-Carpintero, Diego | Universidad De O'Higgins |
Palma Olate, Jonathan Matias | U Talca |
Oliveira, Ricardo C. L. F. | University of Campinas - UNICAMP |
Keywords: Predictive control for linear systems, Linear parameter-varying systems, LMIs
Abstract: This paper introduces a novel LMI-based approach for synthesizing robust controllers for polytopic time-varying systems within the Model Predictive Control (MPC) framework. Compared to existing methods in the literature addressing the same problem, the proposed approach offers the following improvements and innovations for the offline design: the use of parameter-dependent Lyapunov matrices; the design of controllers of arbitrary order; the simultaneous design of all controller matrices; and the incorporation of the scalar arising from the S-procedure, which typically requires a time-consuming linear search, into the synthesis conditions affinely, allowing it to be optimized jointly with the other variables. The synthesis procedure is formulated as an iterative algorithm where LMIs are solved at each iteration. The effectiveness of the proposed strategy is demonstrated through numerical examples, highlighting its feasibility and superior performance, particularly in achieving larger invariant regions compared to existing methods.
|
|
10:27-10:30, Paper ThA01.10 | |
Brunovsky Riccati Recursion for Linear Model Predictive Control |
|
Yang, Shaohui | EPFL |
Ohtsuka, Toshiyuki | Kyoto Univ |
Jones, Colin N. | EPFL |
Keywords: Predictive control for linear systems, Numerical algorithms, Linear systems
Abstract: In almost all algorithms for Model Predictive Control (MPC), the most time-consuming step is to solve some form of Linear Quadratic (LQ) Optimal Control Problem (OCP) repeatedly. The commonly recognized best option for this is a Riccati recursion based solver, which has a time complexity of mathcal{O}(N(n_x^3 + n_x^2 n_u + n_x n_u^2 + n_u^3)). In this paper, we propose a novel textit{Brunovsky Riccati Recursion} algorithm to solve LQ OCPs for Linear Time Invariant (LTI) systems. The algorithm transforms the system into Brunovsky form, formulates a new LQ cost (and constraints, if any) in Brunovsky coordinates, performs the Riccati recursion there, and converts the solution back. Due to the sparsity (block-diagonality and zero-one pattern per block) of Brunovsky form and the data parallelism introduced in the cost, constraints, and solution transformations, the time complexity of the new method is greatly reduced to mathcal{O}(n_x^3 + N(n_x^2 n_u + n_x n_u^2 + n_u^3)) if N threads/cores are available for parallel computing.
|
|
10:30-10:33, Paper ThA01.11 | |
Robust Parametric Shrinking Horizon Model Predictive Control and Its Application to Spacecraft Rendezvous |
|
Castroviejo-Fernandez, Miguel | University of Michigan |
Ambrosino, Michele | University of Michigan |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Predictive control for linear systems, Robust control, Aerospace
Abstract: This paper introduces a robust Model Predictive Control approach in which a shrinking prediction horizon and a system input parameterization are exploited to control a linear system with set-bounded disturbances while satisfying state and control constraints. By exploiting input parameterization, the number of decision variables in the optimal control problem and the computational time can be reduced. The simulated spacecraft rendezvous maneuver is used to highlight the potential of the proposed approach for practical applications.
|
|
10:33-10:36, Paper ThA01.12 | |
Robust Steady-State-Aware Model Predictive Control for Systems with Limited Computational Resources and External Disturbances |
|
Jafari Ozoumchelooei, Hassan | Washington State University |
Hosseinzadeh, Mehdi | Washington State University |
Keywords: Predictive control for linear systems, Robust control, Optimal control
Abstract: Model Predictive Control (MPC) is a powerful control strategy; however, its reliance on online optimization poses significant challenges for implementation on systems with limited computational resources. One possible approach to address this issue is to shorten the prediction horizon and adjust the conventional MPC formulation to enlarge the region of attraction. However, these methods typically introduce additional computational load. Recently, steady-state-aware MPC has been introduced to ensure output tracking and convergence to a given desired steady-state configuration while maintaining constraint satisfaction at all times without adding extra computational load. Despite its promising performance, steady-state-aware MPC does not account for external disturbances, which can significantly limit its applicability to real-world systems. This paper aims to advance the method further by enhancing its robustness against external disturbances. To achieve this, we adopt the tube-based design framework, which decouples nominal trajectory optimization from robust control synthesis, thereby requiring no additional online computational resources. Theoretical guarantees of the proposed methodology are shown analytically, and its effectiveness is assessed through simulations and experimental studies on a Parrot Bebop 2 drone.
|
|
10:36-10:39, Paper ThA01.13 | |
Application of Root-Finding Methods to Iterative Model Predictive Control of Pseudo-Linear Systems |
|
Abdulelah Alhazmi, Rami | University of Michigan |
Paredes Salazar, Juan Augusto | University of Maryland, Baltimore Couunty |
Islam, Syed Aseem Ul | University of Michigan |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Predictive control for nonlinear systems, Iterative learning control, Constrained control
Abstract: For nonlinear systems that can be written in pseudo-linear form, we use iterative model predictive control (IMPC) for receding-horizon optimization. Pseudo-linear models, which are written in terms of state-dependent-coefficients (SDC’s), are widely used with the state-dependent Riccati equation. IMPC iterates over the horizon by updating the future control and state sequences until the future control sequence converges. To facilitate convergence, this paper explores the effectiveness of four root-finding methods and compares their performance with fixed-point iteration. At each iteration, a quadratic programming problem is solved using the current SDC’s to obtain a full-state-feedback controller. To assess the effectiveness of the root-finding methods, each technique is used to stabilize a collection of benchmark nonlinear systems.
|
|
10:39-10:42, Paper ThA01.14 | |
Real-Time Tuning of Time-Varying Weight Parameter for Nonlinear Model Predictive Control Using Adversarial Objective Function |
|
Ishihara, Shinji | Hitachi Ltd |
Ohtsuka, Toshiyuki | Kyoto Univ |
Keywords: Predictive control for nonlinear systems, Optimal control, Computational methods
Abstract: Nonlinear Model Predictive Control (NMPC) is a powerful tool that can be applied to various control objects. However, it is known that the behavior realized by the NMPC varies greatly depending on the results of tuning the weight parameters of the objective function. In recent years, some methods have been proposed to automatically tune the weight parameters by utilizing tools such as machine learning, but these methods require a large number of prior trials. We propose a method to tune time-varying stage cost weight parameters in the NMPC for reference tracking without prior trials so as to minimize the tracking error. We formulate the NMPC as a min-max problem with the weight parameters as the maximizer and the control input as the minimizer, in order to optimize the weight parameter and the control input simultaneously. Furthermore, treating this optimization problem as a nonlinear receding-horizon differential game, we develop a real-time optimization algorithm. The effectiveness of the proposed method was confirmed by numerical simulations. We confirmed that the proposed method improves the control performance over the NMPC using fixed weight parameters designed by Bayesian optimization.
|
|
10:42-10:45, Paper ThA01.15 | |
Continuation Method for Nonsmooth Model Predictive Control Using Proximal Technique |
|
Shima, Ryotaro | Toyota Central R&D Labs |
Moriyasu, Ryuta | Toyota Central R&D Labs |
Kato, Teruki | Toyota Central R&D Labs., Inc |
Keywords: Predictive control for nonlinear systems, Optimal control, Numerical algorithms
Abstract: This paper presents a novel framework for the continuation method of model predictive control based on optimal control problem with a nonsmooth regularizer. Via the proximal operator, the first-order optimality inclusion relation is reformulated into an equation system, to which the continuation method is applicable. In addition, we present constraint qualifications that ensure the well-posedness of the proposed equation system. A numerical example is also presented that demonstrates the effectiveness of our approach.
|
|
10:45-10:48, Paper ThA01.16 | |
Investigating Resilience of Cyberattack Detection Using Lyapunov-Based Economic Model Predictive Control to Data Poisoning |
|
Durand, Helen | Wayne State University |
Leonard, Akkarakaran Francis | Wayne State University |
Keywords: Predictive control for nonlinear systems, Process Control, Lyapunov methods
Abstract: Cyberattacks may be performed on process control systems due to their integration of networking and computing with physical systems. Prior work in our group has developed detection strategies for nonlinear systems under sensor, actuator, and combined sensor and actuator attacks which can ensure, under characterizable conditions, that attacks can be detected before they cause safety issues. However, this work did not take into account the potential that an attacker could attempt to provide data to a process that causes an attack to remain undetected but that also is consistent with different process dynamics than those which the process has. This could be problematic if process models are consistently updated online based on the most recent process data, such that skewed data during routine process operation (or during operations designed to produce informative data for model updates) could lead to new models being developed for model-based controllers that are, in a sense, ``specified'' by the attacker. This work provides steps toward understanding how to prevent an attacker from achieving these types of data poisoning attacks for processes under Lyapunov-based economic model predictive control (LEMPC).
|
|
10:48-10:51, Paper ThA01.17 | |
Efficient Switching in Mixed-Integer Predictive Control for a Three-Phase Electric Arc Furnace |
|
Dinh, Minh Tuan | LCIS, Grenoble INP, UGA |
Prodan, Ionela | Grenoble Institute of Technology (Grenoble INP) - Esisar |
Lesage, Olivier | Eramet Ideas |
Mendes, Eduardo | LCIS - Grenoble INP |
Keywords: Control applications, Power systems, Process Control
Abstract: This article proposes an efficient strategy for the real-time implementation of mixed-integer model predictive control (MI-MPC) in the context of an Electric Arc Furnace (EAF) application. Based on the EAF dynamical model and the discrete nature of the control inputs, we formulate mixed-integer programming problems within a predictive control framework to design the transformer power control system. To address the computational challenges of the MIP approaches, we introduce a model-informed optimal rounding strategy, which approximates the solution to the MI-MPC problem, delivering a sub-optimal solution that satisfies system constraints, enhances tracking performance and reduces computation time. The proposed MPC implementations are compared and evaluated using a Software-in-the-Loop (SIL) EAF simulator.
|
|
10:51-10:54, Paper ThA01.18 | |
Robust Data-Driven Predictive Run-To-Run Control for Automated Serial Sectioning |
|
Oakley, Rhianna | University of New Mexico |
Polonsky, Andrew | Sandia National Laboratories |
Chao, Paul | Sandia National Laboratories |
Danielson, Claus | University of New Mexico |
Keywords: Iterative learning control, Process Control, Optimization
Abstract: This paper presents a one-step predictive run-to-run controller (R2R-MPC) for the automation of mechanical serial sectioning (MSS), a destructive material analysis process. To address the inherent uncertainty and disturbances in the MSS process, a robust closed-loop approach is presented. The robust R2R-MPC models the uncertainty of the MSS process using a linear differential inclusion. As an analytical model of the MSS process is unavailable, the differential inclusion is identified from historical data. The R2R-MPC is posed as an optimization problem that computes incremental changes to the control input which minimize the worst-case material removal errors. This optimization-based controller is combined with a run-to-run controller to provide integral action that rejects constant disturbances and tracks constant reference removal rates. To demonstrate the efficacy of our robust R2R-MPC, we present simulation results which compare the presented controller with a conventional non-robust R2R.
|
|
10:54-10:57, Paper ThA01.19 | |
Layered Nonlinear Model Predictive Control for Robust Stabilization of Hybrid Systems |
|
Olkin, Zachary | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Predictive control for nonlinear systems, Stability of hybrid systems, Robotics
Abstract: Computing the receding horizon optimal control of nonlinear hybrid systems is typically prohibitively slow, limiting real-time implementation. To address this challenge, we propose a layered Model Predictive Control (MPC) architecture for robust stabilization of hybrid systems. A high level “hybrid” MPC is solved at a slow rate to produce a stabilizing hybrid trajectory, potentially sub-optimally, including a domain and guard sequence. This domain and guard sequence is passed to a low level “fixed mode” MPC which is a traditional, time-varying, state-constrained MPC that can be solved rapidly, e.g., using nonlinear programming (NLP) tools. A robust version of the fixed mode MPC is constructed by using tracking error tubes that are not guaranteed to have finite size for all time. Using these tubes, we demonstrate that the speed at which the fixed mode MPC is re-calculated is directly tied to the robustness of the system, thereby justifying the layered approach. Finally, simulation examples of a five link bipedal robot and a controlled nonlinear bouncing ball are used to illustrate the formal results.
|
|
10:57-11:00, Paper ThA01.20 | |
Spatially Temporally Distributed Informative Path Planning for Multi-Robot Systems |
|
Nguyen, Binh | Texas A&M University-Corpus Christi |
Nguyen, Linh | Federation University Australia |
Nghiem, Truong X. | University of Central Florida |
La, Hung | University of Nevada |
Baca, Jose | Texas A&M University-Corpus Christi |
Rangel, Pablo | Texas A&M University-Corpus Christi |
Cid Montoya, Miguel | Clemson University |
Nguyen, Thang | Texas A&M University-Corpus Christi |
Keywords: Autonomous robots, Sensor networks, Control applications
Abstract: This paper investigates the problem of informative path planning for a mobile robotic sensor network in spatially temporally distributed mapping. The robots are able to gather noisy measurements from an area of interest during their movements to build a Gaussian process (GP) model of a spatio-temporal field. The model is then utilized to predict the spatio-temporal phenomenon at different points of interest. To spatially and temporally navigate the group of robots so that they can optimally acquire maximal information gains while their connectivity is preserved, we propose a novel multi-step prediction informative path planning optimization strategy employing our newly defined local cost functions. By using the dual decomposition method, it is feasible and practical to effectively solve the optimization problem in a distributed manner. The proposed method was validated through synthetic experiments utilizing real-world data sets.
|
|
ThA02 |
Plaza DE |
RI - Learning and Optimization |
RI Session |
Chair: Ito, Yuji | Toyota Central R&D Labs., Inc |
Co-Chair: Xu, Zhe | Arizona State University |
|
10:00-10:03, Paper ThA02.1 | |
On Generating Explanations for Reinforcement Learning Policies: An Empirical Study |
|
Yuasa, Mikihisa | University of Illinois Urbana-Champaign |
Tran, Huy | University of Illinois at Urbana-Champaign |
Sreenivas, Ramavarapu S. | Univ. of Illinois |
Keywords: Reinforcement learning, Intelligent systems, Robotics
Abstract: Understanding a reinforcement learning policy, which guides state-to-action mappings to maximize rewards, necessitates an accompanying explanation for human comprehension. In this paper, we introduce a set of linear temporal logic formulae designed to provide explanations for policies, and an algorithm for searching through those formulae for the one that best explains a given policy. Our focus is on explanations that elucidate both the ultimate objectives accomplished by the policy and the prerequisite conditions it upholds throughout its execution. The effectiveness of our proposed approach is illustrated through a simulated game of capture-the-flag and a car-parking environment.
|
|
10:03-10:06, Paper ThA02.2 | |
Theoretical Analysis of Heteroscedastic Gaussian Processes with Posterior Distributions |
|
Ito, Yuji | Toyota Central R&D Labs., Inc |
Keywords: Machine learning, Stochastic systems, Uncertain systems
Abstract: This study introduces a novel theoretical framework for analyzing heteroscedastic Gaussian processes (HGPs) that identify unknown systems in a data-driven manner. Although HGPs effectively address the heteroscedasticity of noise in complex training datasets, calculating the exact posterior distributions of the HGPs is challenging, as these distributions are no longer multivariate normal. This study derives the exact means, variances, and cumulative distributions of the posterior distributions. Furthermore, the derived theoretical findings are applied to a chance-constrained tracking controller. After an HGP identifies an unknown disturbance in a plant system, the controller can handle chance constraints regarding the system despite the presence of the disturbance.
|
|
10:06-10:09, Paper ThA02.3 | |
Data-Driven Modeling for Optimal Control of Circadian Rhythms |
|
Wen, Yunshi | Rensselaer Polytechnic Institute |
Julius, Agung | Rensselaer Polytechnic Institute |
Keywords: Machine learning, Systems biology, Data driven control
Abstract: Existing methods for controlling circadian rhythms are based on standard mathematical models that are not personalized, as they are fitted to population-wide data. Moreover, the standard models are expressed in terms of variables, such as the core body temperature, which is not practically measurable. In this paper, we introduce a data-driven modeling approach that can use generic biometric measurements. We train a machine learning model with low-dimensional latent state representations and affine dynamics. Without any prior knowledge of the real system, we show that this model has long-horizon prediction power and outperforms traditional system identification methods. Applied to the optimal circadian entrainment problem, our model can calculate optimal control inputs that achieve over 80% of the performance of solutions computed directly from the ground truth model.
|
|
10:09-10:12, Paper ThA02.4 | |
Traffic-Aware Pedestrian Intention Prediction |
|
Orvati Nia, Fahimeh | University of Notre Dame |
Lin, Hai | University of Notre Dame |
Keywords: Machine learning, Traffic control
Abstract: Accurate pedestrian intention estimation is crucial for the safe navigation of autonomous vehicles (AVs) and hence attracts a lot of research attention. However, current models often fail to adequately consider dynamic traffic signals and contextual scene information, which are critical for realworld applications. This paper presents a Traffic-Aware SpatioTemporal Graph Convolutional Network (TA-STGCN) that integrates traffic signs and their states (Red, Yellow, Green) into pedestrian intention prediction. Our approach introduces the integration of dynamic traffic signal states and bounding box size as key features, allowing the model to capture both spatial and temporal dependencies in complex urban environments. The model surpasses existing methods in accuracy. Specifically, TA-STGCN achieves a 4.75% higher accuracy compared to the baseline model on the PIE dataset, demonstrating its effectiveness in improving pedestrian intention prediction.
|
|
10:12-10:15, Paper ThA02.5 | |
Demand Forecasting for Electric Vehicle Charging Stations Using Multivariate Time-Series Analysis |
|
Sanami, Saba | Concordia University |
Mosalli, Hesamoddin | Concordia University |
Yang, Yu | California State University Long Beach |
Yeh, Hen-Geul | California State University, Long Beach |
Aghdam, Amir G. | Concordia University |
Keywords: Machine learning, Energy systems
Abstract: As the number of electric vehicles (EVs) continues to grow, the demand for charging stations is also increasing, leading to challenges such as long wait times and insufficient infrastructure. High-precision forecasting of EV charging demand is crucial for efficient station management, to address some of these challenges. This paper presents an approach to predict the charging demand at 15-minute intervals for the day ahead using a multivariate long short-term memory (LSTM) network with an attention mechanism. Additionally, the model leverages explainable AI techniques to evaluate the influence of various factors on the predictions, including weather conditions, day of the week, month, and any holiday. SHapley Additive exPlanations (SHAP) are used to quantify the contribution of each feature to the final forecast, providing deeper insights into how these factors affect prediction accuracy. As a result, the framework offers enhanced decision-making for infrastructure planning. The efficacy of the proposed method is demonstrated by simulations using the test data collected from the EV charging stations at California State University, Long Beach.
|
|
10:15-10:18, Paper ThA02.6 | |
High Precision Position Estimation of a Centimeter Scaled Robot Using Deep Learning and an Actively Controlled Magnetic Field |
|
Pushpalayam, Navaneeth | University of Minnesota |
Alexander, Lee | University of Minnesota |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Mechanical systems/robotics, Mechatronics, Sensor fusion
Abstract: This paper develops a 2D position estimation system for a robot moving over a plane using an actively controlled magnetic field. The position estimation system consists of two magnetic sensors on the moving robot and an actively controlled external rotating permanent magnet. The orientation of the external magnet is controlled to roughly point at the robot and a narrow localized magnetic field map is developed using deep learning around the pointing direction of the magnet. Using the magnetic fields at both sensors, the radial and polar positions of the robot are estimated using an unscented Kalman filter. The orientation of the magnet is then more finely controlled to point precisely at one of the magnetic sensors. This enables the design of an asymptotically stable nonlinear observer that provides enhanced accuracy in the radial position estimation of the robot. Experimental results are presented on the performance of the estimation system for a robot moving in a 2D plane.
|
|
10:18-10:21, Paper ThA02.7 | |
Trajectory-Based Automata Learning for Offline Reinforcement Learning |
|
Meshkat Alsadat, Shayan | Arizona State University |
Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning, Automata, Machine learning
Abstract: Offline reinforcement learning (RL) learns a policy from a fixed batch of data. However, the overestimation of the values rooted in the out-of-distribution actions limits the applicability of offline RL. It results in methods constraining or regularizing the learned policy based on the dataset. Hence, offline RL algorithms try to tackle this issue by adding a secondary component, e.g., new hyperparameters or generative models. Our proposed method aims to tackle this problem by taking a new perspective using two approaches on the offline RL. We tend to use deterministic finite automoton (DFA) to learn Offline-DFA (offline deterministic finite automaton) or ARM-DFA (association rule mining DFA) from the trajectories in the dataset without implementing any secondary component or constraining or regularizing the learned policy. This means that the learned policy does not face out-of-distribution actions. We use the Offline-DFA (or ARM-DFA) to guide the policy learning process based on the dataset’s trajectories. We propose a novel method called automata learning for offline RL with q-learning (ALOQ), which implements this perspective into the offline RL. We show the convergence guarantee of our proposed method to an optimal policy. In practice, Offline- DFA (or ARM-DFA) is used to learn the ground truth task, i.e., encoded by a reward machine, which allows the agent to learn an optimal policy. We demonstrate the performance of our proposed method by comparing it to the existing offline RL methods, converging to an optimal policy faster than the existing offline RL methods.
|
|
10:21-10:24, Paper ThA02.8 | |
Canonical Form of Datatic Description in Control Systems |
|
Zhan, Guojian | Tsinghua University |
Zheng, Ziang | Tsinghua University |
Li, Shengbo Eben | Tsinghua University |
Keywords: Reinforcement learning, Control applications, Neural networks
Abstract: The design of feedback controllers is undergoing a paradigm shift from modelic (i.e., model-driven) control to datatic (i.e., data-driven) control. Canonical form of state space model is an important concept in modelic control systems, exemplified by Jordan form, controllable form and observable form, whose purpose is to facilitate system analysis and controller synthesis. In the realm of datatic control, there is a notable absence in the standardization of data-based system representation. This paper for the first time introduces the concept of textit{canonical data form} for the purpose of achieving more effective design of datatic controllers. In a control system, the data sample in canonical form consists of a textit{transition} component and an textit{attribute} component. The former encapsulates the plant dynamics at the sampling time independently, which is a tuple containing three elements: a state, an action and their corresponding next state. The latter describes one or some artificial characteristics of the current sample, whose calculation must be performed in an online manner. The attribute of each sample must adhere to two requirements: (1) causality, ensuring independence from any future samples; and (2) locality, allowing dependence on historical samples but constrained to a finite neighboring set. The purpose of adding attribute is to offer some kinds of benefits for controller design in terms of effectiveness and efficiency. To provide a more close-up illustration, we present two canonical data forms: temporal form and spatial form, and demonstrate their advantages in reducing instability and enhancing training efficiency in two datatic control systems.
|
|
10:24-10:27, Paper ThA02.9 | |
Exploiting Adjacent Similarity in Multi-Armed Bandit Tasks Via Transfer of Reward Samples |
|
Nr, Rahul | Indian Institute of Science Bengaluru |
Katewa, Vaibhav | Indian Institute of Science Bangalore |
Keywords: Reinforcement learning, Machine learning, Statistical learning
Abstract: We consider a sequential multi-task problem, where each task is modeled as the stochastic multi-armed bandit with K arms. We assume the bandit tasks are adjacently similar in the sense that the difference between the mean rewards of the arms for any two consecutive tasks is bounded by a parameter. We propose two algorithms (one assumes the parameter is known while the other does not) based on UCB to transfer reward samples from preceding tasks to improve the overall regret across all tasks. Our analysis shows that transferring samples reduces the regret as compared to the case of no transfer. We provide empirical results for our algorithms, which show performance improvement over the standard UCB algorithm without transfer and a naive transfer algorithm
|
|
10:27-10:30, Paper ThA02.10 | |
Boosting Exploration in Reinforcement Learning for Sparse Reward Tasks |
|
Zhang, Yuhang | Tsinghua University |
Lyu, Yao | Tsinghua University |
Zhan, Guojian | Tsinghua University |
Zou, Wenjun | Tsinghua University |
Li, Shengbo Eben | Tsinghua University |
Keywords: Reinforcement learning, Machine learning, Neural networks
Abstract: Effective exploration is critical for reinforcement learning (RL) to understand the environment and achieve high performance, especially in sparse reward tasks. Existing methods often depend on task-specific prior knowledge for exploration guidance, which is unavailable in complex tasks, or rely on additional policy objectives to encourage exploration, which leads to sub-optimal solutions. This paper addresses these limitations by introducing a dual-policy guided exploration (DPE) mechanism, which learns an extra policy to promote the sample collection in unfamiliar state areas. In this mechanism, we define a curiosity signal using the random network distillation technique, which evaluates the state familiarity to the agent. Then, a separate courage policy is trained by maximizing the accumulated discounted curiosity signal, which aims to explore unfamiliar areas and discover potential high-value behaviors. By incorporating this mechanism into the distributional RL framework, we propose a Distributional Soft Actor-Critic algorithm for Sparse reward tasks (DSAC-S), which comprises three modules: dual-policy guided exploration, curiosity signal learning, and actor-critic training. We validate its effectiveness in several sparse reward tasks, including MountainCar, Sparse HalfCheetah, and IDSim. The results demonstrate that DSAC-S successfully conquers these tasks with its enhanced exploration ability, while the two baselines either fail to solve these problems or exhibit poor performance.
|
|
10:30-10:33, Paper ThA02.11 | |
Few-Shot Learning-Enhanced Tiered Path Planning for Mars Rover Navigation |
|
Wang, Ziyi | Purdue University |
Yu, Di | Purdue University |
Khalilzadeh Fath, Mina | Missouri University of Science and Technology |
Pei, Chaoying | Missouri University of Science and Technology |
Keywords: Autonomous systems, Machine learning, Aerospace
Abstract: Path planning for Mars rovers presents significant challenges due to the diverse terrain, ranging from easily navigable areas to hazardous zones. Traditional methods typically classify terrain simply as passable or impassable, failing to account for the nuances of more moderately challenging areas. In this paper, we introduce a tiered terrain-aware path planning strategy, employing few-shot learning to classify and segment Martian terrain into levels of difficulty. The few-shot learning model, trained on Earth, is sent to the rover, enabling real-time processing of images from satellites or helicopters. The flexibility of few-shot learning, which requires minimal data and training time, enables quick updates and redeployment of the policy when necessary. Then, a modified A* path planning algorithm is proposed to generate paths on the tiered terrain maps. This algorithm takes into account the classified terrain tiers, allowing the rover to dynamically adjust its path based on real-time assessments. By integrating few-shot learning with the modified A* algorithm, the rover is equipped to make real-time intelligent decisions, enhancing its ability to navigate complex terrains effectively. Simulation results demonstrate the rover’s enhanced capability to navigate complex terrains, illustrating the effectiveness and flexibility of this integrated approach.
|
|
10:33-10:36, Paper ThA02.12 | |
Fusing Multiple Algorithms for Heterogeneous Online Learning |
|
Gadginmath, Darshan | University of California, Riverside |
Tripathi, Shivanshu | University of California, Riverside |
Pasqualetti, Fabio | University of California, Riverside |
Keywords: Optimization algorithms, Optimization, Distributed parameter systems
Abstract: This paper addresses the challenge of online learning in contexts where agents accumulate disparate data and use different local learning algorithms due to resource constraints. We introduce the Switched Online Learning Algorithm (SOLA), designed to solve the heterogeneous online learning problem by fusing updates from diverse agents through a dynamic switching mechanism contingent upon their respective performance and available resources. We theoretically analyze the design of the selecting mechanism to ensure that the regret of SOLA is bounded. Our findings show that the number of changes in selection needs to be bounded by a parameter dependent on the performance of the different local algorithms. Additionally, two numerical experiments are presented to emphasize the effectiveness of SOLA, first on an online linear regression problem and then on an online classification problem with the MNIST dataset.
|
|
10:36-10:39, Paper ThA02.13 | |
GP-Enhanced Autonomous Drifting Framework Using ADMM-Based ILQR |
|
Xie, Yangyang | Zhejiang University |
Hu, Cheng | Zhejiang University |
Baumann, Nicolas | ETH Zurich |
Ghignone, Edoardo | ETH Zurich |
Magno, Michele | ETH Zurich |
Xie, Lei | Zhejiang University |
Keywords: Automotive control, Predictive control for nonlinear systems, Machine learning
Abstract: Autonomous drifting is a complex challenge due to the highly nonlinear dynamics and the need for precise real-time control, especially in uncertain environments. To address these limitations, this paper presents a hierarchical control framework for autonomous vehicles drifting along general paths, primarily focusing on addressing model inaccuracies and mitigating computational challenges in real-time control. The framework integrates Gaussian Process (GP) regression with an Alternating Direction Method of Multipliers (ADMM)-based iterative Linear Quadratic Regulator (iLQR). GP regression effectively compensates for model residuals, improving accuracy in dynamic conditions. ADMM-based iLQR not only combines the rapid trajectory optimization of iLQR but also utilizes ADMM's strength in decomposing the problem into simpler sub-problems. Simulation results demonstrate the effectiveness of the proposed framework, with significant improvements in both drift trajectory tracking and computational efficiency. Our approach reduced the RMSE lateral error by 38% and achieved a computation time approximately 25% of that required by IPOPT.
|
|
10:39-10:42, Paper ThA02.14 | |
Control-Aware Trajectory Prediction for Communication-Free Drone Swarm Coordination in Cluttered Environments |
|
Yan, Longhao | National University of Singapore |
Zhou, Jingyuan | National University of Singapore |
Yang, Kaidi | National University of Singapore |
Keywords: Autonomous robots, Distributed control, Machine learning
Abstract: Swarms of Unmanned Aerial Vehicles (UAVs) have demonstrated enormous potential in many industrial and commercial applications. However, before deploying UAVs in the real world, it is essential to ensure they can operate safely in complex environments, particularly where communication is not available. To address this challenge, we propose a control-aware learning-based trajectory prediction algorithm that can enable communication-free UAV swarm control in a cluttered environment. Specifically, our proposed algorithm can enable each UAV to predict the planned trajectories of its neighbors in scenarios without communication. The predicted planned trajectories will serve as input to a distributed model predictive control (DMPC) approach. The proposed algorithm combines (1) a trajectory prediction model based on EvolveGCN, a Graph Convolutional Network (GCN) that can handle dynamic graphs, and (2) a KKT-informed training approach that applies the Karush-Kuhn-Tucker (KKT) conditions in the training process to encode DMPC information into the trained neural network. We evaluate our proposed algorithm in a funnel-like environment. The results show that the proposed algorithm outperforms state-of-the-art benchmarks, providing close-to-optimal control performance even when communication is unavailable.
|
|
10:42-10:45, Paper ThA02.15 | |
Gradient Flow Approximations in Temporal Difference Learning |
|
Neshaei Moghaddam, Amirreza | UCLA |
Gharesifard, Bahman | Queen's University |
Keywords: Machine learning, Reinforcement learning, Algebraic/geometric methods
Abstract: We consider the continuous-time temporal difference (TD) learning dynamics with nonlinear value function approximations, where there is a slim understanding of the convergence properties in irreversible regimes. Motivated by Krener's linearization idea ala Lie-brackets, we obtain conditions on the approximating value function and irreversibility coefficients under which the TD dynamics behaves close to a gradient flow. We show that our conditions lead to a set of partial differential equations, and study the existence of solutions using the algebraic invertibility of differential operators. Whenever a solution exists, using a perturbation analysis, we provide a stability result for nonlinear TD dynamics. As a by-product, we state the implications of the results for the classical case of linear approximations, where our conditions are algebraic, and easily verifiable.
|
|
10:45-10:48, Paper ThA02.16 | |
Maximum a Posteriori Least-Squares Temporal Difference |
|
van Zuijlen, Roy | Eindhoven University of Technology |
Antunes, Duarte | Eindhoven University of Technology, the Netherlands |
Keywords: Reinforcement learning, Optimal control, Statistical learning
Abstract: Least-squares Temporal Difference (LSTD) is a model-free method to estimate the cost/value function of a Markov Decision Process (MDP). Interestingly, it is equivalent to estimating the state transition probabilities of the MDP through Maximum Likelihood (ML) and computing the cost/value function with (model-based) policy evaluation. However, LSTD does not incorporate prior knowledge of the state transition probabilities of the MDP. This paper proposes a data-based method, coined MAP-LSTD, that incorporates such prior knowledge without explicitly estimating the model. This method is analogous to LSTD in the sense that it is equivalent to estimating the state transition probabilities (model) through Maximum a Posteriori (MAP) instead of ML and computing the cost function with policy evaluation. Moreover, as LSTD, it can be implemented recursively. The key to our method is to model the unknown state distributions with Dirichlet distributions and encapsulate prior knowledge in the parameters of these distributions. An important concomitant is that the prior can be obtained from previous data samples and/or simulated samples, leveraging a model/simulator. Through an example, we illustrate how our method can speed up the convergence of the estimated cost function and reduce the estimation error significantly by exploiting prior knowledge derived from a model/simulator.
|
|
10:48-10:51, Paper ThA02.17 | |
Policy Optimization for PDE Control with a Warm Start |
|
Zhang, Xiangyuan | University of Illinois at Urbana-Champaign |
Mowlavi, Saviz | Mitsubishi Electric Research Laboratories |
Benosman, Mouhacine | Mitsubishi Electric Research Laboratories |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Reinforcement learning, Reduced order modeling, Distributed parameter systems
Abstract: Dimensionality reduction is crucial for controlling nonlinear partial differential equations (PDE) through a ``reduce-then-design'' strategy, which identifies a reduced-order model and then implements model-based control solutions. However, inaccuracies in the reduced-order modeling can substantially degrade controller performance, especially in PDEs with chaotic behavior. To address this issue, we augment the reduce-then-design procedure with a policy optimization (PO) step. The PO step fine-tunes the model-based controller to compensate for the modeling error from dimensionality reduction. This augmentation shifts the overall strategy into reduce-then-design-then-adapt, where the model-based controller serves as a warm start for PO. Specifically, we study the state-feedback tracking control of PDEs that aims to align the PDE state with a specific constant target subject to a linear-quadratic cost. Through extensive experiments, we show that a few iterations of PO can significantly improve the model-based controller performance. Our approach offers a cost-effective alternative to PDE control using end-to-end reinforcement learning.
|
|
10:51-10:54, Paper ThA02.18 | |
Learning Clusters of Partially Observed Linear Dynamical Systems |
|
Rui, Maryann | Massachusetts Institute of Technology |
Dahleh, Munther A. | Massachusetts Inst. of Tech |
Keywords: Identification, Statistical learning, Linear systems
Abstract: We study the problem of learning clusters of partially observed linear dynamical systems from multiple input-output trajectories. This setting is particularly relevant when there are limited observations (e.g., short trajectories) from individual data sources, making direct estimation challenging. In such cases, incorporating data from multiple related sources can improve learning. We propose an estimation algorithm that leverages different data requirements for the tasks of clustering and system identification. First, short impulse responses are estimated from individual trajectories and clustered. Then, refined models for each cluster are jointly estimated using multiple trajectories. We establish end-to-end finite sample guarantees for estimating Markov parameters and state space realizations and highlight trade-offs among the number of observed systems, the trajectory lengths, and the complexity of the underlying models.
|
|
10:54-10:57, Paper ThA02.19 | |
Online Reinforcement Learning with Passive Memory |
|
Pattanaik, Anay | University of Illinois, Urbana Champaign |
Varshney, Lav R. | University of Illinois at Urbana-Champaign |
Keywords: Reinforcement learning
Abstract: This paper considers an online reinforcement learning algorithm that leverages pre-collected data (passive memory) from the environment for online interaction. We show that using passive memory improves performance and further provide theoretical guarantees for regret that turns out to be near-minimax optimal. Results show that quality of passive memory determines sub-optimality of the incurred regret. The proposed approach and results hold in both continuous and discrete state-action spaces.
|
|
10:57-11:00, Paper ThA02.20 | |
Generalizable Spacecraft Trajectory Generation Via Multimodal Learning with Transformers |
|
Celestini, Davide | Politecnico Di Torino |
Afsharrad, Amirhossein | Stanford University |
Gammelli, Daniele | Stanford University |
Guffanti, Tommaso | Stanford University |
Zardini, Gioele | Massachusetts Institute of Technology |
Lall, Sanjay | Stanford University |
Capello, Elisa | Politecnico Di Torino, CNR-IEIIT |
D'Amico, Simone | Stanford University |
Pavone, Marco | Stanford University |
Keywords: Machine learning, Optimal control, Aerospace
Abstract: Effective trajectory generation is essential for reliable on-board spacecraft autonomy. Among other approaches, learning-based warm-starting represents an appealing paradigm for solving the trajectory generation problem, effectively combining the benefits of optimization- and data-driven methods. Current approaches for learning-based trajectory generation often focus on fixed, single-scenario environments, where key scene characteristics, such as obstacle positions or final-time requirements, remain constant across problem instances. However, practical trajectory generation requires the scenario to be frequently reconfigured, making the single-scenario approach a potentially impractical solution. To address this challenge, we present a novel trajectory generation framework that generalizes across diverse problem configurations, by leveraging high-capacity transformer neural networks capable of learning from multimodal data sources. Specifically, our approach integrates transformer-based neural network models into the trajectory optimization process, encoding both scene-level information (e.g., obstacle locations, initial and goal states) and trajectory-level constraints (e.g., time bounds, fuel consumption targets) via multimodal representations. The transformer network then generates near-optimal initial guesses for non-convex optimization problems, significantly enhancing convergence speed and performance. The framework is validated through extensive simulations and real-world experiments on a free-flyer platform, achieving up to 30% cost improvement and 80 % reduction in infeasible cases with respect to traditional approaches, and demonstrating robust generalization across diverse scenario variations.
|
|
ThA03 |
Plaza CF |
RI - Networked and Multiagent Systems |
RI Session |
Chair: Tegling, Emma | Lund University |
Co-Chair: Fregene, Kingsley C. | Lockheed Martin |
|
10:00-10:03, Paper ThA03.1 | |
Resilient Leader-Follower Consensus with Multi-Hop Communication |
|
Yuan, Liwei | Hunan University |
Ishii, Hideaki | University of Tokyo |
Keywords: Network analysis and control, Agents-based systems, Fault tolerant systems
Abstract: We study the problem of resilient leader-follower consensus in multi-agent systems (MASs) where some of the agents may malfunction. The objective is for nonfaulty agents to reach consensus on a reference value broadcast by leader agents despite possible misbehaviors of adversarial agents. To this end, we utilize the multi-hop weighted mean subsequence reduced (MW-MSR) algorithm to achieve the goal in multi-agent networks with directed topologies. We characterize a necessary and sufficient condition on graph structures for our algorithm to succeed, which is expressed in a novel notion of robust following graphs. With one-hop communication, our condition is tighter than the ones in the related resilient leader-follower consensus works. With multi-hop communication, we can have an even more relaxed graph condition for our algorithm to succeed. Lastly, we present numerical examples to verify the effectiveness of our algorithm.
|
|
10:03-10:06, Paper ThA03.2 | |
Resilient Distributed Vector Consensus under Dynamic State Imprecision |
|
Lee, Christopher | University of Texas at Dallas |
Abbas, Waseem | University of Texas at Dallas |
Keywords: Network analysis and control, Control of networks, Cooperative control
Abstract: In this paper, we study the resilient distributed vector consensus problem in networks where agents have imprecise knowledge of their neighbors' states and must operate in the presence of adversarial agents. Unlike existing methods that assume exact state observations, we consider a more realistic scenario where each normal agent's knowledge of its neighbors' states is restricted to a bounded region in mathbb{R}^d, referred to as the imprecision region. The size of this region reflects the degree of uncertainty and dynamically shrinks as agents update their states and approach consensus. We analyze the required decrease in the imprecision region at each iteration to ensure consensus, even in the presence of adversarial agents and state imprecision. By leveraging novel geometric insights, we present a method to ensure that each normal agent can consistently find a point within the convex hull of its normal neighbors' true states, despite not knowing these states exactly. We also provide numerical evaluations to demonstrate the practical effectiveness of our approach in distributed optimization contexts.
|
|
10:06-10:09, Paper ThA03.3 | |
Transient Control of Linear Multi-Agent Systems with Leader-Follower Configuration |
|
Liu, Siyuan | KTH Royal Institute of Technology |
Chen, Fei | University of California, San Diego |
Dimarogonas, Dimos V. | KTH Royal Institute of Technology |
Keywords: Agents-based systems, Autonomous systems, Cooperative control
Abstract: This paper presents a distributed control framework for leader-follower multi-agent systems with general linear dynamics to achieve consensus alongside prescribed transient performance. The multi-agent system is in a leader-follower configuration with only a set of agents selected as leaders and controlled via external inputs. In particular, we propose a distributed control framework comprising a prescribed performance controller for the leaders, while the followers are governed solely by a consensus protocol. When the decay rate of the performance functions is within a sufficient bound, we show that this distributed control law achieves consensus for the entire multi-agent system, with the guarantee that the trajectories of the consensus errors remain bounded by a time-varying prescribed transient performance function. Finally, the proposed results are illustrated through numerical examples.
|
|
10:09-10:12, Paper ThA03.4 | |
A Two-Stage Mechanism for Prioritized Trajectory Planning in Multi-Agent Systems |
|
Chen, Yu-Wen | University of California, Berkeley |
Kizilkale, Can | University of California Berkeley, LBL |
Arcak, Murat | University of California, Berkeley |
Keywords: Agents-based systems, Game theory, Distributed control
Abstract: In multi-agent systems with coupled objectives and/or constraints, agents may misreport information to achieve individual gains. This issue is exacerbated when agents possess local decision-making power, such as in multi-agent trajectory planning, where the increased autonomy amplifies individual benefits at the expense of a higher social cost. To overcome this problem, we leverage the Vickrey-Clarke-Grove (VCG) framework and propose a strategyproof, two-stage mechanism. We further extend this mechanism to prioritized planning and prevent agents from manipulating their priority.
|
|
10:12-10:15, Paper ThA03.5 | |
Consensus Problem of Asymmetric Higher-Order Interaction Multi-Agent Networks |
|
Wei, Haoyu | Shanghai Jiao Tong University |
Pan, Lulu | Shanghai Jiao Tong University |
Shao, Haibin | Shanghai Jiao Tong University |
Wang, Peng | Shanghai Jiao Tong University |
Keywords: Autonomous systems, Agents-based systems
Abstract: This paper examines the consensus problem of multi-agent systems with asymmetric higher-order interactions amongst neighboring agents, where the asymmetric matrix-valued inter-agent coupling mechanism is employed to model the higher-order interactions amongst neighboring agents, and the associated pairwise interaction is also supposed to be irreciprocal. First, we reveal the fundamental mechanism of how asymmetric higher-order interactions influence the error elimination between a pair of neighboring agents according to the asymmetric/skew-symmetric decomposition of square matrices. Second, we provide sufficient conditions to guarantee the convergence of asymmetric HOI networks from the network topology perspective. By examining the null space of the Laplacian matrix associated with asymmetric HOI networks, we finally present necessary/sufficient conditions for achieving average consensus on asymmetric HOI networks.
|
|
10:15-10:18, Paper ThA03.6 | |
A Formation Based Multi-Agent Receding Horizon Control Method for Signal Strength Model Estimation |
|
Zhu, Yancheng | Boston University |
Andersson, Sean B. | Boston University |
Keywords: Optimal control, Estimation, Autonomous systems
Abstract: This paper considers the problem of localizing a set of nodes in a wireless sensor network where both the node positions and communication model parameters are unknown. We assume that a multi-agent system moves in formation through the environment, taking measurements of the Received Signal Strength, and seek a controller that optimizes a performance metric based on the Fisher Information Matrix. We propose a two-stage formation-based receding horizon approach that alternates between estimating the parameters and determining where to move and how to scale the formation to maximally inform the estimation problem. We apply a Dynamic Programming approach to solve the multi-stage look ahead control problem of the first stage, followed by a Particle Swarm Optimization algorithm to determine the best formation configuration in the second stage. We demonstrate our approach using different formation structures and compare it against multiple baselines.
|
|
10:18-10:21, Paper ThA03.7 | |
Distributed Adaptive Consensus with Obstacle and Collision Avoidance for Networks of Heterogeneous Multi-Agent Systems |
|
Koulong, Armel | University of Alabama |
Pakniyat, Ali | University of Alabama |
Keywords: Cooperative control, Agents-based systems, Adaptive control
Abstract: This paper presents a distributed adaptive control strategy for multi-agent systems with heterogeneous time-varying nonlinear dynamics, integrating obstacle and collision avoidance mechanisms. We propose an adaptive control strategy designed to ensure leader-following formation consensus while effectively managing collision and obstacle avoidance using potential functions. By integrating neural network (NN) based estimation and adaptive tuning laws, the proposed strategy ensures consensus, safety and stability in leader-following formations under fixed topologies. The NNs enhance the adaptability of the proposed architecture, enabling agents to learn and adjust to environmental and system changes.
|
|
10:21-10:24, Paper ThA03.8 | |
Distributed Bipartite Formation Control with Output Regulation for Heterogeneous Multi-Agent Systems |
|
Chiang, Ming-Li | National Taiwan Ocean University |
Chen, Jian-Mou | National Taiwan Ocean University Department of Electrical Engine |
Chen, Zhengyu | National Taiwan University |
Keywords: Cooperative control, Distributed control, Output regulation
Abstract: In this paper, we investigate the bipartite formation tracking control problem with output regulation for heterogeneous multi-agent systems (HMAS) under distributed communication. The HMAS consists of linear and nonlinear agents, where we utilize a leader-follower structure and the properties of the Laplacian matrix to design the formation controller. A distributed observer is designed to address the issue about the lack of leader state information. We propose a consensus based control protocol to achieve bipartite formation and tracking of the desired reference trajectory. System stability is verified using Lyapunov stability theory and asymptotic stability is guaranteed under the basic output regulation requirements. Some simulation results are provided to demonstrate the feasibility and effectiveness of the designed controller.
|
|
10:24-10:27, Paper ThA03.9 | |
Cooperative Multi-Agent Constrained Stochastic Linear Bandits |
|
Afsharrad, Amirhossein | Stanford University |
Oftadeh, Parisa | University of California Santa Cruz |
Moradipari, Ahmadreza | University of California Santa Barbara |
Lall, Sanjay | Stanford University |
Keywords: Cooperative control, Optimization, Human-in-the-loop control
Abstract: In this study, we explore a collaborative multi-agent stochastic linear bandit setting involving a network of (N) agents that communicate locally to minimize their collective regret while keeping their expected cost under a specified threshold (tau). Each agent encounters a distinct linear bandit problem characterized by its own reward and cost parameters, i.e., local parameters. The goal of the agents is to determine the best overall action corresponding to the average of these parameters, or so-called global parameters. In each round, an agent is randomly chosen to select an action based on its current knowledge of the system. This chosen action is then executed by all agents, then they observe their individual rewards and costs. We propose a safe distributed upper confidence bound algorithm, so called textit{MA-OPLB}, and establish a high probability bound on its (T)-round regret. MA-OPLB utilizes an accelerated consensus method, where agents can compute an estimate of the average rewards and costs across the network by communicating the proper information with their neighbors. We show that our regret bound is of order mathcal{O}left(frac{d}{tau-c_0}frac{log(NT)^2}{sqrt{ N}}sqrt{frac{T}{log(1/|lambda_2|)}}right), where lambda_2 is the second largest (in absolute value) eigenvalue of the communication matrix, and tau-c_0 is the known cost gap of a feasible action. We also experimentally show the performance of our proposed algorithm in different network structures.
|
|
10:27-10:30, Paper ThA03.10 | |
Density-Driven Formation Control of a Multi-Agent System with an Application to Search-And-Rescue Missions |
|
Afrazi, Mohammad | New Mexico Institute of Mining and Technology |
Seo, Sungjun | New Mexico Institute of Mining and Technology |
Lee, Kooktae | New Mexico Tech |
Keywords: Decentralized control, Cooperative control, Agents-based systems
Abstract: In this paper, a novel Density-Driven Formation Control (D2FC) framework is presented for the formation control of swarm robots, driven by a given reference density map. This work is specifically designed for search and rescue (SAR) missions. Leveraging principles from optimal transport theory and decentralized control, our approach enables a swarm of agents to distribute themselves according to a given density distribution, focusing on high-priority areas without the need for centralized coordination. By integrating the virtual leader paradigm and formation, the swarm adapts efficiently to cover areas of interest. To improve the area coverage efficiency for achieving high detection rate of victims on a disaster site, a new information update model is proposed. Simulation results demonstrate that swarms utilizing the proposed D2FC method significantly outperform uncoordinated swarms in victim detection across various scenarios. The proposed approach enhances the effectiveness of SAR operations and can be extended to other applications requiring coordinated multi-agent systems.
|
|
10:30-10:33, Paper ThA03.11 | |
Multi-Agent Causal Dynamics Learning for Temporally Extended Tasks with Reward Machine Inference |
|
Partovi Aria, Hadi | Arizona State University |
Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning, Agents-based systems, Machine learning
Abstract: We introduce an approach called Multi-Agent State Abstraction with Causal dynamics and Reward Machine Learning (Multi-SACReM), designed to enhance the efficiency of reinforcement learning (RL) in multi-agent environments. By integrating causal information through state abstraction, Multi-SACReM enables agents to learn more effectively in complex, dynamic settings. In our framework, each agent employs a reward machine, to optimize its behavior. Multi-SACReM identifies and eliminates redundant causal relationships, allowing agents to focus on essential interactions and optimize policies accordingly. Our approach works in both decentralized settings, where agents learn independently using local observations, and centralized RL environments. In centralized RL, a controller coordinates agents using global information to optimize collective behavior. We evaluate Multi-SACReM through case studies comparing decentralized versus centralized approaches. Results show Multi-SACReM effectively learns causal models with reward machines, comparing decentralized versus centralized approaches in multi-agent settings.
|
|
10:33-10:36, Paper ThA03.12 | |
Linear Quadratic Regulator of Switched Multi-Agent Systems |
|
Wu, Guangyu | Tongji University |
Keywords: Switched systems, Optimal control, Hybrid systems
Abstract: This paper formulates and addresses the linear quadratic regulator problem concerning asynchronously switched multi-agent systems comprising agents with different dynamics. The objective is to derive closed-form solutions that minimize an overall performance index instead of numerical solutions or sub-optimal solutions. The mixed-integer programming problem is transformed into a continuous optimization problem suitable for solutions based on the maximum principle. We decompose the co-optimization problem involving switching variables coupled with control inputs into multiple mutually independent sub-problems and derive analytical bang-bang type solutions for the switching variables. Additionally, the conditions for the existence of distributed solutions for optimal control and switching functions are presented. Finally, numerical examples are conducted to validate the effectiveness of the proposed method.
|
|
10:36-10:39, Paper ThA03.13 | |
Multi-Agent Reinforcement Learning in Non-Cooperative Stochastic Games Using Large Language Models |
|
Meshkat Alsadat, Shayan | Arizona State University |
Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning, Automata, Machine learning
Abstract: We study the use of large language models (LLMs) to integrate high-level knowledge in stochastic games using reinforcement learning with reward machines to encode non-Markovian and Markovian reward functions. In non- cooperative games, one challenge is to provide agents with knowledge about the task efficiently to speed up the convergence to an optimal policy. We aim to provide this knowledge in the form of deterministic finite automata (DFA) generated by LLMs(LLM-generated DFA). Additionally, we use reward machines (RMs) to encode the temporal structure of the game and the non-Markovian or Markovian reward functions. Our proposed algorithm, LLM-generated DFA for Multi-agent Reinforcement Learning with Reward Machines for Stochastic Games (StochQ-RM), can learn an equivalent reward machine to the ground truth reward machine (specified task) in the environment using the LLM-generated DFA. Additionally, we propose DFA-based q-learning with reward machines (DBQRM) to find the best responses for each agent using Nash equilibrium in stochastic games. Despite that the LLMs are known to hallucinate, we show that our method is robust and guaranteed to converge to an optimal policy. Furthermore, we study the performance of our proposed method in three case studies.
|
|
10:39-10:42, Paper ThA03.14 | |
An Algorithm for Distributed Computation of Reachable Sets for Multi-Agent Systems |
|
Thapliyal, Omanshu | Hitachi America Ltd |
Hwang, Inseok | Purdue University |
Clarke, Shanelle Gertrude | Purdue University |
Keywords: Networked control systems, Network analysis and control, Linear systems
Abstract: In this paper, we consider the problem of distributed reachable set computation for multi-agent systems (MASs) interacting over an undirected, stationary graph. A full state-feedback control input for such MASs depends no only on the current agent's state, but also of its neighbors. However, in most MAS applications, the dynamics are obscured by individual agents. This makes reachable set computation, in a fully distributed manner, a challenging problem. We utilize the ideas of polytopic reachable set approximation and generalize it to a MAS setup. We formulate the resulting sub-problems in a fully distributed manner and provide convergence guarantees for the associated computations. The proposed algorithm's convergence is proved for two cases: static MAS graphs, and time-varying graphs under certain restrictions.
|
|
10:42-10:45, Paper ThA03.15 | |
Optimal Formation Motion Planning and Control for Multi-UAVs Based on Deep Reinforcement Learning |
|
Xuan, Shuxing | University of Electronic Science and Technology of China |
Liang, Hongjing | University of Electronic Science and Technology of China |
Yang, Jin | University of Electronic Science and Technology of China |
Keywords: Optimal control, Reinforcement learning, Cooperative control
Abstract: Avoiding collisions between unmanned aerial vehicles (UAVs) during formation is a challenge in leaderfollower formation control. This paper presents a two-stage optimal formation control scheme. Firstly, an end-to-end motion planning method is designed for collision-free motion during the formation process, utilizing deep reinforcement learning. Specifically, an actor-critic network with two hidden layers is constructed, incorporating an obstacle avoidance module based on Bounding Volume Hierarchy (BVH) trees. A novel composite reward function, consisting of target penalty and safety penalty, is proposed to guide the training of the actor-critic network. Furthermore, a low-complexity consensus protocol is developed to synchronize the state of the followers and leaders, and the system is rigorously proven to be asymptotically stable. Finally, the effectiveness of the proposed method is validated through simulations involving a set of UAVs.
|
|
10:45-10:48, Paper ThA03.16 | |
Zeroth-Order Feedback Optimization in Multi-Agent Systems: Tackling Coupled Constraints |
|
Duan, Yingpeng | Peking University |
Tang, Yujie | Peking University |
Keywords: Optimization, Optimization algorithms, Cooperative control
Abstract: This paper investigates distributed zeroth-order feedback optimization in multi-agent systems with coupled constraints, where each agent operates its local action vector and observes only zeroth-order information to minimize a global cost function subject to constraints in which the local actions are coupled. Specifically, we employ two-point zeroth-order gradient estimation with delayed information to construct stochastic gradients, and leverage the constraint extrapolation technique and the averaging consensus framework to effectively handle the coupled constraints. We also provide convergence rate and oracle complexity results for our algorithm, characterizing its computational efficiency and scalability by rigorous theoretical analysis. Numerical experiments are conducted to validate the algorithm's effectiveness.
|
|
10:48-10:51, Paper ThA03.17 | |
Scalable Robust Optimization for Safe Multi-Agent Control under Unknown Deterministic Uncertainty |
|
Abdul, Arshiya Taj | Georgia Institute of Technology |
Saravanos, Augustinos D. | Georgia Institute of Technology |
Theodorou, Evangelos A. | Georgia Institute of Technology |
Keywords: Networked control systems, Uncertain systems, Distributed control
Abstract: This paper introduces a novel framework for multi-agent trajectory optimization under unknown deterministic uncertainty. Many systems are affected by deterministic disturbances, such as environmental effects, system degradation, etc., which cannot be accurately modeled using stochastic signals. Therefore, it is crucial to develop trajectory optimization frameworks that ensure safety despite these disturbances. To this end, we focus on solving a multi-agent trajectory optimization problem involving robust constraints, such as collision avoidance, that must be satisfied for all possible realizations of uncertainty lying in an ellipsoidal set. Conventional robust optimization techniques that are used to address such problems are computationally expensive and struggle when dealing with numerous constraints. To overcome this, we propose tighter approximations of robust constraints that significantly reduce computational complexity without compromising safety. Furthermore, leveraging these constraint approximations, we introduce a distributed robust optimization framework for decentralized multi-agent robust trajectory optimization based on the Alternating Direction Method of Multipliers (ADMM). This framework allows agents to optimize their trajectories without sharing control parameters or system information (e.g., dynamics), thereby preserving data security. The effectiveness of the proposed robust constraint approximations and the scalability of the presented distributed framework are demonstrated through simulation experiments with up to 164 agents.
|
|
10:51-10:54, Paper ThA03.18 | |
Performance Bounds for Multi-Vehicle Networks with Local Integrators |
|
Hansson, Jonas | Lund University |
Tegling, Emma | Lund University |
Keywords: Decentralized control, Distributed control, Large-scale systems
Abstract: In this work, we consider the problem of coordinating a collection of nth-order integrator systems. The coordination is achieved through the novel serial consensus design; this control design achieves a stable closed-loop system while adhering to the constraint of only using local and relative measurements. Earlier work has shown that second-order serial consensus can stabilize a collection of double integrators with scalable performance conditions independent of the number of agents and topology. This paper generalizes these performance results to an arbitrary order n ≥ 1. The derived performance bounds depend on the condition number, measured in the vector-induced maximum matrix norm, of a general diagonalizing matrix. We precisely characterize how a minimal condition number can be achieved. Third-order serial consensus is illustrated through a case study of PI-controlled vehicular formation, where the added integrators are used to mitigate the effect of unmeasured load disturbances. The theoretical results are illustrated through examples.
|
|
ThA04 |
Governor's Square 15 |
RI - Control of Robotic Systems and Mechatronics |
RI Session |
Chair: Mitterbach, Philipp | Eindhoven University of Technology |
Co-Chair: Li, Ji-Hong | Korea Institute of Robotics and Technology Convergence |
|
10:00-10:03, Paper ThA04.1 | |
A Structure-Preserving FEM-Model for the Control of Planar Soft Robots |
|
Mitterbach, Philipp | Eindhoven University of Technology |
Kuling, Irene | Technical University of Eindhoven |
Pogromsky, A. Yu. | Eindhoven University of Technology |
Keywords: Robotics, Reduced order modeling, Emerging control applications
Abstract: A major challenge in soft robotics is model-based control. Many common models for soft robots suffer a loss of mechanical structure when model reduction is performed. This is a significant problem for control, since many control methods, e.g. energy-based control, rely heavily on this structure. In this paper, we present an alternative finite dimensional reduction method that preserves the mechanical structure of the model of a soft robot. Using a form of finite element Galerkin projection, our approach incorporates this reduction process into a variational framework that is suitable for control applications. As a proof-of-principle, we present the model of a planar soft robot with preserved mechanical structure and a simple control paradigm. With this method, soft robots can be described in a form that is suitable for control strategies that depend on their mechanical structure, such as energy-based control.
|
|
10:03-10:06, Paper ThA04.2 | |
Fast Whole-Body Strain Regulation in Continuum Robots |
|
Ogunmolu, Olalekan | Microsoft Research |
Keywords: Robotics, Decentralized control, Adaptive control
Abstract: We propose reaching steps towards the real-time strain control of multiphysics, multiscale continuum soft robots. To study this problem fundamentally, we ground ourselves in a model-based control setting enabled by mathematically precise dynamics of a soft robot prototype. Poised to integrate, rather than reject, inherent mechanical nonlinearities for embodied compliance, we first separate the original robot dynamics into separate subdynamics --- aided by a perturbing time-scale separation parameter. Second, we prescribe a set of stabilizing nonlinear backstepping controllers for regulating the resulting subsystems' strain dynamics. Third, we study the interconnected singularly perturbed system by analyzing and establishing its stability. Fourth, our theories are backed up by fast numerical results on a single arm of the Octopus robot arm. We demonstrate strain regulation to equilibrium, in a significantly reduced time, of the whole-body reduced-order dynamics of an infinite degrees-of-freedom soft robot. This paper communicates our thinking within the backdrop of embodied intelligence: it informs our conceptualization, formulation, computational setup, and yields improved control performance for infinite degrees-of-freedom soft robots.
|
|
10:06-10:09, Paper ThA04.3 | |
Density Functions for Dynamic Safe Navigation of Robotic Systems |
|
Krishnamoorthy Shankara Narayanan, Sriram Sundar | Clemson University |
Moyalan, Joseph | University of California, Merced |
Zheng, Andrew | Clemson University |
Vaidya, Umesh | Clemson University |
Keywords: Robotics, Autonomous systems, Control applications
Abstract: This paper introduces a novel approach based on density functions for safe navigation in dynamic environments with time-varying obstacles and targets. We analytically construct time-varying density functions to address the challenges of dynamic safe navigation. Using this approach, we develop a safe feedback controller that ensures safety and almost everywhere stability in dynamic environments. The proposed framework is demonstrated on complex robotic systems, including the safe navigation of a quadruped robot with time-varying obstacles and sensor uncertainty, as well as a robotic arm tracking a time-varying target with static obstacles.
|
|
10:09-10:12, Paper ThA04.4 | |
LQR-CBF-RRT*: Safe and Optimal Motion Planning |
|
Yang, Guang | Boston University |
Cai, Mingyu | Lehigh University |
Ahmad, Ahmad | Boston University |
Prorok, Amanda | University of Cambridge |
Tron, Roberto | Boston University |
Belta, Calin | University of Maryland |
Keywords: Robotics, Control applications, Lyapunov methods
Abstract: We present LQR-CBF-RRT*, an incremental sampling-based algorithm for offline motion planning. Our framework leverages the strength of Control Barrier Functions (CBFs) and Linear Quadratic Regulators (LQR) to generate safety-critical and optimal trajectories for general affine control systems. This work uses CBF for safety guarantees and LQRs for optimal control synthesis during edge extensions. Traditional CBF methods involve Quadratic Programs (QPs), which add computational overhead and can sometimes be infeasible. Conversely, LQR-based controllers typically employ first-order Taylor approximations for nonlinear systems, necessitating consistent recalculations. To enhance motion planning efficiency, our framework directly verifies CBF constraints during the planning process, thereby eliminating the need for QP solutions. Additionally, we cache optimal LQR gain matrices in a hash table to bypass re-computation during local linearizations in the rewiring phase. To further boost sampling efficiency, we integrate the Cross-Entropy Method. Our results demonstrate that the proposed planner outperforms existing algorithms in computational efficiency and exhibits robust performance in real-world experiments.
|
|
10:12-10:15, Paper ThA04.5 | |
Two-Stage Proprioceptive State Estimation with Stability Guarantee for Legged Robots |
|
Li, Jun | Harbin Institute of Technology |
Wan, Yuhui | University of Leeds |
Li, Weihua | Harbin Institute of Technology |
Wang, Jianfeng | Harbin Institute of Technology |
Zhou, Chengxu | University College London |
Keywords: Robotics, Estimation, Sensor fusion
Abstract: To achieve dynamic legged locomotion, real-time acquisition of the robot floating base state is critical. However, current full-coupled state estimation solutions may suffer from convergence issues. To address this, a two-stage state estimator that uses proprioceptive sensors is proposed to estimate the floating base's velocity and pose. The approach decomposes the floating-base state estimation problem into base orientation estimation and base velocity and position estimation, resulting in a two-stage multi-sensor fusion algorithm with stability guarantees. Additionally, a covariance inflation method is introduced to consider the influences of contact switching by adjusting noise covariances. The proposed state estimator has been successfully implemented and verified on a real quadrupedal robot, supporting dynamic motions such as multi-gaited locomotion and push recovery.
|
|
10:15-10:18, Paper ThA04.6 | |
Robust Push Recovery on Bipedal Robots: Leveraging Multi-Domain Hybrid Systems with Reduced-Order Model Predictive Control |
|
Dai, Min | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Robotics, Hybrid systems, Optimal control
Abstract: In this paper, we present a novel control framework to achieve robust push recovery on bipedal robots while locomoting. The key contribution is the unification of hybrid system models of locomotion with a reduced-order model predictive controller determining: foot placement, step timing, and ankle control. The proposed reduced-order model is an augmented Linear Inverted Pendulum model with zero moment point coordinates; this is integrated within a model predictive control framework for robust stabilization under external disturbances. By explicitly leveraging the hybrid dynamics of locomotion, our approach significantly improves stability and robustness across varying walking heights, speeds, step durations, and is effective for both flat-footed and more complex multi-domain heel-to-toe walking patterns. The framework is validated with high-fidelity simulation on Cassie, a 3D underactuated robot, showcasing real-time feasibility and substantially improved stability. The results demonstrate the robustness of the proposed method in dynamic environments.
|
|
10:18-10:21, Paper ThA04.7 | |
Quadratic Programming-Based Posture Manipulation and Thrust-Vectoring for Agile Dynamic Walking on Narrow Pathways |
|
Wang, Chenghao | Northeastern University |
Sihite, Eric | Northeastern University |
Venkatesh Krishnamurthy, Kaushik | Northeastern University |
Pitroda, Shreyansh | Northeastern University |
Salagame, Adarsh | Northeastern University |
Ramezani, Alireza | Northeastern University |
Gharib, Morteza | Caltech |
Keywords: Robotics, Optimal control, Biologically-inspired methods
Abstract: There has been significant advancement in legged robot's agility where they can show impressive acrobatic maneuvers, such as parkour. These maneuvers rely heavily on posture manipulation. To expand the stability and locomotion plasticity, we use the multi-modal ability in our legged-aerial platform, the Husky Beta, to perform thruster-assisted walking. This robot has thrusters on each of its sagittal knee joints which can be used to stabilize its frontal dynamic as it walks. In this work, we perform a simulation study of quadruped narrow-path walking with Husky Beta, where the robot will utilize its thrusters to stably walk on a narrow path. The controller is designed based on a centroidal dynamics model with thruster and foot ground contact forces as inputs. These inputs are regulated using a QP solver to be used in a model predictive control framework. In addition to narrow-path walking, we also perform a lateral push-recovery simulation to study how the thrusters can be used to stabilize the frontal dynamics of our robot.
|
|
10:21-10:24, Paper ThA04.8 | |
Thermodynamics-Inspired Trajectory Optimization of a Planar Robotic Arm |
|
Fathizadeh, Meysam | Research Assistant, Mechanical Engineering Department, Cleveland |
Richter, Hanz | Cleveland State University |
Keywords: Robotics, Optimization, Energy systems
Abstract: The paper considers energy-oriented trajectory optimization of a two-link robot inspired by thermodynamics,particularly on the second law. A formulation of thermodynamic principles is developed within the framework of dynamical systems, extending these principles for use in multi-domain systems. The approach involves periodic motions, with the cyclic-averaged subsystem energies replacing temperature in an extended definition of entropy generation. In a representative problem, the robot arm moves between two arbitrary points under the influence of an external force and performs work by repeatedly elevating loads. The trajectories to be optimized were parameterized with a Fourier decomposition. Simulation results indicate that the proposed cost function effectively minimizes energy consumption with some advantages over direct minimization of the supplied energy. Specifically, only partial knowledge of the dissipation characteristics is needed, and the proposed cost function exhibits improved convexity properties near the optimal point in comparison to energy supply.
|
|
10:24-10:27, Paper ThA04.9 | |
Versatile Safety-Aware MPC for Dynamic Whole-Body Loco-Manipulation |
|
Hu, Muqun | University of Southern California |
Rigo, Alberto | University of Southern California |
Nguyen, Quan | University of Southern California |
Keywords: Robotics, Predictive control for nonlinear systems, Optimization
Abstract: Existing loco-manipulation frameworks employ nonlinear models that require complex commands and parameters that are difficult to plan and adjust for application on different tasks. This work presents a model predictive control (MPC) framework for dynamic loco-manipulation that enables versatile whole-body motion while ensuring safety. Our approach simplifies the user command to object trajectory only, then optimizes for whole-body motion for precise object tracking, which can be applied to disparate tasks without formulation change. This approach also allows for predictive collision avoidance for both the robot and the object, both during and after contact between the object and the robot system. We demonstrated the efficient model dynamics and collision avoidance method of this unified formulation through simulation results on the Unitree Aliengo with a custom-made arm, showing the ability to transport objects weighing 20% of the robot mass with precision of within 3 cm error from the final position. Further, our proposed method is able to throw an object at a moving target traveling away at 1 m/s. Leveraging this prediction accuracy, we demonstrated "flight phase" collision avoidance without a pre-planned trajectory by throwing a 1 kg object over a 1.2 m wall.
|
|
10:27-10:30, Paper ThA04.10 | |
Thruster-Assisted Incline Walking of a Legged-Aerial Robot Using Reduced Order Model and Collocation Method |
|
Venkatesh Krishnamurthy, Kaushik | Northeastern University |
Wang, Chenghao | Northeastern University |
Pitroda, Shreyansh | Northeastern University |
Salagame, Adarsh | Northeastern University |
Sihite, Eric | Northeastern University |
Ramezani, Alireza | Northeastern University |
Gharib, Morteza | Caltech |
Keywords: Robotics, Reduced order modeling, Biologically-inspired methods
Abstract: In this study, our aim is to evaluate the effectiveness of thruster-assisted steep slope walking for the Husky Carbon, a quadrupedal robot equipped with custom-designed actuators and plural electric ducted fans, through simulation prior to conducting experimental trials. Thruster-assisted steep slope walking draws inspiration from wing-assisted incline running (WAIR) observed in birds, and intriguingly incorporates posture manipulation and thrust vectoring, a locomotion technique not previously explored in the animal kingdom. Our approach involves developing a reduced-order model of the Husky robot, followed by the application of an optimization-based controller utilizing collocation methods and dynamics interpolation to determine control actions. Through simulation testing, we demonstrate the feasibility of hardware implementation of our controller.
|
|
10:30-10:33, Paper ThA04.11 | |
Hierarchical Reinforcement Learning and Value Optimization for Challenging Quadruped Locomotion |
|
Coholich, Jeremiah | Georgia Institute of Technology |
Murtaza, Muhammad Ali | Georgia Institute of Technology |
Hutchinson, Seth | Georgia Tech |
Zsolt, Kira | Georgia Tech Research Institute |
Keywords: Robotics, Reinforcement learning, Optimization
Abstract: We propose a novel hierarchical reinforcement learning framework for quadruped locomotion over challenging terrain. Our approach incorporates a two-layer hierarchy in which a high-level policy (HLP) selects optimal goals for a low-level policy (LLP). The LLP is trained using an on-policy actor-critic RL algorithm and is given footstep placements as goals. We propose an HLP that does not require any additional training or environment samples and instead operates via an online optimization process over the learned value function of the LLP. We demonstrate the benefits of this framework by comparing it with an end-to-end reinforcement learning (RL) approach. We observe improvements in its ability to achieve higher rewards with fewer collisions across an array of different terrains, including terrains more difficult than any encountered during training.
|
|
10:33-10:36, Paper ThA04.12 | |
Visual Inverse Kinematics: Finding Feasible Robot Poses under Kinematic and Vision Constraints |
|
Wu, Liangting | Boston University |
Tron, Roberto | Boston University |
Keywords: Robotics, Visual servo control
Abstract: This paper introduces the Visual Inverse Kinematics (VIK) problem to fill the gap between robot Inverse Kinematics (IK) and visual servoing (VS) control. IK aims to find joint configurations that achieve an explicitly given end effector pose. VS aims to drive the joint configurations to achieve an end effector position that is assumed to be kinematically feasible but is specified by vision constraints. Our proposed problem, VIK, aims to find a joint configuration that is kinematically feasible and achieves a specific image configuration. With respect to IK, we do not have a given end effector pose, and with respect to VS, we do not necessarily have an exact desired image configuration, but only vision-based constraints. In this work, we develop a formulation of the VIK problem with a Field of View (FoV) constraint, enforcing the visibility of an object from a camera on the robot. Our proposed solution is based on the idea of adding a virtual kinematic chain connecting the physical robot and the object; the FoV constraint is then equivalent to a joint angle kinematic constraint. Along the way, we introduce multiple vision-based cost functions to fulfill different objectives. We solve this formulation of the VIK problem using a method that involves a semidefinite program (SDP) constraint followed by a rank minimization algorithm. The performance of this method for solving the VIK problem is validated through simulations.
|
|
10:36-10:39, Paper ThA04.13 | |
Dynamic Modeling and Optimization of a Compliant Worm Robot |
|
Zhou, Xinyu | Michigan State University |
Luedtke, Christian | Michigan State University |
Qi, Xinda | Michigan State University |
Tan, Xiaobo | Michigan State University |
Keywords: Modeling, Robotics, Mechatronics
Abstract: In this work we present a dynamic model for a novel compliant worm robot, that is designed to travel in corrugated pipe environments. The robot uses the anisotropic interaction between its fins and pipe ridges to achieve locomotion when its body undergoes cable-driven peristaltic movement. The model, represented as a hybrid system, incorporates the deformation of the fins during contact and accounts for the switching logic of the anchoring positions. With the actuated body length as input, the model captures the movement dynamics and continuous locomotion of the robot. The model is used to optimize the gait, specifically the waveform of the body length change, for achieving the trade-off between travel speed and power efficiency, and to optimize fin design, revealing that large anisotropy enhances both travel speed and energy savings. The approach is supported with simulation results.
|
|
10:39-10:42, Paper ThA04.14 | |
Optimal Gait Design for Nonlinear Soft Robotic Crawlers |
|
Yenan, Shen | Princeton University |
Leonard, Naomi Ehrich | Princeton University |
Bamieh, Bassam | Univ. of California at Santa Barbara |
Arbelaiz, Juncal | Princeton University |
Keywords: Optimal control, Biologically-inspired methods, Robotics
Abstract: Soft robots offer a frontier in robotics with enormous potential for safe human-robot interaction and agility in uncertain environments. A stepping stone towards unlocking their potential is a control theory tailored to soft robotics, including a principled framework for gait design. We analyze the problem of optimal gait design for a soft crawling body – the crawler. The crawler is an elastic body with the control signal defined as actuation forces between segments of the body. We consider the simplest such crawler: a two-segmented body with a passive mechanical connection modeling the viscoelastic body dynamics and a symmetric control force modeling actuation between the two body segments. The model accounts for the nonlinear asymmetric friction with the ground, which together with the symmetric actuation forces enable the crawler’s locomotion. Using a describing-function analysis, we show that when the body is forced sinusoidally, the optimal actuator contraction frequency corresponds to the body’s natural frequency when operating with only passive dynamics. We then use the framework of Optimal Periodic Control (OPC) to design optimal force cycles of arbitrary waveform and the corresponding crawling gaits. We provide a hill-climbing algorithm to solve the OPC problem numerically. Our proposed methods and results inform the design of optimal forcing and gaits for more complex and multi-segmented crawling soft bodies.
|
|
10:42-10:45, Paper ThA04.15 | |
Towing Type of 3D Trajectory Tracking for a Class of Underactuated Autonomous Underwater Vehicles |
|
Li, Ji-Hong | Korea Institute of Robotics and Technology Convergence |
Kang, Hyungjoo | Korea Institute of Robotics and Technology Convergence |
Kim, Min-Gyu | Korea Institute of Robotics and Technology Convergence |
Lee, Mun-Jik | Korea Institute of Robotics and Technology Convergence |
Jin, Han-Sol | Korea Institute of Robotics & Technology Convergence |
Cho, Gun Rae | Korea Institute of Robotics and Technology Convergence |
Keywords: Lyapunov methods, Robotics, Mechanical systems/robotics
Abstract: This paper presents a novel approach for 3D trajectory tracking of underactuated autonomous underwater vehicles (AUVs). Unlike previous works, this method does not require the position tracking error to be zero but instead allows it to take a nonzero constant value. This setup resembles a vehicle being towed by a rope, where one end follows the reference trajectory. In this towing-type tracking scheme, the vehicle's desired attitude (defined by its pitch and yaw angles) is aligned with the polar and azimuth angles from the vehicle to a target point on the reference trajectory. Notably, this desired attitude is state-dependent, distinguishing it from the polar and azimuth angles of the reference trajectory's tangent. However, this paper demonstrates that, at least in certain special cases, the proposed tracking method ensures the vehicle's desired attitude converges to the reference trajectory's tangent. Numerical studies are conducted to validate the effectiveness of the proposed scheme.
|
|
10:45-10:48, Paper ThA04.16 | |
Safe and Efficient Robot Action Planning in the Presence of Unconcerned Humans |
|
Amiri, Mohsen | Washington State University |
Hosseinzadeh, Mehdi | Washington State University |
Keywords: Optimal control, Predictive control for linear systems, Robotics
Abstract: This paper proposes a robot action planning scheme that provides an efficient and probabilistically safe plan for a robot interacting with an unconcerned human--someone who is either unaware of the robot's presence or unwilling to engage in ensuring safety. The proposed scheme is predictive, meaning that the robot is required to predict human actions over a finite future horizon; such predictions are often inaccurate in real-world scenarios. One possible approach to reduce the uncertainties is to provide the robot with the capability of reasoning about the human's awareness of potential dangers. This paper discusses that by using a binary variable, so-called danger awareness coefficient, it is possible to differentiate between concerned and unconcerned humans, and provides a learning algorithm to determine this coefficient by observing human actions. Moreover, this paper argues how humans rely on predictions of other agents' future actions (including those of robots in human-robot interaction) in their decision-making. It also shows that ignoring this aspect in predicting human's future actions can significantly degrade the efficiency of the interaction, causing agents to deviate from their optimal paths. The proposed robot action planning scheme is verified and validated via extensive simulation and experimental studies on a LoCoBot WidowX-250.
|
|
10:48-10:51, Paper ThA04.17 | |
A Simplified Underactuated Platform for AI-Ready Bipedal Walking Control: The Stilt-Bot |
|
Kim, Baekseok | University of Nevada, Las Vegas |
Oh, Paul | University of Nevada Las Vegas |
Keywords: Mechanical systems/robotics, Autonomous robots, Robotics
Abstract: Recent advancements in applying machine learning to bipedal robots have demonstrated significant potential. However, the need for more comprehensive Artificial Intelligence (AI) testing environments has become increasingly evident, as current evaluations are constrained by limited datasets, making real-world testing essential. This paper presents Stilt-bot, an AI test bed specifically designed to support skill transfer across various robotic platforms, regardless of changes in size, shape, or actuator power. Inspired by how humans adapt their gait throughout growth, Stilt-bot employs a simple yet versatile 6 degrees-of-freedom (DOF) design and a prismatic sliding mechanism that enhance its agility and reduce weight. This configuration allows easy modifications to its height, mass, and power, providing a flexible and intuitive platform for evaluating AI-based walking control strategies. Both simulation and experimental results confirm Stilt-bot’s capability for stable flat-ground walking, demonstrating its effectiveness as a test bed for developing robust AI-driven bipedal locomotion.
|
|
10:51-10:54, Paper ThA04.18 | |
Safety-Critical Stabilization of Force-Controlled Nonholonomic Robots |
|
Han, Tianyu | The City College of New York |
Wang, Bo | City College of New York |
Keywords: Nonholonomic systems, Lyapunov methods, Autonomous vehicles
Abstract: We present a safety-critical controller for the problem of stabilization for force-controlled nonholonomic mobile robots. The proposed control law is based on the constructions of control Lyapunov functions (CLFs) and control barrier functions (CBFs) for cascaded systems. To address nonholonomicity, we design the nominal controller that guarantees global asymptotic stability and local exponential stability for the closed-loop system in polar coordinates and construct a strict Lyapunov function valid on any compact sets. Furthermore, we present a procedure for constructing CBFs for cascaded systems, utilizing the CBF of the kinematic model through integrator backstepping. Quadratic programming is employed to combine CLFs and CBFs to integrate both stability and safety in the closed loop. The proposed control law is time-invariant, continuous along trajectories, and easy to implement. Our main results guarantee both safety and local asymptotic stability for the closed-loop system.
|
|
10:54-10:57, Paper ThA04.19 | |
Refining Motion for Peak Performance: Identifying Optimal Gait Parameters for Energy-Efficient Quadrupedal Bounding |
|
Alqaham, Yasser G. | Syracuse University |
Cheng, Jing | Syracuse University |
Gan, Zhenyu | Syracuse University |
Keywords: Optimal control, Mechanical systems/robotics, Hybrid systems
Abstract: Energy efficiency is a critical factor in the performance and autonomy of quadrupedal robots. While previous research has focused on mechanical design and actuation improvements, the impact of gait parameters on energetics has been less explored. In this paper, we hypothesize that gait parameters, specifically duty factor, phase shift, and stride duration, are key determinants of energy consumption in quadrupedal locomotion. To test this hypothesis, we modeled the Unitree A1 quadrupedal robot and developed a locomotion controller capable of independently adjusting these gait parameters. Simulations of bounding gaits were conducted in Gazebo across a range of gait parameters at three different speeds: low, medium, and high. Experimental tests were also performed to validate the simulation results. The findings demonstrate that optimizing gait parameters can lead to significant reductions in energy consumption, enhancing the overall efficiency of quadrupedal locomotion. This work contributes to the advancement of energy-efficient control strategies for legged robots, offering insights directly applicable to commercially available platforms.
|
|
10:57-11:00, Paper ThA04.20 | |
Torque Constraint Modeling and Reference Shaping for Servo Systems |
|
Lu, Zehui | Purdue University |
Zhang, Tianpeng | Harvard University |
Wang, Yebin | Mitsubishi Electric Research Labs |
Keywords: Electrical machine control, Optimization, Estimation
Abstract: Servo systems, one of the backbones of modern manufacturing, are supposed to move as fast as possible for high productivity. Due to the inaccurate information on torque capacity, conventional trajectory generation methods are either overly conservative, compromising yield, or violate dynamical feasibility, compromising quality. This work proposes a method to address these shortcomings. Stable adaptive estimation of the servomotor model parameters is first performed, then torque capacity constraints are established as analytical functions of the motor speed based on parameter estimates, and finally, a computationally efficient algorithm is developed to reshape an aggressive (dynamically infeasible) trajectory into a feasible one. Theoretical analysis and numerical simulation validate the effectiveness of the proposed method.
|
|
ThB01 |
Plaza AB |
Trajectory Optimization and Tracking II |
Regular Session |
Chair: Manyam, Satyanarayana Gupta | Air Force Research Labs |
Co-Chair: Morel, Yannick | Maastricht University, Faculty of Psychology |
|
13:30-13:45, Paper ThB01.1 | |
Comparison of NLP Solvers and Derivative Accuracy for Solving Multi-Impulse Cislunar Trajectory Optimization Problems |
|
Yamamoto, Koya | Texas A&M Univeristy |
Taheri, Ehsan | Auburn University |
Junkins, John L. | Texas A&M Univ |
Keywords: Optimization algorithms, Computational methods, Aerospace
Abstract: This paper investigates and compares the convergence performance of two widely used Nonlinear Programming (NLP) solvers—MATLAB’s texttt{fmincon} and IPOPT— for solving multi-impulse cislunar trajectory optimization problems. The problem is formulated as a minimum-fuel (or minimum-Delta v) trajectory optimization for a transfer between two Distant Retrograde Orbits (DROs) in the Circular Restricted Three-Body Problem (CR3BP). Derivatives of the objective and constraints, required by the solvers, are computed using three methods: a) solver’s built-in finite-difference (FD) method, b) the complex-step based (CX) method, and c) an analytical method. The results demonstrate that while analytical derivatives provide the fastest convergence, the CX method achieves nearly the same performance, despite being significantly easier to compute than the analytical derivatives. The CX method outperforms the FD method in terms of derivative accuracy and its impact on the convergence performance of the solvers (i.e., total number of iterations and function evaluations). Results indicate that IPOPT exhibits faster convergence compared to texttt{fmincon}, when CX and analytical derivatives are used.
|
|
13:45-14:00, Paper ThB01.2 | |
Persistent Monitoring Trajectory Optimization in Partitioned Environments |
|
Hall, Jonas | Boston University |
Cassandras, Christos G. | Boston University |
Andersson, Sean B. | Boston University |
Keywords: Switched systems, Estimation, Optimization algorithms
Abstract: We consider the problem of using an autonomous agent to persistently monitor a collection of targets distributed in a given environment. We generalize existing work by allowing the agent's dynamics to vary throughout the environment, leading to a hybrid dynamical system. This introduces an additional layer of complexity towards the planning portion of the problem: we must not only identify in which order to visit the points of interest, but also in which order to traverse the regions. We propose a tailored global path planner and prove that it is not only probabilistically complete, but converges in probability to a time-optimal solution. We then design an offline sequence planner together with an online trajectory optimizer. Simulations validate the results.
|
|
14:00-14:15, Paper ThB01.3 | |
Generalization of Optimal Geodesic Curvature Constrained Dubins' Path on Sphere with Free Terminal Orientation |
|
Kumar, Deepak Prakash | Texas A&M University |
Darbha, Swaroop | Texas a & M Univ |
Manyam, Satyanarayana Gupta | DCS Corp., Air Force Research Labs |
Casbeer, David W. | Air Force Research Laboratory |
Keywords: Optimal control, Robotics, Aerospace
Abstract: In this paper, motion planning for a Dubins vehicle on a unit sphere to attain a desired final location is considered. The radius of the Dubins path on the sphere is lower bounded by r, where r represents the radius of the tightest left or right turn the vehicle can take on the sphere. Noting that r in (0, 1) and can affect the trajectory taken by the vehicle, it is desired to determine the candidate optimal paths for r ranging from nearly zero to close to one to attain a desired final location. In a previous study, this problem was addressed, wherein it was shown that the optimal path is of type CG, CC, or a degenerate path of the CG and CC paths, which includes C, G paths, for r leq frac{1}{2}. Here, C in {L, R} denotes an arc of a tight left or right turn of minimum turning radius r, and G denotes an arc of a great circle. In this paper, the candidate paths for the same problem are generalized to model vehicles with a larger turning radius. In particular, it is shown that the candidate optimal paths are of type CG, CC, or a degenerate path of the CG and CC paths for r leq frac{sqrt{3}}{2}. Noting that at most two LG paths and two RG paths can exist for a given final location, this paper further reduces the candidate optimal paths by showing that only one LG and one RG path can be optimal, yielding a total of seven candidate paths for r leq frac{sqrt{3}}{2}. Additional conditions for the optimality of CC paths are also derived in this study.
|
|
14:15-14:30, Paper ThB01.4 | |
Singularity-Free Task-Priority Design for Trajectory Tracking in Space Robots |
|
Bruschi, Pietro | Politecnico Di Milano |
Invernizzi, Davide | Politecnico Di Milano |
Keywords: Aerospace, Hierarchical control, Lyapunov methods
Abstract: Space robots, i.e., spacecraft with robotic manipulators, are essential for in-orbit servicing and debris removal. The high actuation redundancy of these systems allows for the imposition of multiple tasks with different priority levels. This work presents a singularity-free task-priority design for trajectory tracking in space robots, effectively removing the stringent assumption that the manipulator operates within the feasible workspace (free of kinematic singularities). The design, based on a modular hierarchical architecture, ensures end-effector tracking takes priority over tasks like maintaining a safe distance and a proper attitude of the spacecraft base. The proposed control law uses a 6D matrix representation of rigid motion that unifies attitude and position, offering invariance to inertial frame changes, making it ideal for space robotics.
|
|
14:30-14:45, Paper ThB01.5 | |
Anti-Windup Compensation for Quadrotor Trajectory Tracking with External Disturbances |
|
Shahbazzadeh, Majid | University of Louisville |
Richards, Christopher | University of Louisville |
Keywords: Constrained control, Flight control, Control applications
Abstract: This paper considers the problem of trajectory tracking for quadrotors operating in wind conditions that result in propeller thrust saturation. To address this problem, an antiwindup compensator (AWC) is developed to reduce the tracking performance degradation and destabilizing effects from thrust saturation. Relationships are derived showing how the tracking error and AWC states are influenced by the wind disturbance and saturation, and how the influences depend on the controller and AWC gains. As a result, these gains can be tuned to achieve desired performance levels. Simulation results are presented to validate the effectiveness of the proposed method.
|
|
14:45-15:00, Paper ThB01.6 | |
Accurate Planar Trajectory Tracking for a Class of Nonlinear Non-Minimum Phase Marine Vehicles |
|
Morel, Yannick | Maastricht University, Faculty of Psychology |
Keywords: Lyapunov methods, Autonomous systems
Abstract: The work presented addresses the tracking control problem for a class of marine vehicles characterized by the type of propulsion system used and resulting impact on dynamical behavior. Specifically, we consider movement in the horizontal plane of vehicles that use a rudder for steering (or, equivalently, vectored thrust), as opposed to differential thrust, or lateral thrusters. The corresponding system dynamics are underactuated, featuring fewer independent control inputs than degrees of freedom. They also typically display a non-minimal phase nature (in the nonlinear sense), where desirable equilibria of the internal (non-actuated) dynamics prove unstable. We present solutions to this problem, which rely on careful design of the set of errors tracked by the external dynamics, guaranteeing uniform ultimate boundedness of tracking errors.
|
|
ThB02 |
Plaza DE |
Statistical Learning |
Regular Session |
Chair: Lamperski, Andrew | University of Minnesota |
Co-Chair: Tang, Wentao | NC State University |
|
13:30-13:45, Paper ThB02.1 | |
Function Gradient Approximation with Random Shallow ReLU Networks with Control Applications |
|
Lamperski, Andrew | University of Minnesota |
Salapaka, Siddharth | University of Illinois Urbana-Champaign |
Keywords: Statistical learning, Adaptive control, Machine learning
Abstract: Neural networks are widely used to approximate unknown functions in control. A common neural network architecture uses a single hidden layer (i.e. a shallow network), in which the input parameters are fixed in advance and only the output parameters are trained. A common argument asserts that if output parameters exist to approximate the unknown function with sufficient accuracy, then desired control performance can be achieved. A long-standing theoretical gap was that no conditions existed to guarantee that, for the fixed input parameters, the required accuracy could be obtained by training the output parameters. Our recent work has partially closed this gap by demonstrating that if input parameters are chosen randomly, then for any sufficiently smooth function, with high-probability there are output parameters resulting in O((1/m)1/2) approximation errors, where m is the number of neurons. However, some applications, notably continuous-time value function approximation, require that the network approximates the both the unknown function and its gradient with sufficient accuracy. In this paper, we show that randomly generated input parameters and trained output parameters result in gradient errors of O((log(m)/m)1/2), and additionally, improve the constants from our prior work. We show how to apply the result to policy evaluation problems.
|
|
13:45-14:00, Paper ThB02.2 | |
Manifold-Guided Stabilization of Nonlinear Dynamical Systems with Diffusion Models |
|
Mukherjee, Amartya | University of Waterloo |
Quartz, Thanin | University of Waterloo |
Liu, Jun | University of Waterloo |
Keywords: Statistical learning, Algebraic/geometric methods, Uncertain systems
Abstract: This paper introduces Manifold-Guided Stabilizing Control (MGSC), a novel approach to synthesizing stabilizing controllers for nonlinear dynamical systems using diffusion models. Our method formulates control synthesis as a search for the closest asymptotically stable vector field within a learned manifold of stable dynamics. We train a diffusion model on a dataset of asymptotically stable vector fields and employ Tweedie’s estimate to iteratively adjust control parameters, ensuring convergence to a stabilizing controller. This formulation enables zero-shot stabilization for previously unseen systems with significantly reduced computational cost. Our numerical experiments demonstrate that MGSC achieves stabilization in just 16 seconds, compared to 2 minutes in prior work, while generalizing effectively across different nonlinear control problems. These results highlight the potential of diffusion models as a powerful tool for fast, data-driven control synthesis.
|
|
14:00-14:15, Paper ThB02.3 | |
On the Sample Complexity of Set Membership Estimation for Linear Systems with Disturbances Bounded by Convex Sets |
|
Xu, Haonan | University of Illinois Urbana-Champaign |
Li, Yingying | UIUC |
Keywords: Statistical learning, Identification for control, Estimation
Abstract: This paper revisits the set membership identification for linear control systems and establishes its convergence rates under relaxed assumptions on (i) the persistent excitation requirement and (ii) the system disturbances. In particular, instead of assuming persistent excitation exactly, this paper adopts the block-martingale small-ball condition enabled by randomly perturbed control policies to establish the convergence rates of SME with high probability. Further, we relax the assumptions on the shape of the bounded disturbance set and the boundary-visiting condition. Our convergence rates hold for disturbances bounded by general convex sets, which bridges the gap between the previous convergence analysis for general convex sets and the existing convergence rate analysis for L-infinity balls. Further, we validate our convergence rates by several numerical experiments.
|
|
14:15-14:30, Paper ThB02.4 | |
Koopman Operator in the Weighted Function Spaces and Its Learning for the Estimation of Lyapunov and Zubov Functions |
|
Tang, Wentao | NC State University |
Keywords: Statistical learning, Lyapunov methods, Stability of nonlinear systems
Abstract: Theoretical properties and data-driven learning of the Koopman operator, which represents nonlinear dynamics as a linear mapping on a properly defined functional spaces, have become key problems in nonlinear system identification and control. However, Koopman operators that are approximately learned from snapshot data may not always accurately predict the system evolution on long horizons. In this work, by defining the Koopman operator on a space of weighted continuous functions and learning it on a weighted reproducing kernel Hilbert space, the Koopman operator is guaranteed to be contractive and the accumulated error is bounded. The weighting function, assumed to be known a priori due to the knowledge on system stability, has an exponential decay with the flow. Under such a construction, the Koopman operator learned from data is used to estimate (i) Lyapunov functions for globally asymptotically stable dynamics, and (ii) Zubov-Lyapunov functions that characterize the domain of attraction. For these estimations, probabilistic error bounds are derived.
|
|
14:30-14:45, Paper ThB02.5 | |
Time-Reversal Solution of BSDEs in Stochastic Optimal Control: A Linear Quadratic Study |
|
Mei, Yuhang | University of Washington |
Taghvaei, Amirhossein | University of Washington Seattle |
Keywords: Stochastic optimal control, Statistical learning, Stochastic systems
Abstract: This paper addresses the numerical solution of backward stochastic differential equations (BSDEs) arising in stochastic optimal control. Specifically, we investigate two BSDEs: one derived from the Hamilton-Jacobi-Bellman equation and the other from the stochastic maximum principle. For both formulations, we analyze and compare two numerical methods. The first utilizes the least-squares Monte-Carlo (LSMC) approach for approximating conditional expectations, while the second leverages a time-reversal (TR) of diffusion processes. Although both methods extend to nonlinear settings, our focus is on the linear-quadratic case, where analytical solutions provide a benchmark. Numerical results demonstrate the superior accuracy and efficiency of the TR approach across both BSDE representations, highlighting its potential for broader applications in stochastic control
|
|
14:45-15:00, Paper ThB02.6 | |
Physics-Informed Building Occupancy Detection |
|
Esmaieeli Sikaroudi, Amir Mohammad | University of Arizona |
Goikhman, Boris | Airvoice |
Chubarov, Dmitri | Airvoice |
Nguyen, Hung Dinh | Nanyang Technological University, Singapore |
Chertkov, Michael | University of Arizona |
Vorobev, Petr | Nanyang Technological University |
Keywords: Markov processes, Estimation, Statistical learning
Abstract: Energy efficiency of buildings is considered to be one of the major means of achieving the net-zero carbon goal around the world. The big part of the energy savings are supposed to be coming from optimizing the operation of the building heating, ventilation, and air conditioning (HVAC) systems. There is a natural trade-off between the energy efficiency and the indoor comfort level, and finding an optimal operating schedule/regime requires knowing the occupancy of different spaces inside of the building. Moreover, the COVID-19 pandemic has also revealed the need to sustain the high quality of the indoor air in order to reduce the risk of spread of infection. Occupancy detection from indoor sensors is thus an important practical problem. In the present paper, we propose detection of occupancy based on the carbon dioxide measurements inside the building. In particular, a new approach based on the, so-called, switching auto-regressive process with Markov regime is presented and justified by the physical model of the carbon dioxide concentration dynamics. We demonstrate the efficiency of the method compared to simple Hidden Markov approaches on simulated and real-life data. We also show that the model is flexible and can be generalized to account for different ventilation regimes, simultaneously detecting the occupancy and the ventilation rate.
|
|
ThB03 |
Plaza CF |
Networked Control Systems |
Regular Session |
Chair: Ramasubramanian, Bhaskar | Western Washington University |
Co-Chair: Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
|
13:30-13:45, Paper ThB03.1 | |
Modeling and Designing Non-Pharmaceutical Interventions in Epidemics: A Submodular Approach |
|
Cheng, Shiyu | Washington University in St. Louis |
Niu, Luyao | University of Washington |
Ramasubramanian, Bhaskar | Western Washington University |
Clark, Andrew | Washington University in St. Louis |
Poovendran, Radha | University of Washington |
Keywords: Network analysis and control, Control of networks
Abstract: This paper considers the problem of designing non-pharmaceutical intervention (NPI) strategies, such as masking and social distancing, to slow the spread of a viral epidemic. We formulate the problem of jointly minimizing the infection probabilities of a population and the cost of NPIs based on a Susceptible-Infected-Susceptible (SIS) propagation model. To mitigate the complexity of the problem, we consider a steady-state approximation based on the quasi-stationary (endemic) distribution of the epidemic, and prove that the problem of selecting a minimum-cost strategy to satisfy a given bound on the quasi-stationary infection probabilities can be cast as a submodular optimization problem, which can be solved in polynomial time using the greedy algorithm. We carry out experiments to examine effects of implementing our NPI strategy on propagation and control of epidemics on a Watts-Strogatz small-world graph network. We find the NPI strategy reduces the steady state of infection probabilities of members of the population below a desired threshold value.
|
|
13:45-14:00, Paper ThB03.2 | |
Controllability and Observability of Temporal Hypergraphs |
|
Dong, Anqi | KTH Royal Institute of Technology |
Mao, Xin | University of North Carolina at Chapel Hill |
Vasudevan, Ramanarayan | University of Michigan |
Chen, Can | University of North Carolina at Chapel Hill |
Keywords: Networked control systems, Biological systems, Time-varying systems
Abstract: Numerous complex systems, such as those arisen in ecological networks, genomic contact networks, and social networks, exhibit higher-order and time-varying characteristics, which can be effectively modeled using temporal hypergraphs. However, analyzing and controlling temporal hypergraphs poses significant challenges due to their inherent time-varying and nonlinear nature, while most existing methods predominantly target static hypergraphs. In this article, we generalize the notions of controllability and observability to temporal hypergraphs by leveraging tensor and nonlinear systems theory. Specifically, we establish tensor-based rank conditions to determine the weak controllability and observability of directed, weighted temporal hypergraphs. The proposed framework is further demonstrated with synthetic and real-world examples.
|
|
14:00-14:15, Paper ThB03.3 | |
Optimal Risk-Sensitive Scheduling Policies for Remote Estimation of Autoregressive Markov Processes |
|
Dutta, Manali | Indian Institute of Science |
Singh, Rahul | Indian Institute of Science |
Keywords: Networked control systems, Control over communications, Filtering
Abstract: We consider a remote estimation setup, where data packets containing sensor observations are transmitted over a Gilbert-Elliot channel to a remote estimator, and design scheduling policies that minimize a risk-sensitive cost, which is equal to the expected value of the exponential of the cumulative cost incurred during a finite horizon, that is the sum of the cumulative transmission power consumed, and the cumulative squared estimation error. More specifically, consider a sensor that observes a discrete-time autoregressive Markov process, and at each time decides whether or not to transmit its observations to a remote estimator using an unreliable wireless communication channel after encoding these observations into data packets. Modeling the communication channel as a Gilbert-Elliot channel allows us to take into account the temporal correlations in its fading. We pose this dynamic optimization problem as a Markov decision process (MDP), and show that there exists an optimal policy that has a threshold structure, i.e., at each time t it transmits only when the current channel state is good, and the magnitude of the current ``error'' exceeds a certain threshold.
|
|
14:15-14:30, Paper ThB03.4 | |
Effect of Antagonistic Interactions on Agreement of Agents Over a Hierarchical Ring Digraph |
|
D, Sahaya Aarti | Indian Institute of Technology Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Keywords: Networked control systems, Decentralized control, Cooperative control
Abstract: Hierarchical cyclic pursuit has garnered significant attention among researchers in recent years. Along similar lines, this work aims to study ring digraphs, with a hierarchical "necklace" structure, involving antagonistic interactions. In the first part, identical antagonistic interactions, represented by negative edge weights, within macro vertices are considered. A study on the effect of such negative interactions on the consensus of agents is presented and bounds on negative gain, for consensus, are derived. Subsequently, heterogeneous antagonistic interactions are considered and the effect of these interactions on the consensus of the agents is analyzed. The set of points where consensus is achievable, when the edge weights are varied within the derived permissible bounds, is also investigated. Finally, numerical examples and corresponding simulations are presented to validate the analytical results.
|
|
14:30-14:45, Paper ThB03.5 | |
Detection-Rate-Oriented Watermarking for Replay Attack Detection in Cyber-Physical Systems |
|
Li, Zheng | Shanghai Jiao Tong University |
Fang, Chongrong | Shanghai Jiao Tong University |
Keywords: Networked control systems, Kalman filtering
Abstract: In this paper, we propose a novel approach for designing physical watermarks in cyber-physical systems (CPSs) with a focus on detection rate. While physical watermarks have been recognized as effective defenses against replay attacks, existing methods lack direct metrics to evaluate the detection performance. We address this gap by deriving a direct mapping between the watermark parameters and the detection rate. Our results show that, under specific conditions, physical watermarks can be effectively designed based on detection and false alarm rates. We provide a comprehensive set of conditions for this design, offering a rigorous and quantifiable framework for assessing the efficacy of watermark-based replay attack detection methods. Finally, extensive simulations are conducted to demonstrate the effectiveness of the proposed method.
|
|
ThB04 |
Governor's Sq. 15 |
Mechatronics I |
Invited Session |
Chair: Al Janaideh, Mohammad | University of Guelph |
Co-Chair: Hashim, Hashim A | Carleton University |
Organizer: Hashim, Hashim A | Carleton University |
Organizer: Al Janaideh, Mohammad | University of Guelph |
|
13:30-13:45, Paper ThB04.1 | |
An Analytical Approach to Signal Denoising Based on Singular Value Decomposition (I) |
|
Al-Tawaha, Ahmad S | Jordan University of Science and Technology |
Alshorman, Ahmad | Jordan University of Science and Technology |
Jin, Ming | Virginia Tech |
Al Janaideh, Mohammad | University of Guelph |
Aljanaideh, Khaled | Jordan University of Science and Technology |
Keywords: Computational methods
Abstract: Signal denoising is a fundamental task in signal processing that aims to extract the true underlying signal from noisy observations. Existing signal denoising methods, such as wavelet transform and Fourier-based filtering, suffer from low computational efficiency and potential loss of important signal components during reduction. Moreover, determining the optimal threshold for singular value selection remains a challenge in traditional techniques that are based on singular value decomposition. In this paper, we introduce an efficient, non-iterative algorithm for signal denoising that leverages two noisy observations. By constructing Hankel matrices from these observations, the proposed method establishes a threshold using the largest singular value of their difference, effectively separating true signal components from noise without the need for iterative optimization. We validate the approach on both synthetic data and real-world measurements, including smartphone sensor readings and displacement data from a micro-positioning system with a piezoelectric actuator.
|
|
13:45-14:00, Paper ThB04.2 | |
Koopman Operator-Based Modeling of Cable Slab Nonlinear Dynamics (I) |
|
Pumphrey, Michael Joseph | University of Guelph |
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Al-Rawashdeh, Yazan Mohammad | Al-Zaytoonah University of Jordan |
Alatawneh, Natheer | Cysca Technology |
Aljanaideh, Khaled | Jordan University of Science and Technology |
Boker, Almuatazbellah | Virginia Tech |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechanical systems/robotics
Abstract: This paper presents a higher-dimensional state-space model for predicting the nonlinear dynamics of a cable slab using Koopman operators. Currently, cable slab dynamics is a problem in precision motion systems, as the cable can impart unwanted vibrations or disturbances onto the motion stage. Currently, there is no definitive analytical model of the cable slab dynamics. The development process involved systematically evaluating various Koopman observable functions to minimize tracking errors. The model achieves an error margin of approximately ±1% for the given motion range and is robust enough to predict the movement of untrained acyclic randomized cable slab motions. Through a Koopman Operator approach the analytical formulation of the nonlinear cable slab dynamics is approximated. The model is verified through experimental results.
|
|
14:00-14:15, Paper ThB04.3 | |
A Unified Finite-Time Sliding Mode Quaternion-Based Tracking Control for Quadrotor UAVs without Time Scale Separation (I) |
|
Ali, Ali Mohamed | Carleton University |
Hashim, Hashim A | Carleton University |
Jayasiri, Awantha | National Research Council |
Keywords: Mechatronics, Stability of nonlinear systems, Control applications
Abstract: This paper presents a novel design for finite-time position control of quadrotor Unmanned Aerial Vehicles (UAVs). A robust, finite-time, nonlinear feedback controller is introduced to reject bounded disturbances in tracking tasks. The proposed control framework differs conceptually from conventional controllers that utilize Euler angle parameterization for attitude and adhere to the traditional hierarchical inner-outer loop design. In standard approaches, the translational controller and the corresponding desired attitude are computed first, followed by the design of the attitude controller based on time-scale separation between fast attitude and slow translational dynamics. In contrast, the proposed control scheme is quaternion-based and utilizes a transit feed-forward term in the attitude dynamics that anticipates the slower translational subsystem. Robustness is achieved through the use of continuously differentiable sliding manifolds. The proposed approach guarantees semi-global finite-time stability, without requiring time-scale separation. Finally, numerical simulation results are provided to demonstrate the effectiveness of the proposed controller.
|
|
14:15-14:30, Paper ThB04.4 | |
H-Infinity Robust Dynamic Decoupling for Precision Motion Systems: An LMI Approach (I) |
|
Wu, Jingjie | University of Wisconsin-Madison |
Zhou, Lei | University of Wisconsin-Madison |
Keywords: H-infinity control, LMIs, Mechatronics
Abstract: In multi-axis precision positioning systems, the plant dynamics are typically decoupled into single-input, single-output (SISO) channels to enable a decentralized controller structure. However, static decoupling often fails to provide sufficient decoupling performance in the mid-to-high frequency range, which limits the system's performance. Dynamic decoupling has been demonstrated as a promising tool for addressing cross-coupling effects while preserving the use of SISO feedback and feedforward controllers. However, existing model-based dynamic decoupling approaches cannot offer sufficient robustness with respect to model uncertainties. While data-driven methods have been developed, they often lack a stability guarantee and cannot be used for hardware-control co-optimization purposes. This paper proposes a novel model-based dynamic decoupling controller synthesis method for precision motion systems with model uncertainties explicitly considered through a robust H-infinity optimization, which can be transformed into Linear Matrix Inequalities (LMI) for solving. Simulation using a lightweight three-axis precision motion system is presented for evaluation.
|
|
14:30-14:45, Paper ThB04.5 | |
Output Feedback Decoupling Control of Deformable Mirrors for Adaptive Optics Applications (I) |
|
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Boker, Almuatazbellah | Virginia Tech |
Alatawneh, Natheer | Cysca Technology |
Al-Rawashdeh, Yazan Mohammad | Al-Zaytoonah University of Jordan |
Zhang, Lihong | Memorial University of Newfoundland |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics, Nonlinear output feedback, Observers for nonlinear systems
Abstract: The deformable mirror (DM) is a critical component in adaptive optic systems. However, controlling and modeling it is challenging due to its nonlinear, coupled, and position-dependent dynamics, which arise from the interaction of actuators and mechanical coupling across multiple axes. This paper presents a decoupled control strategy for the DM that addresses the issue of unknown system dynamics and position dependency. The proposed approach incorporates a proportional controller as an inner control loop and an output feedback controller as an outer control loop, combining state feedback with an extended high-gain observer (EHGO). The controller design does not require prior knowledge of the mirror’s dynamics and can be implemented independently for each actuator and axis. Simulations were conducted to assess the controller’s performance, demonstrating its effectiveness in tracking desired motions along both the X and Y axes, even in the presence of unknown dynamics.
|
|
14:45-15:00, Paper ThB04.6 | |
Current-Control Approach for Hysteresis Compensation and Linearization of Nonlinear Reluctance Actuator in Motion System Applications (I) |
|
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Alatawneh, Natheer | Cysca Technology |
Boker, Almuatazbellah | Virginia Tech |
Aljanaideh, Omar | ASML |
Xu, Binyan | Univeristy of Victoria |
Zhang, Lihong | Memorial University of Newfoundland |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics, Nonlinear output feedback, Observers for nonlinear systems
Abstract: Reluctance actuators (RAs) present a promising alternative to Lorentz actuators for next-generation positioning and scanning systems, such as wafer scanners and deformable mirrors. However, RAs face challenges such as nonlinear output force characteristics, negative stiffness due to gap dependency, and nonlinear magnetic hysteresis. This paper proposes a current control strategy based on an extended high-gain observer to linearize the dynamic behavior of an RA driving a motion system. The controller integrates a PI controller for tracking the desired current and utilizes the extended high-gain observer to estimate unknown uncertainties and nonlinearities in the actuator dynamics, thereby enhancing system robustness and linearizing the actuator’s behavior. The effectiveness of the proposed controller is demonstrated through simulation and experimental testing. Both simulation and experimental results confirm the controller’s ability to linearize the RA and mitigate the negative stiffness effects caused by position dependency. The experimental results demonstrate the system’s linear behavior under the proposed controller, and the controller also increases the system’s frequency bandwidth to f_{BW}=35 Hz compared to f_{BW}=9.5 Hz for the system without a controller.
|
|
ThB05 |
Governor's Sq. 9 |
Healthcare and Medical Systems III |
Invited Session |
Chair: Menezes, Amor A. | University of Florida |
Co-Chair: Mesbah, Ali | University of California, Berkeley |
Organizer: Menezes, Amor A. | University of Florida |
Organizer: Hahn, Jin-Oh | University of Maryland |
Organizer: Zhang, Wenlong | Arizona State University |
Organizer: Mesbah, Ali | University of California, Berkeley |
Organizer: Medvedev, Alexander V. | Uppsala University |
|
13:30-13:45, Paper ThB05.1 | |
Control of Mean Arterial Blood Pressure Is Not Effective Control of Cardiovascular State (I) |
|
Baum, Taylor Elise | Massachusetts Institute of Technology |
Kazemi, Mohammadreza | Florida International University |
Heldt, Thomas | Massachusetts Institute of Technology |
Brown, Emery N. | Massachusetts General Hospital |
Keywords: Biomedical, Biological systems, Control system architecture
Abstract: Regulation of the cardiovascular system is a critical component of patient care in the operating room and intensive care unit. One such goal is to regulate arterial blood pressure (ABP) to ensure end-organ perfusion. This regulation is managed by clinicians through manual titration of medications or fluids, an approach prone to human error. Physiological closed-loop control systems automate this titration and have significant potential to improve performance of cardiovascular system management. Many such control systems have been developed utilizing varied control strategies (e.g., rule-based methods, proportional-integral-derivative control, and model predictive control, etc.). These systems are often designed and evaluated without consideration of mechanistic cardiovascular states. In this work, we aim to motivate why mechanistic cardiovascular states must be considered in the design and evaluation of closed-loop systems for cardiovascular control. We use control of ABP to ensure end-organ perfusion as an illustrative example. We show theoretically and with recordings from a swine cardiovascular system model that similar mean ABP values often result from different cardiovascular states. We then present an example layered control architecture which explicitly incorporates mechanistic cardiovascular states in its decision-making scheme. At the high layer is selection of the control targets and actuators informed by mechanistic estimates of cardiovascular states, systemic resistance and cardiac output. At the low layer is a set of real-time controllers with actuators that regulate mean ABP either directly or through regulation of mechanistic cardiovascular states. We then present simulations of a representative high layer and its interaction with a representative low layer within our proposed control architecture. Overall, we assert that closed-loop systems for cardiovascular control should incorporate mechanistic cardiovascular states into their design and evaluation.
|
|
13:45-14:00, Paper ThB05.2 | |
Koopman Modeling of Human Gait Dynamics for Global Modal Analysis Using Periodic Motion Regularization (I) |
|
Kamienski, Emily | Massachusetts Institute of Technology |
Donahue, Seth | Shriners Hospital for Children |
Major, Matthew | Jesse Brown VA Medical Center, Northwestern University |
Asada, H. Harry | Massachusetts Inst. of Tech |
Keywords: Modeling, Biomedical, Hybrid systems
Abstract: This paper presents a data driven global linear model of steady state walking dynamics. Instantaneous whole body angular momentum is a physics informed aggregate quantity used as a marker for dynamic balance during locomotion. Gait dynamics are often modeled as hybrid and nonlinear. We propose using Koopman Operators to model the gait stability dynamics with a global, linear model. This is achieved by augmenting the whole body angular momentum state variables with learned observables, or basis functions, such that the dynamics look linear in the lifted dimension. Considering that the gait dynamics are periodic, a regularization term that encourages the state transition matrix to be orthonormal is added to the loss term when learning the observables. This forces a periodic model to be learned and prevents the likelihood of unstable poles. A low average MSE was obtained over 2 gait cycles for different population types, each with slightly differing gait dynamics. Furthermore, this linear representation enables the use of linear analysis tools that could have clinical implications for assessing the gait of different patient groups.
|
|
14:00-14:15, Paper ThB05.3 | |
Memristor-Based Dynamic Modeling of Muscle Fatigue (I) |
|
Richter, Hanz | Cleveland State University |
Mastropieri, Adam | Cleveland State University |
Keywords: Biomedical, Modeling, Human-in-the-loop control
Abstract: We introduce a novel dynamic modeling paradigm to represent fatigue and recovery processes in muscles by dynamic extension of the classical 3-element Hill model. A memristor and a capacitor are used in a series circuit arrangement, with a voltage source representing muscle activation. This model is shown to capture the fundamental features of fatigue accumulation and recovery in response to arbitrary motion, load and activation profiles, in contrast with other work assuming specific input shapes such as constants or periodic functions. Further, the basic model is shown to capture the gradual increase in activation spectral amplitudes with fatigue progression, which is an experimental fact. The paper shows how the fatigue modeling element is integrated into larger musculoskeletal dynamic models. Possible modifications and extensions aimed at more flexibility are also suggested.
|
|
14:15-14:30, Paper ThB05.4 | |
YourMove: A System Identification and Hybrid Model Predictive Control Personalized mHealth Intervention for Physical Activity (I) |
|
El Mistiri, Mohamed | Arizona State University |
Park, Junghwan | University of California, San Diego |
Khan, Owais | Arizona State University |
Banerjee, Sarasij | Arizona State University |
Hekler, Eric | UC San Diego |
Rivera, Daniel E. | Arizona State Univ |
Keywords: Emerging control applications, Predictive control for linear systems, Identification for control
Abstract: Control systems engineering has contributed to a paradigm shift in behavioral science and medicine. Among the applications of control systems engineering in behavioral medicine includes understanding, on an individual level, the dynamics of behavior change and leveraging this knowledge to deliver optimized, personalized interventions. These principles are the foundation of the control optimization trial (COT) framework that aims to facilitate the dissemination of data-driven, control-oriented behavioral interventions and consequently improve individual and public health. YourMove ( ClinicalTrials.gov ID NCT05598996), a first-of-its-kind COT study, is an intervention to increase physical activity in sedentary adults and the culmination of years of research into the effectiveness of system identification and model predictive control (MPC) design in behavior change interventions. This paper summarizes the methods utilized in YourMove and provides promising preliminary results for illustrative participants in the ongoing study. The results presented are consistent with scenarios simulated in prior work and validate the COT framework as an effective tool for delivering personalized closed-loop interventions. In particular, results from this study demonstrate the performance and robustness of a three-degree-of-freedom Kalman filter-based hybrid model predictive control (3DoF-KF HMPC) algorithm in real-world settings.
|
|
14:30-14:45, Paper ThB05.5 | |
Small-Sample-Size Data-Driven Early Disease-Detection and Re-Stabilization for mRNA-Protein Gene Regulatory Networks |
|
Shen, Xun | Osaka University |
Sasahara, Hampei | Institute of Science Tokyo |
Imura, Jun-ichi | Tokyo Institute of Technology |
Aihara, Kazuyuki | University of Tokyo |
Keywords: Machine learning, Fault detection, Genetic regulatory systems
Abstract: A mRNA-protein gene regulatory network is a differential equation model for gene expression that incorporates the dynamics of both mRNA and protein expressions. This paper addresses the challenges of using High-Dimensional Low-Sample-Size (HDLSS) data set for early disease detection and re-stabilization in mRNA-protein gene regulatory networks. For the first time, we demonstrate that detecting the pre-disease stage of mRNA-protein gene regulatory networks is possible using only the HDLSS data of either mRNA or protein. After detecting the pre-disease stage, it is crucial to prevent disease progression at that point. From a control engineering perspective, this prevention can be achieved by enhancing the system's stability, a process called as re-stabilization. We demonstrate that the key nodes for re-stabilization in the mRNA-protein gene regulatory network manifest as mRNA-protein duals. The intervention strategy, whether suppression or promotion, is identical for the mRNA and protein components of each key dual. Furthermore, we present a Lyapunov equation-based system identification method to estimate the system matrix using the HDLSS data set. From the estimated system matrix, the key nodes for re-stabilization can be identified. The effectiveness of the proposed method has been validated via numerical examples.
|
|
14:45-15:00, Paper ThB05.6 | |
Adjusting Aggressiveness of Depth-Of-Hypnosis PID Control by MPC-Based Feedforward (I) |
|
Paolino, Nicola | Univeristy of Brescia |
Norlund, Frida | Lund University |
Schiavo, Michele | Università Degli Studi Di Brescia |
Visioli, Antonio | University of Brescia |
Soltesz, Kristian | Lund University |
Keywords: Biomedical, Predictive control for linear systems, PID control
Abstract: In this paper we propose a technique to enhance the performance of a Proportional-Integral-Derivative (PID)-based control structure for Depth-of-Hypnosis control in total intravenous anesthesia when set-point changes are required during the maintenance phase. In particular, the PID controller, tuned for disturbance rejection, is integrated with a feedforward action based on Model Predictive Control (MPC). A tuning parameter determines the aggressiveness of the controller, thus allowing the anesthesiologist to select the most appropriate transient response depending on the kind of patient and of surgery. Simulation results show that the method is useful in providing an effective tool for the anesthesiologist to interact with the control system.
|
|
ThB06 |
Governor's Sq. 10 |
Optimal Control III |
Regular Session |
Chair: Coogan, Samuel | Georgia Institute of Technology |
Co-Chair: Rantzer, Anders | Lund University |
|
13:30-13:45, Paper ThB06.1 | |
On Minimax Optimal Dual Control for Fully Actuated Systems |
|
Rantzer, Anders | Lund University |
Keywords: Adaptive control, Robust adaptive control
Abstract: A multi-variable adaptive controller is derived as the explicit solution to a minimax dynamic game. The minimizing player selects the control action as a function of past state measurements and inputs. The maximizing player selects disturbances and model parameters for the underlying linear time-invariant dynamics. This leads to a Bellman equation that can be solved explicitly for the case with unitary B-matrix known up to a sign and no input penalty. The minimizing policy is a dual controller that optimizes the tradeoff between exploration and exploitation.
|
|
13:45-14:00, Paper ThB06.2 | |
On the Existence of Linear Observed Systems on Manifolds with Connection |
|
Liu, Changwu | Tsinghua University |
Shen, Yuan | Tsinghua University |
Keywords: Algebraic/geometric methods, Observers for nonlinear systems, Kalman filtering
Abstract: Linear observed systems on manifolds are a special class of nonlinear systems whose state spaces are smooth manifolds but possess properties similar to linear systems. Such properties can be characterized by preintegration and exact linearization with Jacobians independent of the linearization point. Non-biased IMU dynamics in navigation can be constructed into linear observed settings, leading to invariant filters with guaranteed behaviors such as local convergence and consistency. In this letter, we establish linear observed property for systems evolving on a smooth manifold through the connection structure endowed upon this space. Our key findings are the existence of linear observed systems on manifolds poses constraints on the curvature of the state space, beyond requiring the dynamics to be compatible with some connection-preserving transformations. Specifically, the flat connection case reproduces the characterization of linear observed systems on Lie groups.
|
|
14:00-14:15, Paper ThB06.3 | |
A Global Coordinate-Free Approach to Invariant Contraction on Homogeneous Manifolds |
|
Harapanahalli, Akash | Georgia Institute of Technology |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Algebraic/geometric methods
Abstract: In this work, we provide a global condition for contraction with respect to an invariant Riemannian metric on reductive homogeneous spaces. Using left-invariant frames, vector fields on the manifold are horizontally lifted to the ambient Lie group, where the Levi-Civita connection is globally characterized as a real matrix multiplication. By linearizing in these left-invariant frames, we characterize contraction using matrix measures on real square matrices, avoiding the use of local charts. Applying this global condition, we provide a necessary condition for a prescribed subset of the manifold to possibly admit a contracting system, which accounts for the underlying geometry of the invariant metric. Applied to the sphere, this condition implies that no great circle can be contained in a contraction region. Finally, we apply our results to compute reachable sets for an attitude control problem.
|
|
14:15-14:30, Paper ThB06.4 | |
Information-State Based Approach to the Optimal Output Feedback Control of Nonlinear Systems |
|
Goyal, Raman | Palo Alto Reserach Center, SRI International |
Gul Mohamed, Mohamed Naveed | Texas A&M University |
Wang, Ran | Texas A&M University |
Sharma, Aayushman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Optimal control, Reinforcement learning, Nonlinear output feedback
Abstract: This paper develops a data-based approach to the closed-loop control of nonlinear dynamical systems with a partial nonlinear observation model. We propose an ``information-state" based approach to rigorously transform the partially observed problem into a fully observed problem where the information-state consists of the past several observations and control inputs. We further show the equivalence of the transformed and the initial partially observed optimal control problems and provide the conditions to solve for the deterministic optimal solution. We develop a data-based generalization of the iterative Linear Quadratic Regulator (ILQR) for partially-observed systems using a local linear time-varying model of the information-state dynamics approximated by an Autoregressive–moving-average (ARMA) model that is generated using only the input-output data. This open-loop trajectory optimization solution is then used to design a local feedback control law, and the composite law then provides an optimum solution to the partially observed feedback design problem. The efficacy of the developed method is shown by controlling complex high dimensional nonlinear dynamical systems in the presence of model and sensing uncertainty.
|
|
14:30-14:45, Paper ThB06.5 | |
Data-Driven Modeling for Nonlinear Optimal Control |
|
Sharma, Aayushman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Identification for control, Nonlinear systems identification, Optimal control
Abstract: In this paper, we study the use of state-of-the-art nonlinear system identification techniques for the optimal control of nonlinear systems. We show that the nonlinear system identification problem is equivalent to estimating the generalized moments of an underlying sampling distribution and is bound to suffer from ill-conditioning and variance when approximating a system to high order, requiring samples combinatorial-exponential in the order of the approximation. We show that the iterative identification of ``local" linear time varying (LTV) models around the current estimate of the optimal trajectory, coupled with a suitable optimal control algorithm such as iterative LQR (ILQR), alleviates these issues and is sufficient to accurately solve the underlying optimal control problem.
|
|
14:45-15:00, Paper ThB06.6 | |
Task Decomposition for Learning Advanced Driving Skills |
|
Dallas, James | Toyota Research Institute |
Morgan, Allison | Toyota Research Institute |
Yasuda, Hiroshi | Toyota Research Institute |
Thompson, Michael | Toyota Research Institute |
Chen, Tiffany | Toyota Research Institute |
Subosits, John | Stanford University: Dynamic Design Lab |
Keywords: Human-in-the-loop control, Automotive control, Automotive systems
Abstract: The introduction of drive-by-wire systems and advances in vehicle automation have enabled new possibilities of what a vehicle can do. Research has pushed vehicle automation to a level that only skilled drivers can achieve in racing and drifting, and decoupled systems can act as a guardian, protecting the driver in extreme situations. Introducing these systems has the potential to improve safety of the car and driver combined system, but could also lead to deskilling and over-reliance by the driver. This manuscript presents an approach to address this through leveraging task decomposition to teach advanced driving skills, namely drifting a circular donut. Using Model Predictive Control, task decomposition is achieved by allowing drivers to learn steering and throttle control independently, enabling the driver to isolate learning individual skills to enhance skill acquisition. A user study was conducted with 11 participants to experimentally validate the approach. Results reflect that the group subject to task decomposition demonstrated enhanced mastery of the skill.
|
|
ThB07 |
Governor's Sq. 11 |
Parameter Estimation and Fault Diagnostics of Energy Storage Systems |
Invited Session |
Chair: Tang, Shuxia | Texas Tech University |
Co-Chair: Siegel, Jason B. | University of Michigan |
Organizer: Zhang, Dong | University of Oklahoma |
Organizer: Soudbakhsh, Damoon | Temple University |
Organizer: Roy, Tanushree | Texas Tech University |
Organizer: Espin, Jorge Esteban | University of Oklahoma |
Organizer: Siegel, Jason B. | University of Michigan |
Organizer: Tang, Shuxia | Texas Tech University |
Organizer: Dey, Satadru | The Pennsylvania State University |
Organizer: Lin, Xinfan | University of California, Davis |
Organizer: Fogelquist, Jackson | University of California, Davis |
|
13:30-13:45, Paper ThB07.1 | |
A Discrete-Time Observer for Parallel Connected Battery Packs with Nonlinear Descriptor System Dynamics (I) |
|
Lone, Jaffar Ali | Indian Institute of Technology Patna |
Drummond, Ross | University of Sheffield |
Bhaumik, Shovan | Indian Institute of Technology Patna |
Tomar, Nutan Kumar | Indian Institute of Technology Patna |
Zhang, Dong | University of Oklahoma |
Keywords: Differential-algebraic systems, Estimation, Control applications
Abstract: An observer is developed for the nonlinear descriptor system dynamics of parallel connected lithium-ion battery packs. The observer estimates the states of the individual cells in the pack with stability and existence guarantees provided through linear matrix inequalities. When evaluated on the urban dynamometer driving schedule, the proposed observer performed well, with root mean squared errors (RMSEs) of 0.0072 and 0.0054 in the state-of-charges as well as 0.3A and 0.28A in the currents for cells 1 and 2, respectively. The results also demonstrated the value of incorporating contact resistances in the model. In particular, with the inclusion of a contact resistance of 0.02 Ohm, the mismatch in currents grew by 9.69% for cell 1 and by 8.55% for cell 2. These results highlight the potential of implementing cell-level state estimation and control in large battery packs accounting for cell-to-cell variability and contact resistances.
|
|
13:45-14:00, Paper ThB07.2 | |
Identifiability Analysis of a Pseudo-Two-Dimensional Model & Single Particle Model-Aided Parameter Estimation (I) |
|
Couto, Luis Daniel | VITO NV |
Haghverdi, Keivan | VITO |
Guo, Feng | VITO |
Trad, Khiem | VITO |
Mulder, Grietus | VITO |
Keywords: Energy systems, Modeling, Identification
Abstract: This contribution presents a parameter identification methodology for the accurate and fast estimation of model parameters in a pseudo-two-dimensional (P2D) battery model. The methodology consists of three key elements. First, the data for identification is inspected and specific features herein that need to be captured are included in the model. Second, the P2D model is analyzed to assess the identifiability of the physical model parameters and propose alternative parameterizations that alleviate possible issues. Finally, diverse operating conditions are considered that excite distinct battery dynamics which allows the use of different low-order battery models accordingly. Results show that, under low current conditions, the use of low-order models achieve parameter estimates at least 500 times faster than using the P2D model at the expense of twice the error. However, if accuracy is a must, these estimated parameters can be used to initialize the P2D model and perform the identification in half of the time.
|
|
14:00-14:15, Paper ThB07.3 | |
Fault Diagnosis of Electrolyte Lithium Concentration in Lithium-Ion Batteries (I) |
|
Sepasiahooyi, Sara | Texas Tech |
Tang, Shuxia | Texas Tech University |
Keywords: Fault detection, Energy systems, Estimation
Abstract: Faults in lithium-ion batteries can diminish performance and shorten lifespan. One notable electrochemical fault arises from the loss of lithium concentration in the electrolyte, which can lead to degradation over time. This paper studies the detection and estimation of electrolyte lithium concentration degradation fault in battery cells. The proposed model-based fault detection scheme consists of two cascaded closed-loop Partial Differential Equation (PDE) observers. The first observer, state observer, estimates the distributed lithium concentration in the electrolyte. The second observer, fault estimator, utilizes the concentration estimated by the first observer to detect and estimate the electrolyte lithium concentration fault. The scheme is validated through simulations conducted on a LiFePO4 battery cell under an Urban Dynamometer Driving Schedule (UDDS) current profile.
|
|
14:15-14:30, Paper ThB07.4 | |
Adaptive Estimation of All-Solid-State Battery Temperatures with Thermal Conductivity Uncertainties (I) |
|
Ferreira, Patryck | Texas Tech University |
Tang, Shuxia | Texas Tech University |
Keywords: Estimation, Adaptive systems, Energy systems
Abstract: All-Solid-State Batteries (ASSBs) offer enhanced safety and higher energy density compared to conventional Lithium-ion Batteries (LiBs), but their thermal management is challenging due to time-varying thermal properties. The thermal behavior of ASSBs is modeled by five Ordinary Differential Equations (ODEs) representing the temperatures of the case surface (near the cathode and anode), cathode, electrolyte, and anode. These temperatures are driven by heat from the battery, derived from an electrochemical model using two Partial Differential Equations (PDEs) for Li+ ions concentration. This study presents an adaptive observer that adjusts thermal conductivities in real-time, accurately estimating ASSB temperatures. Simulations demonstrate that the observer effectively tracks time-varying conductivities, with estimation errors converging to zero and improving thermal management accuracy.
|
|
14:30-14:45, Paper ThB07.5 | |
Efficient Fault Diagnosis in Lithium-Ion Battery Packs: A Structural Approach with Moving Horizon Estimation (I) |
|
Farakhor, Amir | University of Kansas |
Wu, Di | Pacific Northwest National Laboratory |
Wang, Yebin | Mitsubishi Electric Research Labs |
Fang, Huazhen | University of Kansas |
Keywords: Fault diagnosis, Energy systems, Estimation
Abstract: Safe and reliable operation of lithium-ion battery packs depends on effective fault diagnosis. However, model-based approaches often encounter two major challenges: high computational complexity and extensive sensor requirements. To address these bottlenecks, this paper introduces a novel approach that harnesses the structural properties of battery packs, including cell uniformity and the sparsity of fault occurrences. We integrate this approach into a Moving Horizon Estimation (MHE) framework and estimate fault signals such as internal and external short circuits and faults in voltage and current sensors. To mitigate computational demands, we propose a hierarchical solution to the MHE problem. The proposed solution breaks up the pack-level MHE problem into smaller problems and solves them efficiently. Finally, we perform extensive simulations across various battery pack configurations and fault types to demonstrate the effectiveness of the proposed approach. The results highlight that the proposed approach simultaneously reduces the computational demands and sensor requirements of fault diagnosis.
|
|
14:45-15:00, Paper ThB07.6 | |
Robust Estimation of Battery State of Health Using Reference Voltage Trajectory (I) |
|
Huang, Rui | University of California, Davis |
Fogelquist, Jackson | University of California, Davis |
Lin, Xinfan | University of California, Davis |
Keywords: Energy systems, Estimation, Identification
Abstract: Accurate estimation of state of health (SOH) is critical for battery applications. Current model-based SOH estimation methods typically rely on low C-rate constant current tests to extract health parameters like solid phase volume fraction and lithium-ion stoichiometry, which are often impractical in real-world scenarios due to time and operational constraints. Additionally, these methods are susceptible to modeling uncertainties that can significantly degrade the estimation accuracy, especially when jointly estimating multiple parameters. In this paper, we present a novel reference voltage-based method for robust battery SOH estimation. This method utilizes the voltage response of a battery under a predefined current excitation at the beginning of life (BOL) as a reference to compensate for modeling uncertainty. As the battery degrades, the same excitation is applied to generate the voltage response, which is compared with the BOL trajectory to estimate the key health parameters accurately. The current excitation is optimally designed using the Particle Swarm Optimization algorithm to maximize the information content of the target parameters. Simulation results demonstrate that our proposed method significantly improves parameter estimation accuracy under different degradation levels, compared to conventional methods relying only on direct voltage measurements. Furthermore, our method jointly estimates four key SOH parameters in only 10 minutes, making it practical for real-world battery health diagnostics, e.g., fast testing to enable battery repurposing.
|
|
ThB08 |
Governor's Sq. 12 |
Modeling and Control of Sustainable Energy Systems |
Invited Session |
Chair: Scruggs, Jeff | University of Michigan |
Co-Chair: Zhang, Dong | University of Oklahoma |
Organizer: Vermillion, Christopher | University of Michigan |
Organizer: Zhang, Dong | University of Oklahoma |
Organizer: Scruggs, Jeff | University of Michigan |
|
13:30-13:45, Paper ThB08.1 | |
Model Predictive Cooling Control of Cylindrical Battery Cells through Tab and Surface Channels (I) |
|
Peprah, Godwin | Chalmers University of Technology |
Wik, Torsten | Chalmers University of Technology |
Huang, Yicun | Chalmers University of Technology |
Altaf, Faisal | Volvo Group |
Zou, Changfu | Chalmers University of Technology |
Keywords: Energy systems, Optimal control, Control applications
Abstract: Tab cooling offers more uniform temperature distribution across the battery because of high thermal conductivity, while surface cooling is more effective at removing heat due to a larger cooling contact area. This work formulates the optimal integration of tab and surface cooling methods to synergise their unique strengths. This optimal integration control problem is solved within the model predictive control (MPC) framework, leading to minimised thermal gradients and average temperature rise. The proposed MPC scheme, which includes an advanced thermal model containing the cooling system's converter switching mechanism, is evaluated against conventional side-only and base-only %electric vehicle battery cooling schemes under the urban dynamometer driving schedule. Results demonstrate significant thermal performance improvements of the proposed MPC scheme over the conventional cooling methods.
|
|
13:45-14:00, Paper ThB08.2 | |
Optimal Control of Self-Powered Systems Using Convex-Concave MPC (I) |
|
Veurink, Madelyn | University of Michigan |
Scruggs, Jeff | University of Michigan |
Keywords: Power systems, Predictive control for linear systems, Optimal control
Abstract: Self-powered control systems use harvested dis- turbance energy to actuate the controller. These systems are of interest, for example, in vibration suppression applications where the control force cannot rely on outside energy. The control input is constrained to prevent the system from ex- hausting the stored energy, and by the energy storage capacity. In this paper we use Model Predictive Control (MPC) to minimize an integral-quadratic performance objective while meeting the enforced energy constraints. The limitation on the maximum stored energy results in a non-convex control optimization problem. We apply convex-concave techniques to the non-convex constraint to conservatively bound the stored energy. It follows that convex-concave MPC can be used to find the optimal solution to these self-powered control problems. This technique is demonstrated in a simple example vibration suppression problem.
|
|
14:00-14:15, Paper ThB08.3 | |
Multi-Objective Feedback Design for Self-Powered Control (I) |
|
Shell, Jonathan | University of Michigan |
Scruggs, Jeff | University of Michigan |
Keywords: Optimal control, Constrained control, Power electronics
Abstract: In self-powered control technologies the sensors, actuators, and control algorithm are entirely powered by harvesting energy from system disturbances. Because they are energy autonomous, self-powered control systems are ideal for disturbance rejection applications where access to external power is limited, and have shown great promise in many civil, mechanical, and biomedical engineering applications. This work expands on previous efforts in the field regarding the synthesis of feedback laws for self-powered control systems. Using a passive linear time-invariant (LTI) feedback law as a basis for design, a multi-objective optimal control design method for self-powered feedback control is presented. First, frequency domain methods are employed to find an LTI self-powered control law; then, a nonlinear self-powered feedback law is designed with performance which is guaranteed to improve on that of the LTI law. The proposed methodology is demonstrated in a design example involving a tuned vibration absorber attached to a two degree-of-freedom structure.
|
|
14:15-14:30, Paper ThB08.4 | |
OTEC Supported Energy System for Offshore Fish Farming: A Bi-Level Optimization Approach for Sizing and Operation (I) |
|
Sadoughipour, Mahsan | Florida Atlantic University |
Fung, Sasha | Florida Atlantic University |
Tang, Yufei | Florida Atlantic University |
VanZwieten, James | Florida Atlantic University |
Keywords: Energy systems, Hybrid systems
Abstract: Blue economy industries like aquaculture are expanding further offshore to leverage the ocean's vast scale. However, this shift demands reliable, regular power independent of land-based grids. This study introduces a bi-level optimization framework for the design and operation of a hybrid OTEC-diesel system equipped with battery energy storage to power offshore fish farms. The upper-level optimization aims to minimize the levelized cost of energy (LCOE) and ensure continuous operation by optimizing the battery size within the constraints of energy storage. The objective of the lower-level optimization is to minimize energy waste while also addressing an environmental goal, which is to decrease the unnecessary mixing of cold and warm water in the Rankine cycle of the ocean thermal energy conversion (OTEC) system. In our study, we investigated two system configurations: first, a traditional setup using only a diesel generator; second, an OTEC/Diesel/BESS hybrid configuration was evaluated under two scenarios for different fish farm sizes. The results show that scaling up the fish farm and hybrid OTEC-diesel system with energy storage significantly lowers the LCOE. Furthermore, regulating the working fluid’s mass flow rate cuts energy waste by nearly 20% compared to single-level optimization, which reduces environmental impact, and positions this hybrid system as a sustainable and cost-effective alternative to diesel-powered fish farms.
|
|
14:30-14:45, Paper ThB08.5 | |
Hierarchical Multi-Timescale MPC for Control of Off-Grid Renewable Energy Powered Ammonia Plant with Storage (I) |
|
Tully, Zachary | Colorado School of Mines |
Johnson, Kathryn | Colorado School of Mines |
Starke, Genevieve | National Renewable Energy Laboratory |
King, Jennifer | National Renewable Energy Laboratory |
Keywords: Hierarchical control, Predictive control for linear systems, Energy systems
Abstract: Electrifying industrial processes powered by renewable energy (RE) presents unique control challenges due to the multi-timescale dynamics and signals inherent in these systems. The coupling of the fast-timescale dynamics of the RE subsystems and the slow-timescale dynamics of industrial end-use systems creates a complex control problem. Further, variability in RE resources in both fast and slow timescales introduces an additional challenge for the storage dispatch that the controller must also consider. This paper presents a multi-timescale model predictive controller (MT-MPC) designed for an off-grid RE-powered electrified ammonia plant. The MT-MPC effectively manages storage dispatch and provides dynamic control of the end-use subsystem at slow timescales while also managing control of the fast-timescale RE subsystem with the time-varying RE resource input. The MT-MPC outperforms two single-timescale MPCs designed for only the fast or slow timescales, showing that the electrified ammonia plant benefits from multi-timescale control and that the MT-MPC is a promising controller to address that need.
|
|
14:45-15:00, Paper ThB08.6 | |
Dynamic Operating Envelopes of Distribution Systems with Virtual Power Plants under Heat and Cold Waves (I) |
|
She, Buxin | Pacific Northwest National Laboratory |
Ramachandran, Thiagarajan | Pacific Northwest National Laboratory |
Marinovici, Laurentiu Dan | Pacific Northwest National Laboratory |
Wang, Wei | Pacific Northwest National Laboratory |
Adetola, Veronica | Pacific Northwest National Lab |
Keywords: Energy systems, Power systems, Optimization
Abstract: The increasing integration of distributed energy resources (DERs) and the rising frequency of extreme weather events present significant challenges to the resilience of distribution systems. Virtual Power Plants (VPPs) offer a promising solution by providing the grid with flexible demand or generation support. This flexibility can be quantified by Dynamic Operating Envelopes (DOEs), which define the upper and lower power bounds of distributed assets. This paper investigates the impact of heat and cold waves on DOEs in distribution systems that incorporate VPPs. VPPs are modeled using high-fidelity, weather-dependent simulations of load and DERs to capture how extreme temperatures affect both demand and generation. Realistic weather data is used to explore how these conditions change the system conditions and then propagate to DOE results. This paper employs a modified IEEE 123-bus system to simulate heat and cold waves in King County, Washington State. Results indicate that extreme weather events decrease the DOEs and limit system flexibility. However, VPP integration enhances operational flexibility and improves resilience, although the benefits are not uniformly distributed across all nodes in the system.
|
|
ThB09 |
Governor's Sq. 14 |
Game Theory IV |
Regular Session |
Chair: Dayanikli, Gokce | University of Illinois Urbana-Champaign |
Co-Chair: Wu, Yuchi | Shanghai University |
|
13:30-13:45, Paper ThB09.1 | |
A Communication-Efficient and Differentially-Private Distributed Generalized Nash Equilibrium Seeking Algorithm for Aggregative Games |
|
Zhao, Wenqing | Shanghai University |
Xie, Antai | Shanghai University |
Wu, Yuchi | Shanghai University |
Yi, Xinlei | College of Electronics and Information Engineering, Tongji Unive |
Ren, Xiaoqiang | Shanghai University |
Keywords: Networked control systems, Game theory, Distributed control
Abstract: This paper studies the distributed generalized Nash equilibrium seeking problem for aggregative games with coupling constraints, where each player optimizes its strategy depending on its local cost function and the estimated strategy aggregation. The information transmission in distributed networks may go beyond bandwidth capacity and eventuate communication bottlenecks. Therefore, we propose a novel communication-efficient distributed generalized Nash equilibrium seeking algorithm, in which the communication efficiency is improved by event-triggered communication and information compression methods. The proposed algorithm saves the transmitted rounds and bits of communication simultaneously. Specifically, by developing precise step size conditions, the proposed algorithm ensures provable convergence, and is proven to achieve (0,delta)-differential privacy with a stochastic quantization scheme. In the end, simulation results verify the effectiveness of the proposed algorithm.
|
|
13:45-14:00, Paper ThB09.2 | |
Hierarchical MARL with Stackelberg Games |
|
Fiscko, Carmel | Cornell University |
Yin, Haoyu | Washington University in St. Louis |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Reinforcement learning, Game theory, Markov processes
Abstract: We consider a multi-agent system under the control of a central planner (CP). We model the system as a hierarchical multi-agent reinforcement learning (MARL) problem in which a state process is influenced by both the CP's control and the agents' actions. Each agent learns a local policy to maximize the local value function, which may or may not be aligned with the CP's control objective. The CP's goal is therefore to find a global policy such that the agents learn an equilibrium joint policy that maximizes the CP's value function. We first show that this problem is equivalent to a Stackelberg game. This equivalence can be used to derive game-theoretic properties about the desired Stackelberg equilibrium and can be used to solve for optimal policies directly if a model is known. If no game model is given, then we propose a Monte Carlo (MC)-based reinforcement learning (RL) method based on the hierarchical game structure. We demonstrate that under standard RL assumptions, this method can approximate solutions to the desired Stackelberg game. This procedure is validated in simulations on synthetic games resembling social welfare problems.
|
|
14:00-14:15, Paper ThB09.3 | |
A Scalable Game Theoretic Approach for Coordination of Multiple Dynamic Systems |
|
Shibl, Mostafa | Purdue University |
Gupta, Vijay | Purdue University |
Keywords: Distributed control, Reinforcement learning, Game theory
Abstract: Learning in games provides a powerful framework to design control policies for self-interested agents that may be coupled through their dynamics, costs, or constraints. We consider the case where the dynamics of the coupled system can be modeled as a Markov potential game. In this case, distributed learning ensures agents' control policies converge to a Nash equilibrium. However, standard algorithms like natural policy gradient require global state and action knowledge, which does not scale well with more agents. We show that by limiting information flow to local neighborhoods, we can still converge to near-optimal policies. If a game’s global cost function can be decomposed into local costs that align with agent policies at equilibrium, this approach benefits team coordination. We demonstrate this with a sensor coverage problem.
|
|
14:15-14:30, Paper ThB09.4 | |
A Stackelberg Mean Field Game for Green Regulator with a Large Number of Prosumers |
|
Bichuch, Maxim | Johns Hopkins University |
Dayanikli, Gokce | University of Illinois Urbana-Champaign |
Lauriere, Mathieu | NYU Shanghai |
Keywords: Mean field games, Game theory, Control applications
Abstract: We model a Stackelberg game in a power market with rational consumers and a benevolent regulator as a mean-field game. The Stackelberg leader, who is a government regulator, sets the grid distribution fees so as to maximize the total welfare of the consumers, while also ensuring the solvency of the electricity producers and while satisfying renewable production targets. The Stackelberg followers, who are rational prosumers of electricity, maximize their personal utilities by choosing their individual Photovoltaics investments that provides an alternative to buying electricity from the grid, and hence can also produce electricity. With the representative prosumer’s demand evolving as an Ornstein-Uhlenbeck process, we find a closed form mean-field game approximation to prosumer’s optimal strategy, and use that to calculate the optimal fees set by the regulator. Using these we numerically investigate and explain the influence of various market conditions on the optimal distribution fees.
|
|
14:30-14:45, Paper ThB09.5 | |
On Logit Dynamics for Multiplayer Trust Game |
|
Mangalindan, Dong Hae | Michigan State Unversity |
Hota, Ashish R. | Indian Institute of Technology (IIT), Kharagpur |
Srivastava, Vaibhav | Michigan State University |
Keywords: Modeling, Game theory
Abstract: We explore the evolution of trust and trustworthiness in bounded rational agents via the Logit dynamics. We focus on the N-player trust game, where multiple investors and trustees interact and make decisions based on their level of trust and trustworthiness, respectively. We characterize the equilibria of the dynamics under different parameter regimes and establish their stability properties. We show how parameters associated with rationality, incentives, and synergy influence the emergence of trustworthy behavior. Additionally, we show when investors’ contributions are synergistic, then the dynamics may exhibit a limit cycle emerging from a Hopf bifurcation. We illustrate our theoretical results and provide additional insights into these dynamics through numerical simulations.
|
|
14:45-15:00, Paper ThB09.6 | |
Higher-Order Strategy Evolution of Cooperation on Arbitrary Hypergraphs |
|
Wang, Dini | Tongji University |
Yi, Peng | Tongji University |
Keywords: Evolutionary computing, Network analysis and control, Game theory
Abstract: Cooperation plays a fundamental role is societal and biological evolution, and the population structure can significantly influence the dynamics of cooperative behaviors. While mathematical frameworks have existed for studying evolutionary games on graphs with pairwise interactions, exploration of strategy evolution on hypergraphs is relatively limited despite the widespread occurrence of multi-group (higher-order) interactions. Here, we propose the higher-order strategy update pattern based on two-stage selection of a imitating object, governing the propagation of cooperation on hypergraphs. Aiming at two specific update mechanisms, we establish mathematically rigorous conditions for cooperation in public goods games on any weighted hypergraph, achieved by adapting the higher-order random walk into the coalescence theory. These analytical conditions are in good agreement with the experimental consequences of Monte Carlo simulations, regarding both the random hypergraphs and the empirical higher-order networks. Furthermore, we surprisingly identify that one of the higher-order rule of strategy updates – selecting a high-performing group and then selecting a random member in it for imitation – can profoundly promotes the emergence of cooperation. These findings underscore a crucial role of higher-order interactions in modeling and controlling social and biological systems, favoring collective cooperation.
|
|
ThB10 |
Governor's Sq. 16 |
Autonomous Air Mobility and Sensing |
Tutorial Session |
Chair: Frew, Eric W. | University of Colorado, Bolder |
Organizer: Frew, Eric W. | University of Colorado, Bolder |
|
13:30-15:00, Paper ThB10.1 | |
Autonomous Air Mobility and Sensing (I) |
|
Frew, Eric W. | University of Colorado, Bolder |
Keywords: Autonomous systems, Cooperative control, Aerospace
Abstract: The Center for Autonomous Air Mobility and Sensing (CAAMS) is a new NSF Industry University Collaborative Research Center (IUCRC) created by the University of Colorado Boulder, Brigham Young University, Penn State University, Sinclair Community College, Texas A&M University, the University of Michigan, and Virginia Tech to address the unmet, precompetitive research needs of the aviation industry as it moves toward the design and deployment of increasingly autonomous systems. Together with industry and government partners, CAAMS researchers are developing solutions that address the critical technical challenges relevant to autonomous air mobility and sensing. This tutorial will describe the vision, goals, and structure of CAAMS. The tutorial will identify critical research thrust areas where needed advances will help the aviation industry close significant gaps in performance, safety, reliability, and sustainability. Finally, the tutorial will share results from current research and development efforts that are closing these gaps.
|
|
ThB11 |
Governor's Sq. 17 |
Distributed Systems and Control |
Regular Session |
Chair: Basilio, Joao Carlos | Federal University of Rio De Janeiro |
Co-Chair: Nurbekyan, Levon | Emory |
|
13:30-13:45, Paper ThB11.1 | |
Multilateral Monotonic Concession Protocol for Task Negotiation |
|
Kim, Donghae | The University of Texas at Austin |
Akella, Maruthi | The University of Texas at Austin |
Keywords: Decentralized control, Agents-based systems, Game theory
Abstract: In this paper, we propose a multilateral negotiation protocol for multi-agent task allocation problems. The protocol extends the Monotonic Concession Protocol (MCP), which was originally limited to two-player settings. For more than two players settings, we generalize several key concepts of MCP, including concession, agreement, and risk. Specifically, by introducing a notion of generalized risk, agents can systematically rank among various task allocations. The generalized risk metric is then employed to form a Nash strategy, leading the negotiation towards a task partition that maximizes the Nash product of the agents' utilities. The proposed mechanism ensures that the negotiation process is fully distributed, with agents acting in a completely multilateral manner, meaning their actions, roles, state, and information are symmetric. We demonstrate the performance and features of the proposed protocol through a three-player task negotiation example, which shows that the negotiation monotonically converges towards a task allocation that maximizes the Nash product as the negotiation round progresses.
|
|
13:45-14:00, Paper ThB11.2 | |
Modeling Buffer Occupancy in Bittide Systems |
|
Lall, Sanjay | Stanford University |
Spalink, Tammo | Google |
Keywords: Decentralized control, Distributed control
Abstract: The bittide mechanism enables logically synchronous computation across distributed systems by leveraging the continuous frame transmission inherent to wired networks such as Ethernet. Instead of relying on a global clock, bittide uses a decentralized control system to adjust local clock frequencies, ensuring all nodes operate with a consistent notion of time by utilizing elastic buffers at each node to absorb frequency variations. This paper presents an analysis of the steady-state occupancy of these elastic buffers, a critical factor influencing system latency. Using a fluid model of the bittide system, we prove that buffer occupancy converges and derive an explicit formula for the steady-state value in terms of system parameters, including network topology, physical latencies, and controller gains. This analysis provides valuable insights for optimizing buffer sizes and minimizing latency in bittide-based distributed systems.
|
|
14:00-14:15, Paper ThB11.3 | |
Kernel Expansions for High-Dimensional Mean-Field Control with Non-Local Interactions |
|
Vidal, Alexander | Colorado School of Mines |
Wu Fung, Samy | Colorado School of Mines |
Osher, Stanley | University of California, Los Angeles |
Tenorio, Luis | Colorado School of Mines |
Nurbekyan, Levon | Emory |
Keywords: Optimal control, Distributed control, Numerical algorithms
Abstract: Mean-field control (MFC) problems aim to find the optimal policy to control massive populations of interacting agents. These problems are crucial in areas such as economics, physics, and biology. We consider the non-local setting, where the interactions between agents are governed by a suitable kernel. For N agents, the interaction cost has O(N^2) complexity, which can be prohibitively slow to evaluate and differentiate when N is large. To this end, we propose an efficient primal-dual algorithm that utilizes basis expansions of the kernels. The basis expansions reduce the cost of computing the interactions, while the primal-dual methodology decouples the agents at the expense of solving for a moderate number of dual variables. We also demonstrate that our approach can further be structured in a multi-resolution manner, where we estimate optimal dual variables using a moderate N and solve decoupled trajectory optimization problems for large N. We illustrate the effectiveness of our method on an optimal control of 5000 interacting quadrotors.
|
|
14:15-14:30, Paper ThB11.4 | |
Distributed Personalized Optimization on Riemannian Manifolds with Gradient Tracking |
|
Zhao, Yixian | Zhejiang University |
Huang, Yan | KTH - Kungliga Tekniska Högskolan |
Zhang, Haochang | Shandong University |
Xu, Jinming | Zhejiang University |
Keywords: Agents-based systems, Optimization algorithms, Distributed control
Abstract: In this paper, we consider a distributed Riemannian optimization problem with personalization, where cost functions are defined on manifolds and rely on a decision variable comprising both global (shared) and local (private) components. This problem structure arises in various contexts where the global part encapsulates common characteristics across nodes, while the local part represents personalized features specific to individual nodes within the network. To solve this problem, we propose a personalized distributed Riemannian gradient tracking method where each node updates its private variables locally while performing distributed gradient tracking on the shared component with neighboring nodes to facilitate coordination, leading to greater algorithm flexibility and robustness against data heterogeneity. Leveraging the Riemannian consensus mechanism and the law of cosines for non-Euclidean geometry, we prove that the proposed algorithm achieves a suboptimality gap of O(1/sqrt{K}) for geodesically convex objective functions. Numerical examples are conducted to verify the effectiveness of the proposed algorithm within the realm of Riemannian optimization.
|
|
14:30-14:45, Paper ThB11.5 | |
Repeated Fault Diagnosability of Discrete Event Systems with Decentralized Structure |
|
Ottoni, Guilherme | Universidade Federal Do Rio De Janeiro |
Basilio, Joao Carlos | Federal University of Rio De Janeiro |
Keywords: Automata, Discrete event systems, Fault detection
Abstract: This paper extends the results on repeated fault diagnosability of monolithic discrete-event systems to systems with decentralized structure. To this end, the definition of K-codiagnosability, which basically consists of ensuring that at least one local diagnoser is able to accurately determine whether a fault event has occurred at least K times is introduced. Moreover, a necessary and sufficient condition for K-codiagnosability of regular languages is presented and an algorithm with polynomial-time complexity is proposed for its verification. A numerical examples illustrates the efficiency of the proposed method.
|
|
14:45-15:00, Paper ThB11.6 | |
Structure-Preserving Uncertainty Quantification and Control of Population Balance Models |
|
Tan, Wallace | Gian Yion |
Ganko, Krystian | Massachusetts Institute of Technology |
Braatz, Richard D. | Massachusetts Institute of Technology |
Keywords: Distributed parameter systems, Uncertain systems, Predictive control for nonlinear systems
Abstract: This article describes a systematic method for propagating and controlling the effects of parametric uncertainties in population balance models (PBMs) using polynomial chaos expansions. PBMs are an important element in the model-based control of many particulate manufacturing processes. Direct stochastic Galerkin projections of PBMs with uncertain parameters allow for structure-preserving uncertainty quantification of the population distribution across the full intrinsic variable domain. With the proposed method, more accurate real-time risk analysis to ensure safe and controlled particulate manufacturing under distributed parameter system model constraints becomes accessible on compute-limited hardware. The method in particular shows a considerable computational speedup and memory usage decrease relative to conventional sampling-based methods, enabling fast on-line model predictive control implementations. The method also retains full population distribution information in contrast to less-accurate, structure-reducing techniques such as the method of moments. Throughout, we discuss approaches to mitigate common numerical accuracy losses appearing in the discretization of PBMs.
|
|
ThB12 |
Plaza Court 1 |
Vehicle Automation and ADAS |
Invited Session |
Chair: Soudbakhsh, Damoon | Temple University |
Co-Chair: Zhao, Junfeng | Arizona State University |
Organizer: Soudbakhsh, Damoon | Temple University |
Organizer: Zhao, Junfeng | Arizona State University |
Organizer: Nazari, Shima | UC Davis |
|
13:30-13:45, Paper ThB12.1 | |
A Traffic Adaptive Physics-Informed Learning Control for Energy Savings of Connected and Automated Vehicles (I) |
|
Shao, Yunli | University of Georgia |
Li, Zongtan | University of Georgia |
Keywords: Optimal control, Automotive control, Automotive systems
Abstract: Model predictive control has emerged as an effective approach for real-time optimal control of connected and automated vehicles. However, the nonlinear dynamics of vehicle and traffic systems make accurate modeling and real-time optimization challenging. Learning-based control offers a promising alternative through offline control policy learning and low computational cost real-time inference. In vehicle and traffic systems, designing an effective learning control framework is nontrivial, as optimal control depends on both ego vehicle's state and predicted states of surrounding vehicles. This work proposes a traffic adaptive learning control method that allows the control strategy to intelligently adapt to varying traffic conditions. By leveraging an augmented state-space formulation, the predicted states of surrounding vehicles are transcribed to only alter initial conditions. This approach ensures that the system dynamics remain independent of surrounding vehicle trajectories to facilitate learning of a time-invariant system. Additionally, a physics-informed learning control framework is introduced that integrates the value function and its derivatives into a unified loss function. By incorporating system dynamics information into the learning process, the proposed method reduces the required training data and time while enhancing robustness and efficiency. The proposed control method is applied to car-following scenarios in real-world data calibrated simulation environments. The results show that this learning control approach alleviates real-time computational demands while achieving car-following behaviors comparable to model-based methods, leading to a 9% energy savings.
|
|
13:45-14:00, Paper ThB12.2 | |
Large-Spacing Truck Platooning under Windy Conditions (I) |
|
Jiang, Luo | University of Alberta |
Shahbakhti, Mahdi | University of Alberta |
Keywords: Multivehicle systems, Predictive control for nonlinear systems
Abstract: Large-spacing truck platooning strikes a balance between maintaining the benefits of platooning—such as the fuel efficiency and emissions reduction—while addressing safety, flexibility, and operational challenges of close-spacing systems. To improve the performance of large-spacing truck platooning under windy conditions, a nonlinear model predictive controller (NMPC) is developed. This controller ensures platooning safety while optimizing fuel efficiency and reducing tailpipe nitrogen oxides (NOx) emissions. Simulations of a two-truck platoon with a 3-sec time gap under windy conditions were conducted, using models validated with on-road experimental data. The results demonstrate that the proposed NMPC effectively keeps spacing errors within the preset safety buffer. Moreover, the follower truck achieves a fuel saving of 5.4% in the Alberta Highway 2 driving cycle with headwinds, and tailpipe NOx emissions are reduced by up to 56.8% by suppressing rapid engine torque changes. This study offers valuable insights into the feasibility of deploying large-spacing platoons in windy environments.
|
|
14:00-14:15, Paper ThB12.3 | |
Control Barrier Functions for Shared Control and Vehicle Safety (I) |
|
Dallas, James | Toyota Research Institute |
Talbot, John | Toyota Research Institute |
Suminaka, Makoto | Toyota Research Institute |
Thompson, Michael | Toyota Research Institute |
Lew, Thomas | Toyota Research Institute |
Orosz, Gabor | University of Michigan |
Subosits, John | Stanford University: Dynamic Design Lab |
Keywords: Automotive control, Constrained control, Human-in-the-loop control
Abstract: This manuscript presents a control barrier function based approach to shared control for preventing a vehicle from entering the part of the state space where it is unrecoverable. The maximal phase recoverable ellipse is presented as a safe set in the sideslip angle-yaw rate phase plane where the vehicle's state can be maintained. An exponential control barrier function is then defined on the maximal phase recoverable ellipse to promote safety. Simulations demonstrate that this approach enables safe drifting,that is, driving at the handling limit without spinning out. Results are then validated for shared control drifting with an experimental vehicle in a closed course. The results show the ability of this shared control formulation to maintain the vehicle's state within a safe domain in a computationally efficient manner, even in extreme drifting maneuvers.
|
|
14:15-14:30, Paper ThB12.4 | |
Truck Fleet Coordination for Warehouse Trailer Management by Temporal Logic with Energy Constraints (I) |
|
Cardona, Gustavo | Lehigh University |
Vasile, Cristian Ioan | Lehigh University |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Multivehicle systems, Formal verification/synthesis, Automotive control
Abstract: We consider the coordination of a fleet of tractor trucks to manage trailers in a large warehouse complex and propose an approach that leverages Metric Temporal Logic (MTL) to describe missions to be executed. Each mission includes multiple tasks, such as reaching a trailer, connecting to it, moving it to a sequence of specific warehouse regions, such as loading docks, internal holding areas, and departure parking lots, and eventually disconnecting from it. The electric-powered tractor trucks must also be recharged by visiting charging stations. The MTL formulation avoids an operator manually designing a mission specification, which can quickly become unfeasible with many requests and possible assignments of tractor trucks. MTL specifications and motion dynamics are formulated as a mixed integer linear programming (MILP) approach, where the cost function includes performance objectives such as minimizing the trailer motions and energy-efficient usage. Since missions are added and removed during operation and to also reduce the computation time, we modify the method to allow for a receding horizon approach that allows for partial satisfaction of the MTL specification and uses the cost function to favor the progress towards completion of partially satisfied specifications. We compare different MILP formulations in simulations.
|
|
14:30-14:45, Paper ThB12.5 | |
Data-Driven Robust Control for Multi-Fuel Compression Ignition Engines (I) |
|
Govind Raju, Sathya Aswath | University of Minnesota - Twin Cities |
Sun, Zongxuan | University of Minnesota |
Kim, Kenneth | DEVCOM Army Research Laboratory |
Kweon, Chol-Bum | DEVCOM Army Research Laboratory |
Keywords: Automotive control, Robust control, Machine learning
Abstract: Designing controllers for obtaining optimal performance from a multi-fuel compression ignition engine is a challenging problem. Multiple actuators are used to achieve desired combustion phasing across various operating conditions. Traditionally, feedforward maps are used, which are built by performing experiments at different operating conditions in steady-state. In recent years, the use of data-driven models to construct FF maps has gained traction. However, performance using FF maps while moving between steady-state points and in the presence of disturbance is not guaranteed. This paper explores the design of feedback controllers for reducing the transients observed due to uncertain actuator dynamics while moving between steady-state points and in the presence of disturbance. Robust control framework was used for controller design, and simulations representing scenarios that can be seen during the engine operation were used to demonstrate the effectiveness of the developed controllers.
|
|
14:45-15:00, Paper ThB12.6 | |
Lateral and Longitudinal Control of an Autonomous Unicycle (I) |
|
Vizi, Mate Benjamin | Budapest University of Technology and Economics |
Orosz, Gabor | University of Michigan |
Takacs, Denes | Budapest University of Technology and Economics |
Stepan, Gabor | Budapest University of Technology and Economics |
Keywords: Nonholonomic systems, Modeling, Mechanical systems/robotics
Abstract: Trajectory tracking with an autonomous unicycle is considered in three-dimensional space. It is shown that with the appropriate choice of pseudo-velocities the lateral and longitudinal dynamics and control can be decoupled at the linear level. Linear state feedback controllers are designed separately for lateral and longitudinal subsystems and these controllers are tested simultaneously for the nonlinear model via numerical simulations.
|
|
ThB13 |
Plaza Court 2 |
Optimization III |
Regular Session |
Chair: Kia, Solmaz S. | University of California Irvine (UCI) |
Co-Chair: Vasak, Mario | University of Zagreb Faculty of Electrical Engineering and Computing |
|
13:30-13:45, Paper ThB13.1 | |
High-Temperature Measurement Method Based on Ensemble Learning with Combined Dual-Color and Tri-Color Colorimetric Thermometry |
|
Tan, Xutong | University of Electronic Science and Technology of China |
Yin, Chun | University of Electronic Science and Technology of China |
Huang, Xuegang | Aerodynamics Institute, China Aerodynamics Research and Developm |
Dadras, Sara | Company |
Liu, Junyang | School of Automation Engineering, University of Electronic Scien |
Keywords: Optimization, Machine learning, Aerospace
Abstract: Rapid and accurate measurement of the temperature field is crucial in temperature control systems. However, conventional CCD-based thermometry treats the choice between dual-channel and tri-channel information as an either-or decision. To combine the advantages of both methods, this paper proposes an ensemble learning-based two-color and three-color joint thermometry method. The entire process, from spectral radiation reception to pixel intensity generation, was analyzed to understand its impact on selecting the optimal thermometry formula. By integrating channel reliability, noise parameters, and coefficients of variation in both spatial and colorimetric domains, a random forest model was used to predict the optimal thermometry formula. Experimental results demonstrate that the proposed method effectively selects the optimal thermometry method for each thermographic image.
|
|
13:45-14:00, Paper ThB13.2 | |
On the Convergence and Implementation of High-Order Primal Dual Algorithms for Affine Constrained Convex Optimization |
|
Tian, Qiuchen | Zhejiang University |
Zhao, Yixian | Zhejiang University |
Xu, Jinming | Zhejiang University |
Chai, Li | Zhejiang University |
Keywords: Optimization algorithms, Optimization, Agents-based systems
Abstract: Recently, Nesterov proposed an implementable tensor method for unconstrained convex optimization problems, and showed that the p-th order algorithm converges at a rate of O( epsilon^{-1/p+1} ) , where p is the degree of Taylor approximation. In this paper, we extend this high-order method to affine constrained convex optimization problems. Under the same assumption for the objective function, we show that the p-th order primal-dual algorithm converges. Moreover, we introduce two inner-loop algorithms for the practical implementation of the proposed high-order primal-dual method. Extensive numerical experiments show that, as long as the step size of dual gradient ascent updates is not too small, the proposed high-order primal-dual method outperforms vanilla primal-dual methods with low-order primal updates. However, if using optimized number of inner loops without accurately solving the inner problem in primal updates, algorithms with 2nd- and 3rd-order primal updates yield comparable results.
|
|
14:00-14:15, Paper ThB13.3 | |
Projected Forward Gradient-Guided Frank-Wolfe Algorithm Via Variance Reduction |
|
Rostami, Mohammadreza | University of California, Irvine |
Kia, Solmaz S. | University of California Irvine (UCI) |
Keywords: Optimization, Optimization algorithms, Machine learning
Abstract: This paper aims to enhance the use of the Frank-Wolfe (FW) algorithm for training deep neural networks. Similar to any gradient-based optimization algorithm, FW suffers from high computational and memory costs when computing gradients for DNNs. This paper introduces the application of the recently proposed projected forward gradient (Projected-FG) method to the FW framework, offering reduced computational cost similar to backpropagation and low memory utilization akin to forward propagation. Our results show that trivial application of the Projected-FG introduces non-vanishing convergence error due to the stochastic noise that the Projected-FG method introduces in the process. This noise results in an non-vanishing variance in the Projected-FG estimated gradient. To address this, we propose a variance reduction approach by aggregating historical Projected-FG directions. We demonstrate rigorously that this approach ensures convergence to the optimal solution for convex functions and to a stationary point for non-convex functions. These convergence properties are validated through a numerical example, showcasing the approach’s effectiveness and efficiency.
|
|
14:15-14:30, Paper ThB13.4 | |
On the O(1/k) Convergence of Distributed Gradient Methods under Random Quantization |
|
Dutta, Amit | Virginia Polytechnic Institute and State University |
Doan, Thinh T. | University of Texas at Austin |
Keywords: Optimization algorithms, Optimization, Cooperative control
Abstract: We revisit the so-called distributed two-time-scale stochastic gradient method for solving a strongly convex optimization problem over a network of agents in a bandwidth-limited regime. In this setting, the agents can only exchange the quantized values of their local variables using a limited number of communication bits. Due to quantization errors, the existing best known convergence results of this method can only achieve a suboptimal rate mathcal{O}(1/sqrt{k}), while the optimal rate is mathcal{O}(1/k) under no quantization, where k is the time iteration. The main contribution of this paper is to address this theoretical gap, where we study a sufficient condition and develop an innovative analysis and step-size selection to achieve the optimal convergence rate mathcal{O}(1/k) for the distributed gradient methods given any number of quantization bits. We provide numerical simulations to illustrate the effectiveness of our theoretical results.
|
|
14:30-14:45, Paper ThB13.5 | |
Sequential Linear Programming with Adaptive Linearization Error Limits for All-Time Feasibility |
|
Leko, Dorijan | University of Zagreb Faculty of Electrical Engineering and Compu |
Vasak, Mario | University of Zagreb Faculty of Electrical Engineering and Compu |
Keywords: Optimization algorithms, Optimization, Numerical algorithms
Abstract: This paper presents an enhanced Trust Region Method (TRM) for Sequential Linear Programming (SLP) designed to improve the initial feasible solution to a constrained nonlinear programming problem while maintaining the interim solutions feasibility throughout the SLP iterations. The method employs a polytopic sub-approximation of the feasible region, defined around the interim solution as a level set based on variable limits for the linearization error. This polytopic feasible region is established by using a trust region that ensures that maximum limits of the linearization errors are respected. The method adaptively adjusts the size of the feasible region during iterations to achieve convergence to a local optimum by employing variable linearization error limits. Local convergence is attained by reducing the size of the trust radius. A case study illustrates the effectiveness of the proposed method, which is compared to the benchmark TRM that uses heuristic limits on the permissible changes in manipulated variables.
|
|
14:45-15:00, Paper ThB13.6 | |
Local Linear Convergence of Infeasible Optimization with Orthogonal Constraints |
|
Sun, Youbang | Northeastern University |
Chen, Shixiang | University of Science and Technology of China |
Garcia, Alfredo | Texas A&M University |
Shahrampour, Shahin | Northeastern University |
Keywords: Optimization algorithms, Optimization
Abstract: Many classical and modern machine learning algorithms require solving optimization tasks under orthogonality constraints. Solving these tasks with feasible methods requires a gradient descent update followed by a retraction operation on the Stiefel manifold, which can be computationally expensive. Recently, an infeasible retraction-free approach, termed the landing algorithm, was proposed as an efficient alternative. Motivated by the common occurrence of orthogonality constraints in tasks such as principle component analysis and training of deep neural networks, this paper studies the landing algorithm and establishes a novel linear convergence rate for smooth non-convex functions using only a local Riemannian PŁ condition. Numerical experiments demonstrate that the landing algorithm performs on par with the state-of-the-art retraction-based methods with substantially reduced computational overhead.
|
|
ThB14 |
Plaza Court 3 |
Spacecraft Control |
Regular Session |
Chair: Taheri, Ehsan | Auburn University |
Co-Chair: Makumi, Wanjiku A. | University of Florida |
|
13:30-13:45, Paper ThB14.1 | |
Fractional PID Attitude Control of Multi-Agent Rigid Body Systems Using Rotation Matrices (I) |
|
Maadani, Mohammad | University of Arizona |
Butcher, Eric | University of Arizona |
Keywords: Cooperative control, PID control, Networked control systems
Abstract: A fractional proportional-integral-derivative (PID) attitude controller for multi agent rigid-body systems is proposed in order to achieve attitude synchronization or balancing. The controller is designed on the tangent bundle of the special orthogonal group (SO(3)) and employs states comprising the rotation matrices and angular velocities of the bodies along with proportional, fractional derivative, and fractional integral feedback of these states. The performance of the closed-loop responses are evaluated numerically for both cases of synchronization and balancing.
|
|
13:45-14:00, Paper ThB14.2 | |
Optimal Multi-Spacecraft Refueling Planning for Cislunar Operations Using Multi-Fidelity Models (I) |
|
Rommel, Quentin | University of Texas at Austin |
Hibbard, Michael | University of Texas, Austin |
Chubick, John | Westwood High School |
Scheeres, Daniel J. | The University of Colorado |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Spacecraft control, Markov processes, Autonomous systems
Abstract: As space activities expand within the cislunar environment, developing efficient refueling strategies becomes essential for sustaining long-term missions. The multi-spacecraft refueling problem focuses on optimizing fuel consumption to extend mission lifetimes. A multi-fidelity modeling approach addresses varying levels of precision, using the two-body problem for initial trajectory estimation, the circular restricted three-body problem for reference orbits and transfers, and the bicircular restricted four-body problem with solar radiation pressure for realistic station-keeping costs. The refueling problem is formulated as an infinite-horizon Markov decision process (MDP), optimizing the refueling strategy while preventing fuel depletion. The refueling process consists of two phases: first, the refueler transfers into a temporary phasing orbit, followed by a final phasing maneuver to rendezvous with the target spacecraft. We map the available transfers between orbits using a fast sampling technique and select the phasing orbit to minimize fuel consumption within a limited synodic period. The case study samples orbits from the Lyapunov and Halo families around the Lagrange points of the Earth-Moon system, L1, L2, and L3. Simulations demonstrate that the MDP-based planning achieves a 36% reduction in fuel consumption compared to a greedy strategy.
|
|
14:00-14:15, Paper ThB14.3 | |
Aerocapture Guidance for Augmented Bank Angle Modulation (I) |
|
Sonandres, Kyle | MIT |
Palazzo, Thomas | Draper |
How, Jonathan P. | MIT |
Keywords: Spacecraft control, Optimal control
Abstract: This paper presents an optimal control solution for an aerocapture vehicle with two control inputs, bank angle and angle of attack, referred to as augmented bank angle modulation (ABAM). We derive the optimal control profiles using Pontryagin’s Minimum Principle, validate the result numerically using the Gauss pseudospectral method (implemented in GPOPS), and introduce a novel guidance algorithm, ABAMGuid, for in-flight decision making. High-fidelity Monte Carlo simulations of a Uranus aerocapture mission demonstrate that ABAMGuid can greatly improve capture success rates and reduce the propellant needed for orbital correction following the atmospheric pass.
|
|
14:15-14:30, Paper ThB14.4 | |
Adaptive Satellite Attitude Control with Coulombic Actuator Using Backstepping Approach |
|
Saathvika, Kasukurthi | Indian Institute of Technology Bombay |
Das, Arya | Indian Institute of Technology Kanpur |
Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
Giri, Dipak Kumar | IIT Kanpur |
Keywords: Spacecraft control, Adaptive control, Lyapunov methods
Abstract: The stability and accuracy of satellite missions in space are significantly reliant on satellite attitude control. This paper presents a comprehensive study of satellite attitude tracking using Coulombic actuators and back-stepping adaptive control using signal-chasing analysis. Mathematical models which describe the dynamics and kinematics of satellite motion, and the features of the Coulombic actuator are discussed. The Lyapunov function is used to show that the closed-loop system is stable and that it is converging toward the desired equilibrium point. To manage parameter uncertainty, the backstepping adaptive control approach is used, assuring robustness and stability. Finally, extensive simulation results are presented to evaluate the performance of the proposed approach.
|
|
14:30-14:45, Paper ThB14.5 | |
Application of Costate Mapping for Enforcing Classical Orbital Elements Boundary Conditions in Orbit Transfer Maneuvers |
|
Taheri, Ehsan | Auburn University |
Keywords: Optimal control, Spacecraft control, Constrained control
Abstract: In astrodynamics and space flight mechanics problems, it is known that the choice of coordinates (and/or elements) has a significant influence on the convergence performance of the Hamiltonian/indirect boundary-value problems (HBVPs). It is oftentimes required to enforce the orbit-departure (or orbit-insertion) boundary conditions in terms of the classical orbital elements (COEs), which are more intuitive compared to the modified equinoctial elements (MEEs). In trade studies, it is quite common to seek an answer to the following question: What is the best orbit geometry to which a spacecraft can be transferred? We propose an easy-to-implement approach to enforcing orbit-departure/insertion transversality conditions using the costate mapping theory. This approach allows practitioners to formulate HBVPs using the set of MEEs (which is advantageous from a computational performance point of view) while enforcing the boundary conditions using the more intuitive COEs. Application of the method is demonstrated for generating minimum-fuel low-thrust trajectories between geocentric elliptical and near-circular orbits.
|
|
14:45-15:00, Paper ThB14.6 | |
Underactuated Spacecraft Detumbling Using Predictive Cost Adaptive Control |
|
Vander Schaaf, Jacob | University of Michigan |
Auerbach, Samuel | University of Michigan |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Adaptive control, Spacecraft control, Closed-loop identification
Abstract: We consider detumbling for a rigid spacecraft with 3, 2, or 1 torque actuators. The inertia matrix of the spacecraft is assumed to be unknown, and the axes of the torque actuators and angular-rate sensors are assumed to be mutually orthogonal but are otherwise unknown. Predictive cost adaptive control (PCAC) is applied to this system using online system identification and receding-horizon optimization. PCAC is applied to Euler's equation as a sampled-data controller. The goal of this numerical investigation is to assess the effectiveness of PCAC for fully and underactuated detumbling with high modeling uncertainty and hard constraints on the torque magnitude and torque rate.
|
|
ThB15 |
Plaza Court 6 |
Biomass Production and Wastewater Treatment Using Microalgae: Modelling and
Control Challenges |
Tutorial Session |
Chair: Guzman, Jose Luis | University of Almeria |
Co-Chair: Berenguel, Manuel | University of Almeria |
Organizer: Guzman, Jose Luis | University of Almeria |
Organizer: Berenguel, Manuel | University of Almeria |
|
13:30-14:20, Paper ThB15.1 | |
Microalgae Production at Industrial Scale: Modelling and Control Challenges (I) |
|
Guzman, Jose Luis | University of Almeria |
Berenguel, Manuel | University of Almeria |
Rodríguez-Miranda, Enrique | University of Almeria |
Acien, F. Gabriel | University of Almeria |
Keywords: Process Control, Chemical process control, Biotechnology
Abstract: The 21st century society faces significant sustainability challenges, including climate change, resource depletion, and environmental degradation. Microalgae have emerged as a versatile and sustainable solution to mitigate these problems. The microalgae production process involves harnessing their ability to convert sunlight, carbon dioxide, and nutrients (contaminants in wastewater media or agriculture effluents) into valuable biomass through photosynthesis, thus providing a synergy between biomass production and environmental remediation. When microalgae are produced, the culture conditions, including temperature, light intensity, nutrient concentration, dissolved oxygen, and pH, are critical factors that must be carefully controlled to ensure optimal microalgae growth. The growth dynamics of microalgae is influenced by a multitude of interacting factors that, together with the biological nature of microalgae and the variability of weather conditions, pose a significant barrier to the implementation of effective control algorithms. This tutorial paper summarizes the results of more than 20 years of experience working in the scale-up and development of modeling, control, and optimization methods for microalgae production systems. Theoretical and experimental results at industrial scale will be presented to show the potential of microalgae production systems and how control engineering solutions can contribute to make them more competitive in the market for bioproduct generation and wastewater regeneration.
|
|
14:20-14:40, Paper ThB15.2 | |
A Novel Sensor for Monitoring and Control of Biomass Concentration in Raceway Photobioreactors (I) |
|
Gonzalez-Hernandez, Jose | University of Almeria |
Guzman, Jose Luis | University of Almeria |
Berenguel, Manuel | University of Almeria |
Acien, F. Gabriel | University of Almeria |
Keywords: Chemical process control, Process Control
Abstract: When optimization control approaches are developed to maximize biomass production in open photobiorreactos, one of the main challenges is to have reliable online sensors to measure the biomass concentration in real time. Typically, biomass concentration is measured using turbidimeters, which allow the concentration of the culture to be related to its turbidity. These techniques are inadequate in open reactors due to the high level of contamination that these systems are subject to, and the concentration estimation is strongly affected. This presentation will introduce a novel sensor for monitoring and control of biomass concentration suitable for open photobioreactors. The sensor includes a procedure for measuring biomass concentration in microalgae cultures that transforms a measured absorption signal into the final biomass concentration of the culture. The device and procedure allow online measurement of the concentration of microalgae biomass (including cyanobacteria) in microalgae cultures at any scale, from laboratory to closed reactors of 3,000 L and open reactors of up to 100 m 3 in volume. The developed device presents a high robustness against the contamination that occurs in open reactors. An international patent was obtained for this sensor, which underlines its relevance and potential impact on the sector. In this presentation, not only the biomass estimator device is presented, but also its validation in a real semi-industrial microalgae plant, what supports its effectiveness in real operating conditions. Moreover, the integration of culture concentration measurement into the control algorithms will be represented, demonstrating an important advance in the industrial automation of this type the microalgae production systems. This work has been financed by the following projects: PID2023-150739OB-I00 project financed by the Spanish Ministry of Science and also by the European Union (Grant agreement ID: 101060991, REALM).
|
|
14:40-15:00, Paper ThB15.3 | |
Model-Based Optimization and Control of Solar Photo-Fenton Plants As Complement to Microalgae Processes for Microcontaminants Removal in Urban Wastewater (I) |
|
Rodriguez-Garcia, Daniel | University of Almeria |
Casas-Lopez, Jose Luis | University of Almeria |
Guzman, Jose Luis | University of Almeria |
Garcia-Sanchez, Jose Luis | University of Almeria |
Keywords: Chemical process control, Process Control, Control applications
Abstract: The solar photo-Fenton process operated in raceway pond reactors (RPRs) has garnered significant attention as a promising advanced oxidation technology for the simultaneous removal of pathogens and organic microcontaminants contained in conventional UWWTP secondary effluents. Its feasibility has been proved from a physical, operational, and economical perspective at demonstration scale under real operating environments. Furthermore, recent research trends are exploring the integration of non-conventional secondary and tertiary treatments, such as microalgae-based wastewater treatment, and solar photo-Fenton (as a quaternary treatment), to reduce operating costs and mitigate the carbon footprint of the treatment. Nevertheless, further research is still required to address the automation and optimization of large-scale solar photo-Fenton plants, as no commercial equipment has been still developed for the real-time measurement of microcontaminant concentration (target variable). In addition, reaction stage indicators (such as hydrogen peroxide) have limitations in their online quantification using commercial probes, due to the interacting wastewater matrix effect and relatively low but rapid concentration changes. As such, the quantification of these variables requires sample analysis using sophisticated and highly-sensitive analytical equipment. This leads to significant delays in measurement, making it difficult to control and optimize the treatment in real time when environmental conditions (UVA irradiance) or inlet wastewater composition suddenly change. Therefore, control and optimization techniques should be oriented towards the use of reliable and accurate kinetic models to bridge the lack of real-time measuring equipment. The presentation will showcase the kinetic modelling of different solar photo-Fenton demonstration plants. It will present the use of these models for control and optimization purposes at different levels of complexity and accuracy, advantages and disadvantages being discussed in each case.This work has been partially financed by: PROJECT TED2021-130458B-I00, CAFIRA PROJECT PID2023-152519OB-I00 and INACUA PROJECT PLSQ_00330.
|
|
ThB16 |
Plaza Court 7 |
Robust Control |
Regular Session |
Chair: Seiler, Peter | University of Michigan, Ann Arbor |
Co-Chair: Yao, Bin | Purdue University |
|
13:30-13:45, Paper ThB16.1 | |
H_{infty}/mu-based Indirect Adaptive Robust Control of a Servo-Table System with Significant Flexible Modes and Bounded Nonlinear Uncertainties |
|
Chen, Zeshen | Purdue University |
Yao, Bin | Purdue University |
Keywords: Robust control, Adaptive control, Mechatronics
Abstract: This paper introduces a novel H_{infty}/mu-based indirect adaptive robust control (ARC) strategy aiming to better handle high-order dynamics often neglected by conventional rigid-body control designs. The proposed strategy integrates H∞/μ robust control with an adaptive mechanism to adapt the parameters of the modeled structural uncertainties and disturbances, and then effectively compensate for them. With a decoupled design that separates adaptation from feedback, the proposed strategy allows for high adaptation rates without exciting flexible modes or destabilizing the overall system. Validation on a motor-driven servo table with significant flexible modes demonstrates that the proposed strategy effectively enhances track performance compared to traditional methods, including direct-type ARC and PID control. Simulations and comparative experiments confirm the superior performance of the proposed strategy.
|
|
13:45-14:00, Paper ThB16.2 | |
Model-Free Generic Robust Control for Servo-Driven Actuation Mechanisms with Layered Insight into Energy Conversions |
|
Shahna, Mehdi Heydari | Tampere University |
Mattila, Jouni | Tampere University |
Keywords: Robust control, Mechatronics, Adaptive control
Abstract: To advance theoretical solutions and address limitations in modeling complex servo-driven actuation systems experiencing high non-linearity and load disturbances, this paper aims to design a practical model-free generic robust control (GRC) framework for these mechanisms. This framework is intended to be applicable across all actuator systems encompassing electrical, hydraulic, or pneumatic servomechanisms, while also functioning within complex interactions among dynamic components and adhering to control input constraints. In this respect, the state-space model of actuator systems is decomposed into smaller subsystems that incorporate the first principle equation of actuator motion dynamics and interactive energy conversion equations. This decomposition operates under the assumption that the comprehensive model of the servo-driven actuator system and energy conversion, uncertainties, load disturbances, and their bounds are unknown. Then, the GRC employs subsystem-based adaptive control strategies for each state-variant subsystem separately. Despite control input constraints and the unknown interactive system model, the GRC-applied actuator mechanism ensures uniform exponential stability and robustness in tracking desired motions. It features straightforward implementation, experimentally evaluated by applying it to two industrial applications.
|
|
14:00-14:15, Paper ThB16.3 | |
Nonlinear Robust Position Tracking Control of Electro-Hydraulic Systems without Velocity Measurements |
|
Taskingollu, Sule | Ege University |
Bayrak, Alper | Izmir Institute of Technology |
Selim, Erman | Ege University |
Tatlicioglu, Enver | Ege University |
Zergeroglu, Erkan | Gebze Technical University |
Keywords: Robust control, Observers for nonlinear systems, Lyapunov methods
Abstract: Dynamical equations representing an electro-hydraulic system (EHS) is composed of hydraulic, mechanical, and electrical subsystems. As a consequence, monitoring and/or controlling an EHS system requires the measurement of multiple state variables, which in turn mandates the use of additional sensory equipment. In particular, in systems where multiple hydraulic actuators are required, such as robotic arms or excavators, each additional sensor would increase the overall manufacturing and maintenance cost. In this study, we propose a robust observer/controller formulation to eliminate the need of velocity measurements in EHSs. The proposed formulation solely rely on nominal model information. The stability of the closed--loop system is ensured via novel Lyapunov-based arguments and the global uniform ultimate boundedness of the position tracking error and the velocity observation error are established. The effectiveness of the method is evaluated through simulation studies.
|
|
14:15-14:30, Paper ThB16.4 | |
Using Fractional-Order Extremum Seeking Based MPPT for Photovoltaic Applications under Partial Shaded and Varying Condition |
|
Gao, Yan | School of Automation Engineering, University of Electronic Scien |
Yin, Chun | University of Electronic Science and Technology of China |
Huang, Xuegang | Aerodynamics Institute, China Aerodynamics Research and Developm |
Dadras, Sara | Company |
Tan, Xutong | University of Electronic Science and Technology of China |
Keywords: Robust control, Optimal control, Adaptive control
Abstract: Partial shading conditions (PSC) and dynamic environmental variations often result in multi-peak power-voltage characteristics in photovoltaic (PV) systems. Traditional maximum power point tracking (MPPT) techniques struggle to track the maximum power point (MPP) under these conditions. To address this challenge, this paper presents an enhanced extremum seeking control strategy designed to improve MPPT performance in complex environments. The proposed employs an adaptive search mechanism that responds quickly to changes, effectively avoiding local maxima and ensuring global power optimization. Simulation results demonstrate that the algorithm achieves faster convergence and higher power stability under shading and changing environmental conditions. This method provides an efficient solution for maximizing energy utilization in PV systems operating in challenging conditions, offering significant potential for practical applications.
|
|
14:30-14:45, Paper ThB16.5 | |
Finite-Time Input-To-State Stabilization of Discrete-Time Systems |
|
Mazenc, Frederic | Inria Saclay |
Malisoff, Michael | Louisiana State University |
Keywords: Robust control, Time-varying systems, Output regulation
Abstract: We prove finite-time input-to-state stability estimates for discrete-time time-varying linear systems in closed loop with sample-data control laws. The upper bounds for the norms of the states in the estimates are suprema of the uncertainties over time intervals of constant finite lengths. We cover output feedback stabilization and input delays. We combine our results with a trajectory based approach, to prove novel global exponential input-to-state stability estimates for nonlinear systems with state delays. We illustrate our work using a dynamics containing an unknown nonlinearity and an unknown state delay.
|
|
14:45-15:00, Paper ThB16.6 | |
Control Synthesis Along Uncertain Trajectories Using Integral Quadratic Constraints |
|
Biertümpfel, Felix | Technische Universität Dresden |
Seiler, Peter | University of Michigan, Ann Arbor |
Pfifer, Harald | Technische Universität Dresden |
Keywords: Robust control, Time-varying systems, Uncertain systems
Abstract: The paper presents a novel approach to synthesize robust controllers for nonlinear systems along perturbed trajectories. The approach linearizes the system with respect to a reference trajectory. In contrast to existing methods rooted in robust linear time-varying synthesis, the approach accurately includes perturbations that drive the system away from the reference trajectory. Hence, the controller obtained in the linear framework provides a significantly more robust nonlinear performance. The calculation of the controller is derived from robust synthesis approaches rooted in the integral quadratic constraints framework. The feasibility of the approach is demonstrated on a pitch tracker design for a space launcher.
|
|
ThB17 |
Plaza Court 8 |
Autonomous Risk-Aware Perception, Planning, and Control |
Invited Session |
Chair: Motee, Nader | Lehigh University |
Co-Chair: Liu, Guangyi | Amazon Robotics |
Organizer: Liu, Guangyi | Amazon Robotics |
Organizer: Zavlanos, Michael M. | Duke University |
Organizer: Topcu, Ufuk | The University of Texas at Austin |
Organizer: Motee, Nader | Lehigh University |
|
13:30-13:45, Paper ThB17.1 | |
Risk-Sensitive Affine Control Synthesis for Stationary LTI Systems |
|
Hu, Yang | Harvard University |
Talebi, Shahriar | Harvard University |
Li, Na | Harvard University |
Keywords: Stochastic systems, Optimal control, Uncertain systems
Abstract: To address deviations from expected performance in stochastic systems, we propose a risk-sensitive control synthesis method to minimize certain risk measures over the limiting stationary distribution. Specifically, we extend Worst-case Conditional Value-at-Risk (W-CVaR) optimization for Linear Time-invariant (LTI) systems to handle nonzero-mean noise and affine controllers, using only the first and second moments of noise, which enhances robustness against model uncertainty. Highlighting the strong coupling between the linear and bias terms of the controller, we reformulate the synthesis problem as a Bilinear Matrix Inequality (BMI), and propose an alternating optimization algorithm with guaranteed convergence. Finally, we demonstrate the numerical performance of our approach in two representative settings, which shows that the proposed algorithm successfully synthesizes risk-sensitive controllers that outperform the naive LQR baseline.
|
|
13:45-14:00, Paper ThB17.2 | |
Risk-Aware MPPI for Stochastic Hybrid Systems (I) |
|
Parwana, Hardik | University of Michigan |
Black, Mitchell | MIT Lincoln Laboratory |
Hoxha, Bardh | Toyota Motor North America |
Okamoto, Hideki | Toyota |
Fainekos, Georgios | Toyota NA-R&D |
Prokhorov, Danil | Toyota Technical Center |
Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Hybrid systems, Autonomous robots
Abstract: Path Planning for stochastic hybrid systems presents a unique challenge of predicting distributions of future states subject to a state-dependent dynamics switching function. In this work, we propose a variant of Model Predictive Path Integral Control (MPPI) to plan kinodynamic paths for such systems. Monte Carlo may be inaccurate when few samples are chosen to predict future states under state-dependent disturbances and lacks computational efficiency when scaling to more particles. We employ recently proposed Unscented Transform-based methods to capture stochasticity in the states as well as the state-dependent switching surfaces. This is in contrast to previous works that perform switching based only on the mean of predicted states. We focus our motion planning application on the navigation of a mobile robot in the presence of dynamically moving agents whose responses are based on sensor-constrained attention zones. We evaluate our framework on a simulated mobile robot and show faster convergence to a goal without collisions when the robot exploits the hybrid human dynamics versus when it does not.
|
|
14:00-14:15, Paper ThB17.3 | |
Friedkin-Johnsen Model with Diminishing Competition (I) |
|
Ballotta, Luca | Delft University of Technology |
Vékássy, Áron | Harvard University |
Gil, Stephanie | Harvard University |
Yemini, Michal | Bar Ilan University |
Keywords: Distributed control, Cooperative control, Linear systems
Abstract: This letter studies the Friedkin-Johnsen (FJ) model with diminishing competition, or stubbornness. The original FJ model assumes fixed competition that is manifested through a constant weight that each agent gives to its initial opinion in addition to its contribution through a consensus dynamic. This letter investigates the effect of diminishing competition on the convergence point and speed of the FJ dynamics. We show that, if the competition is uniform across agents and vanishes asymptotically, the convergence point coincides with the nominal consensus reached with no competition. However, the diminishing competition slows down convergence according to its own rate of decay. We evaluate this phenomenon analytically and provide upper and lower bounds on the convergence rate. If competition is not uniform across clients, we show that the convergence point may not coincide with the nominal consensus point. Finally, we evaluate and validate our analytical insights numerically.
|
|
14:15-14:30, Paper ThB17.4 | |
Computationally Efficient Safe Control of Linear Systems under Severe Sensor Attacks (I) |
|
Tan, Xiao | California Institute of Technology |
Ong, Pio | California Institute of Technology |
Tabuada, Paulo | University of California at Los Angeles |
Ames, Aaron D. | California Institute of Technology |
Keywords: Fault tolerant systems, Linear systems, Constrained control
Abstract: Cyber-physical systems are prone to sensor attacks that can compromise safety. A common approach to synthesizing controllers robust to sensor attacks is secure state reconstruction (SSR)---but this is computationally expensive, hindering real-time control. In this paper, we take a safety-critical perspective on mitigating severe sensor attacks, leading to a computationally efficient solution. Namely, we design feedback controllers that ensure system safety by directly computing control actions from past input-output data. Instead of fully solving the SSR problem, we use conservative bounds on a control barrier function (CBF) condition, which we obtain by extending the recent eigendecomposition-based SSR approach to severe sensor attack settings. Additionally, we present an extended approach that solves a smaller-scale subproblem of the SSR problem, taking on some computational burden to mitigate the conservatism in the main approach. Numerical comparisons confirm that the traditional SSR approaches suffer from combinatorial issues, while our approach achieves safety guarantees with greater computational efficiency.
|
|
14:30-14:45, Paper ThB17.5 | |
Ergodic-Risk Constrained Policy Optimization: The Linear Quadratic Case |
|
Talebi, Shahriar | Harvard University |
Li, Na | Harvard University |
Keywords: Stochastic optimal control, Iterative learning control, Reinforcement learning
Abstract: Risk-sensitive control balances performance with resilience to unlikely events in uncertain systems. This paper introduces ergodic-risk criteria, which capture long-term cumulative risks through probabilistic limit theorems. By ensuring the dynamics exhibit strong ergodicity, we demonstrate that the time-correlated terms in these limiting criteria converge even with potentially heavy-tailed process noises as long as the noise has a finite fourth moment. Building upon this, we proposed the ergodic-risk constrained policy optimization which incorporates an ergodic-risk constraint to the classical Linear Quadratic Regulation (LQR) framework. We then propose a primal-dual policy optimization method that optimizes the average performance while satisfying the ergodic-risk constraints. Numerical results demonstrate that the new risk-constrained LQR not only optimizes average performance but also limits the asymptotic variance associated with the ergodic-risk criterion, making the closed-loop system more robust against sporadic large fluctuations in process noise.
|
|
14:45-15:00, Paper ThB17.6 | |
Safe Interactive Motion Planning by Differentiable Optimal Control and Online Preference Learning (I) |
|
Chavez Armijos, Andres | Boston University |
Berntorp, Karl | Walmart Advanced Systems Robotics |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Autonomous systems, Constrained control, Learning
Abstract: We present an interactive motion planner that integrates online learning of human driver preferences with parametric control barrier functions. Using stochastic models with Gaussian disturbances to capture human-driven vehicle behavior uncertainty, we update parameters in real-time parameter by Kalman filtering while ensuring safety by control barrier functions. A case study on highway lane-changing tasks demonstrates improved traffic flow, reduced disruptions, and lighter actuation effort compared to non-adaptive algorithms.
|
|
ThB18 |
Director's Row E |
Estimation and Control of Distributed Parameter Systems III |
Invited Session |
Chair: Hu, Weiwei | University of Georgia |
Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Hu, Weiwei | University of Georgia |
|
13:30-13:45, Paper ThB18.1 | |
A New Semi-Discretization of the Fully Clamped Euler-Bernoulli Beam Preserving Boundary Observability Uniformly (I) |
|
Aydin, Ahmet Kaan | University of Maryland, Baltimore County |
Haider, Md Zulfiqur | Iowa State University |
Ozer, Ahmet Ozkan | Western Kentucky University |
Keywords: Model/Controller reduction, Estimation, Distributed parameter systems
Abstract: This paper extends a Finite Difference model reduction method to the Euler-Bernoulli beam equation with fully clamped boundary conditions. The corresponding partial differential equation (PDE) is exactly observable in the energy space with a single boundary observer in arbitrarily short observation times. However, standard Finite Difference spatial discretization fails to achieve uniform exact observability as the mesh parameter approaches zero, with minimal observation time potentially depending on the filtering parameter. To address this, we propose a Finite Difference algorithm incorporating an averaging operator and discrete multipliers, leveraging Haraux’s theorem on the spectral gap to ensure uniform observability. This approach eliminates the need for artificial viscosity or Fourier filtering. Our method achieves uniform observability for arbitrarily small times with dual observers—the tip moment and average tip velocity—mirroring results from mixed Finite Elements applied to the wave equation with homogeneous Dirichlet boundary conditions, where dual controllers converge to the single controller of the PDE model [Castro, Micu—Numerische Mathematik'06]. Our reduced model is applicable to more complex systems involving Euler-Bernoulli beam equations.
|
|
13:45-14:00, Paper ThB18.2 | |
Sensor Distribution Partitioning and Filter Design for Parabolic PDEs Using Modified Centroidal Voronoi Tessellations (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems, Kalman filtering
Abstract: This paper utilizes computational geometry methods to partition the sensor spatial distribution in order to arrive at realistic sensor distributions. An ideal sensor representing a sensor distribution with enhanced observability and filter performance properties is approximated by a network of sensor distributions that describe available sensing devices. Such approximate sensor distributions take the form of spatial delta functions representing realistic sensing devices. The approximation is made feasible via computational geometry methods using a modification of the Centroidal Voronoi Tessellations (CVT). This modified version of CVT ensures that the ideal sensor distribution is partitioned into cells of equal areas and a single pointwise sensor is placed in each cell. Once the ideal sensor distribution is approximated, an optimal filter is subsequently designed for the original process resulting in a state estimator using realistic sensing devices. Extensive numerical studies on a parabolic PDE examine the various aspects of the decomposition using a single and multiple pointwise sensors.
|
|
14:00-14:15, Paper ThB18.3 | |
Boundary Stabilization of a Bending and Twisting Beam by Linear Quadratic Regulation (I) |
|
Krener, Arthur J | Naval Postgraduate School |
Keywords: Distributed parameter systems, Flexible structures, Numerical algorithms
Abstract: We use Linear Quadratic Regulation (LQR) to stabilize a bending and twisting beam by boundary control . The thin beam is rectangular and of moderate to high aspect ratio. This is the first step in using boundary actuation to stabilize a fluttering wing. The next step will be to add a model of the aerodynamic forces and moments that result ffom the motions of the wing. We leave this to the future. We find a full state feedback that atabilizes the bending and torsion of the beam using two point actuators located at the root of the beam. One actuator delivers a bending moment to the beam and the other delivers a twisting torque to the beam. The model is the Euler Bernouli beam equation inertially coupled to the wave equation. We express the optimal cost as a Fredholm quadratic form. Using integration by parts and completing the square we derive ta Riccati partial differential equation (RPDE) that the kernel of the optimal cost weakly satisfies. This RPDE is elliptic with a quadratic nonlinearity. We seek a continuous, weak solution. The continuity of the solution is important because it confirms that the optimal linear feedback has moved all the open loop eigenvalues off the imaginary axis and into the open left half plane. We use Fourier methods involving two families of eigenfunctions, the family of eigenfunctions of the second order spatial partial differential operator subject to Neumann boundary conditions and the family of eigenfunctions of the fourth order spatial partial differential operator subject to modified cantilever boundary conditions. In this way we convert the RPDE into an infinite dimensional Algebraic Riccati Equation (ARE). The ARE has four coupled blocks corresponding to the two families of eigenfunctions in the first or second position. We close with an example where the infinite dimensional ARE is approximated by a sixteen dimensional ARE.
|
|
14:15-14:30, Paper ThB18.4 | |
Consensus of Hyperbolic Multi-Agent Systems under Markov Switching Topologies (I) |
|
Cao, Lei | Beijing University of Technology |
Zhan, Jingyuan | Beijing University of Technology |
Zhang, Liguo | Beijing University of Technology |
Keywords: Distributed parameter systems, Cooperative control, Traffic control
Abstract: This paper investigates the consensus problem of hyperbolic multi-agent systems (MASs) under Markov switching topologies. Firstly, we propose a boundary feedback consensus protocol for the hyperbolic MASs of conservation laws, in which the communication topology of the agents is subject to a Markov chain. Secondly, by employing the Lyapunov approach, we provide the consensus analysis under Markov switching topologies, obtaining sufficient conditions w.r.t. the boundary control matrices, Laplacian matrices and generator of Markov process for ensuring the exponential mean-square consensus. To simplify the inequality conditions, we further combine spectral decomposition techniques with the Lyapunov approach to establish sufficient conditions w.r.t. Laplacian eigenvalues, which are more tractable. Finally, we apply the proposed boundary consensus protocol to synchronize a multi-lane road traffic flow system under Markov switching topologies, and present numerical simulation results to illustrate the effectiveness of the boundary consensus protocol.
|
|
14:30-14:45, Paper ThB18.5 | |
Closed-Form Adaptive Tracking Control of Heat Equations Aided by Fourier Regularization and Bi-Orthogonal Series |
|
Ma, Tong | Northeastern University |
Zhu, Xuwen | Northeastern University |
Keywords: Distributed parameter systems, Adaptive control, Numerical algorithms
Abstract: This paper proposes a closed-form adaptive tracking control approach for linear heat equations with unknown parameters to achieve full temperature profile tracking by leveraging Fourier regularization and bi-orthogonal series. A state predictor which copies the plant with unknown parameters replaced by their estimates is built and an adaptive law is designed to estimate the unknown parameters. The state predictor is decomposed into two subsystems for tracking control synthesis: the first subsystem involves terms from the original heat equation, while the second subsystem is simpler and can be reformulated as a standard heat equation. Specifically, the first subsystem is regarded as an unforced PDE whose terminal states always follow the desired temperature profile such that its initial condition can be calculated by solving the backward heat equation at every time step. To address the blow-up issue in backward calculation, a Fourier regularization scheme is explored to cut off the higher-order Fourier modes and an appropriate tradeoff between approximation accuracy and robustness is achieved. Given the solutions from the first subsystem, the initial condition for the second subsystem can be subsequently calculated. We propose a numerical algorithm to calculate a set of bi-orthogonal series offline and employ them to compute the boundary control function that drives the second subsystem to zero at every time step. Combining these two subsystems, it guarantees that the overall system follows the desired temperature profile. We demonstrate that the proposed closed-form adaptive tracking control algorithm achieves full temperature profile tracking with <2% error.
|
|
14:45-15:00, Paper ThB18.6 | |
Output Regulation for Transport-Reaction Three-Dimensional Hyperbolic PDE in Cylindrical Coordinates |
|
Akbarnezhad, Mahdis | University of Alberta |
Ozorio Cassol, Guilherme | University of Alberta |
Dubljevic, Stevan | University of Alberta |
Koch, Charles Robert | University of Alberta |
Keywords: Distributed parameter systems, Linear systems, Chemical process control
Abstract: This work addresses the design of an output regulator for a three-dimensional transport-reaction model in cylindrical coordinates described by a first-order hyperbolic partial differential equation. By utilizing the Laplace transform in space and time a closed-form semigroup operator is derived to capture the system’s evolution in time, and the Lyapunov equation is solved to address the stability analysis. An output regulator is designed to track periodic signals generated by a finite-dimensional exosystem. This is achieved by solving the continuous-time Sylvester output regulation equation. Numerical simulations demonstrate the controller’s efficiency in properly tracking the desired signal.
|
|
ThB19 |
Director's Row H |
Contrasting and Unifying Process and Mechatronic Perspectives on PID
Control |
Tutorial Session |
Chair: Abramovitch, Daniel Y. | Agilent Technologies |
Organizer: Abramovitch, Daniel Y. | Agilent Technologies |
|
13:30-14:00, Paper ThB19.1 | |
Different Perceptions of PID Control in the Mechatronic and Process Control Worlds (I) |
|
Abramovitch, Daniel Y. | Agilent Technologies |
Keywords: PID control, Mechatronics, Process Control
Abstract: It is not uncommon for graduate students on the mechatronics side of the control world to treat the Proportional plus Integral plus Derivative (PID) controller with a certain amount of disdain. This is not surprising since most control texts from this end of the control world treat PIDs as simple, basic structures, to be quickly replaced by more advanced methods. To that end, these texts devote only a handful of pages to the subject. It seems that – at least in the mechatronics world – PIDs are considered too simple for much interest in academia while practicing engineers do not seem to care why they were working. This is a far cry from the treatment of PIDs in the chemical and bio-process control worlds (CPC and BPC, respectively). At this end of the control spectrum, PID controllers are studied in far more depth obtaining entire books or book series. Despite this volumetric expansion of material, it seems that in the latter worlds, many of the issues and concerns one sees in the mechatronic world are treated as obscure corner cases. Depending upon the teaching text, issues of sampling and digital representation may have been completely omitted. There were other surprises. While PIDs were almost universal and standard, they were almost never unified or standardized. Furthermore, what seemed to limit performance was not the structure of the controller itself, but the lack of accurate system/process models based on repeated physical system measurements. However, the mechatronic and process PID goals and foibles were not that different once one considered the different system, time constant, and measurement constraints. We will discuss these issues with the goal of getting a more unified view of PIDs across our application domains. We will provide a handful of common PID forms and show how they are related, so that we can approach any PID structure with the same analytical approach. We will finally look forward to how PIDs can be used, not only as a fundamental teaching tool for explaining control outside of our research circles, but as a critical component for advanced control methods.
|
|
14:00-14:30, Paper ThB19.2 | |
More Unified View of Anti-Windup Methods (I) |
|
Abramovitch, Daniel Y. | Agilent Technologies |
Keywords: PID control, Mechatronics, Process Control
Abstract: A key feature of most Proportional-Integral-Derivative (PID) controllers is the need to prevent integrator windup in the event of control signal saturation. Providing a means of anti-windup can be viewed as a main motivator for having the integrator block separated out from the rest of the controller. The PID literature contains several popular methods for implementing anti-windup, including integrator reset, back calculation, and integrator clamping (also known as conditional integration). The effectiveness of these methods depends upon when the saturation occurs and how the PID is implemented (in continuous or discrete time). Some methods make sense in either domain, while some are only feasible in a discrete-time implementation. This talk will discuss the different methods in the context of which are most popular in the process control world versus which are used in the mechatronic control world.
|
|
14:30-15:00, Paper ThB19.3 | |
More Unified View of Loop Shaping for PIDs (I) |
|
Abramovitch, Daniel Y. | Agilent Technologies |
Keywords: PID control, Mechatronics, Process Control
Abstract: Loop shaping typically refers to using the frequency response function (FRF) of a system to guide the compensation so that the overall loop (open or closed) takes on a desired shape. One simple loop shaping metric is to make the open-loop response resemble that of an integrator to as high a frequency for which one can maintain phase margin. Not only does this show up across multiple applications, but its simplicity makes it easier to explain and transfer into industrial products. Furthermore, if the loop gain crossover is chosen to guarantee sufficient gain margin, typically around 60◦ or more, then the closed-loop response resembles a first-order low-pass filter (LPF) with minimal peaking. While this is a popular way to work in the mechatronic control world, frequency response methods are far less prevalent in the process control world. However, it can be shown that Internal Model Control (IMC) is algebraically performing loop shaping. We will compare and unify these different views of loop shaping for PID controllers.
|
|
ThB20 |
Director's Row I |
Nonlinear Estimation and Filtering |
Regular Session |
Chair: Ebeigbe, Donald | Pennsylvania State University |
Co-Chair: Spall, James C. | Johns Hopkins Univ |
|
13:30-13:45, Paper ThB20.1 | |
Stochastic Stability of Kalman-Type Nonlinear Filters |
|
Wei, Shihong | Johns Hopkins University |
Spall, James C. | Johns Hopkins Univ |
Keywords: Kalman filtering, Stability of nonlinear systems, Filtering
Abstract: We analyze the stochastic stability of a class of generic Kalman-type nonlinear filtering algorithms with the aim of providing practically verifiable conditions. We investigate conditions under which the Kalman gain matrix and the approximate error covariance matrix are bounded in nonlinear systems. By stochastic Lyapunov stability theory, we show that the estimation error for the nonlinear filter is bounded exponentially in mean square, with its norm bounded in probability. The analysis in this work applies to popular nonlinear Kalman filter extensions such as the extended Kalman filter (EKF) and the constant-gain Kalman filter (CGKF).
|
|
13:45-14:00, Paper ThB20.2 | |
Nonlinear Kalman Filtering in the Absence of Direct Functional Relationships between Measurement and State |
|
Alsaggaf, Abdulrahman U | King Abdulaziz University, Penn State University |
Saberi, Maryam | Penn State University |
Berry, Tyrus | George Mason University |
Ebeigbe, Donald | Pennsylvania State University |
Keywords: Estimation, Filtering, Kalman filtering
Abstract: This letter introduces a Kalman Filter framework for systems with process noise and measurements characterized by state-dependent, nonlinear conditional means and covariances. Estimating such general nonlinear models is challenging because traditional methods, such as the Extended Kalman Filter, linearize only functions – not noise – and require state-independent covariances. These limitations often necessitate Bayesian approaches that rely on specific distribution assumptions. To address these challenges, we propose a framework that employs a recursive least squares method that relies solely on conditional means and covariances, eliminating the need for explicit probability distributions. By applying first-order linearizations and incorporating targeted modifications to manage state dependence, the filter simplifies implementation, reduces computational demands, and provides a practical solution for systems that deviate from the assumptions underlying traditional Kalman filters. Simulation results on a compartmental model demonstrate performance comparable to sequential Monte Carlo methods while significantly lowering computational costs, effectively addressing real-world challenges of scalability and precision.
|
|
14:00-14:15, Paper ThB20.3 | |
A Robust and Global Hybrid Complementary Filter on SO(3) Using Morse Functions on RP3 |
|
Jirwankar, Piyush P. | University of California Santa Cruz |
Montgomery, Richard | University of California, Santa Cruz |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Estimation, Hybrid systems, Algebraic/geometric methods
Abstract: This paper addresses the problem of global attitude filtering on the special orthogonal group SO(3). A hysteresis-based hybrid switching strategy is used to switch between two filters that operate in different regions of SO(3). The first filter is the passive complementary filter, whereas the second filter is designed using an appropriately chosen Morse function. To this end, a novel approach to design Morse functions on SO(3) is proposed. The proposed hybrid filter is shown to be input-to-state stable with respect to measurement noise. Simulations validate the stability properties of the hybrid filter.
|
|
14:15-14:30, Paper ThB20.4 | |
A New Type of Nonlinear Disturbance Rejection |
|
Kuang, Simon | University of California, Davis |
Lin, Xinfan | University of California, Davis |
Keywords: Stability of nonlinear systems, Robust control, Lyapunov methods
Abstract: Asymptotic disturbance rejection (equivalently tracking) for nonlinear systems has been studied only in qualitative terms (the state is asymptotically stable under bounded disturbances). We show how to prove quantitative performance guarantees for the nonlinear servomechanism problem. Our technique originates by applying a gain inequalities point of view to an emph{ad fontes} reexamination of the linear problem: what is the nonlinear equivalent of a sensitivity transfer function with a zero at the origin? We answer: a nonlinear input-output system is high-pass if its output is stable with respect to the emph{derivative} of the input. We first show that definition generalizes high-pass resistor-capacitor circuit analysis to accommodate nonlinear resistors. We then show that this definition generalizes the steady-state disturbance rejection property of integral feedback controllers for linear systems. The theoretical payoff is that low-frequency disturbance rejection is captured by a quantitative, non-asymptotic output cost bound. Finally, we raise theoretical questions about compositionality of nonlinear operators.
|
|
14:30-14:45, Paper ThB20.5 | |
Wasserstein Regularity of Nonlinear Filters As Belief-MDPs, and Implications on Ergodicity, Optimality and Learning for POMDPs |
|
Demirci, Yunus emre | Queen's University |
Kara, Ali Devran | Florida State University |
Yuksel, Serdar | Queen's University |
Keywords: Markov processes, Stability of nonlinear systems, Reinforcement learning
Abstract: We obtain conditions for Wasserstein regularity of non-linear filters, which serve as reduced MDP (Markov Decision Process) models for Partially Observable Markov Decision Process (POMDPs). In particular, we obtain general and unified conditions which recover several recent studies. These conditions lead to (i) unique ergodicity and (ii) geometric ergodicity of control-free non-linear filter kernels, as well as (iii) existence of optimal solutions and (iv) their rigorous approximations and Q-learning for discounted cost as well as average cost optimal stochastic control problems. Specifically, we identify complementary conditions under which finite-window policies remain near-optimal and establish refined error bounds. While previous work assumed transition kernels to be continuous in total variation and Wasserstein regularity of measurement channels with moduli of continuity, we provide a general conditions which both recovers the aforementioned and also shows that Wasserstein regularity of the transition kernel and total variation continuity of the measurement channels with moduli of continuity is sufficient for existence, approximations and robustness results involving POMDPs.
|
|
ThB21 |
Director's Row J |
AI Engineering for Safety-Critical Control Systems: An Aerospace
Perspective |
Tutorial Session |
Chair: Inalhan, Gokhan | Sloane Institute |
Co-Chair: Atkins, Ella | Virginia Tech |
Organizer: Durak, Umut | German Aerospace Center (DLR) |
Organizer: Zamira, Daw | University of Stuttgart |
Organizer: Topcu, Ufuk | The University of Texas at Austin |
Organizer: Atkins, Ella | Virginia Tech |
Organizer: Cofer, Darren | Rockwell Collins |
Organizer: Uzun, Mevlüt | Istanbul Technical University |
Organizer: Inalhan, Gokhan | Sloane Institute |
Organizer: Kosmidis, Leonidas | Barcelona Supercomputing Center (BSC) |
Organizer: Paunicka, James | Boeing Phantom Works |
Organizer: Pham, Trung T, | FAA |
|
13:30-15:00, Paper ThB21.1 | |
AI Engineering for Safety-Critical Control Systems: An Aerospace Perspective (I) |
|
Durak, Umut | German Aerospace Center (DLR) |
Zamira, Daw | University of Stuttgart |
Topcu, Ufuk | The University of Texas at Austin |
Atkins, Ella | Virginia Tech |
Cofer, Darren | Rockwell Collins |
Uzun, Mevlüt | Istanbul Technical University |
Inalhan, Gokhan | Sloane Institute |
Kosmidis, Leonidas | Barcelona Supercomputing Center (BSC) |
Paunicka, James | Boeing Phantom Works |
Pham, Trung T, | FAA |
Keywords: Neural networks, Machine learning, Aerospace
Abstract: This tutorial highlights the engineering aspects of developing AI-based control systems. It renders the whole systems engineering spectrum from safety assessment to requirements engineering, from design to implementation, from verification to deployment and further more certification and operation of the aerospace products that include control systems that utilize AI techniques. The position paper not only provides a examples of AI-based safety-critical control systems in aerospace, it also shares the opinions of the authors about the important techniques that are being developed or to be developed at each above mentioned activities of systems engineering.
|
|
ThC01 |
Plaza AB |
Data-Driven Control III |
Regular Session |
Chair: Ozay, Necmiye | Univ. of Michigan |
Co-Chair: Ornik, Melkior | University of Illinois Urbana-Champaign |
|
15:30-15:45, Paper ThC01.1 | |
Data-Driven Composite Nonlinear Feedback Control for Semi-Global Output Regulation of Unknown Linear Systems with Input Saturation |
|
Cai, Hanwen | Xiamen University |
Lan, Weiyao | Xiamen University |
Yu, Xiao | Xiamen University |
Keywords: Output regulation, Data driven control, Stability of nonlinear systems
Abstract: This paper addresses the semi-global output regulation problem for continuous-time linear systems with input saturation and unknown dynamics. First, we employ a low-gain technique to design a state-feedback linear control law such that the control input operates within the linear region of the actuator. Then, taking it as the linear part, we construct a composite nonlinear feedback (CNF) control law, consisting of both linear and nonlinear parts, to improve the transient performance of the closed-loop system. Without requiring prior knowledge of the system dynamics or an initial stabilizing control policy, we propose a novel adaptive dynamic programming (ADP) learning algorithm. This algorithm learns both the linear part and the nonlinear part of the CNF control law using the same set of data. In addition, the algorithm uses single-layer filters, eliminating the need for integral operations during the learning process. Finally, the effectiveness of the proposed algorithm is demonstrated by an illustrative example.
|
|
15:45-16:00, Paper ThC01.2 | |
Noise Sensitivity of Direct Data-Driven Linear Quadratic Regulator by Semidefinite Programming |
|
Zeng, Xiong | University of Michigan, Ann Arbor |
Bako, Laurent | Ecole Centrale De Lyon |
Ozay, Necmiye | Univ. of Michigan |
Keywords: Statistical learning, Behavioural systems, Identification for control
Abstract: In this paper, we study the noise sensitivity of the semidefinite program (SDP) used in the direct data-driven infinite horizon linear quadratic regulator (LQR) problem for discrete-time linear time-invariant systems. While this SDP is shown to find the true LQR controller in the noise-free setting, we show that it leads to a trivial solution when data is corrupted by noise, even when the noise is arbitrarily small. Hence, a “certainty equivalence” approach that uses the original SDP with noisy data is not appropriate.
|
|
16:00-16:15, Paper ThC01.3 | |
Sum-Of-Squares Data-Driven Robustly Stabilizing and Contracting Controller Synthesis for Polynomial Nonlinear Systems |
|
El-Kebir, Hamza | University of Illinois at Urbana-Champaign |
Ornik, Melkior | University of Illinois Urbana-Champaign |
Keywords: Robust adaptive control, Stability of nonlinear systems, Nonlinear systems identification
Abstract: This work presents a computationally efficient approach to data-driven robust contracting controller synthesis for polynomial control-affine systems based on a sum-of-squares program. In particular, we consider the case in which a system alternates between periods of high-quality sensor data and low-quality sensor data. In the high-quality sensor data regime, we focus on robust system identification based on the data informativity framework. In low-quality sensor data regimes we employ a robustly contracting controller that is synthesized online by solving a sum-of-squares program based on data acquired in the high-quality regime, so as to limit state deviation until high-quality data is available. This approach is motivated by real-life control applications in which systems experience periodic data blackouts or occlusion, such as autonomous vehicles undergoing loss of GPS signal or solar glare in machine vision systems. We apply our approach to a planar unmanned aerial vehicle model subject to an unknown wind field, demonstrating its uses for verifiably tight control on trajectory deviation.
|
|
16:15-16:30, Paper ThC01.4 | |
Stochastic Data-Driven Predictive Control: Chance-Constraint Satisfaction with Identified Multi-Step Predictors |
|
Balim, Haldun | ETH Zurich |
Carron, Andrea | ETH |
Zeilinger, Melanie N. | ETH Zurich |
Köhler, Johannes | ETH Zurich |
Keywords: Predictive control for linear systems, Data driven control, Identification for control
Abstract: We propose a novel data-driven stochastic model predictive control framework for uncertain linear systems with noisy output measurements. Our approach leverages multi-step predictors to efficiently propagate uncertainty, ensuring chance constraint satisfaction. In particular, we present a strategy to identify multi-step predictors and quantify the associated uncertainty using a surrogate (data-driven) state space model. Then, we utilize the derived distribution to formulate a constraint tightening that ensures chance constraint satisfaction despite the parametric uncertainty. A numerical example highlights the reduced conservatism of handling parametric uncertainty in the proposed method compared to state-of-the-art solutions.
|
|
16:30-16:45, Paper ThC01.5 | |
Kernelized Offset-Free Data-Driven Predictive Control for Nonlinear Systems |
|
de Jong, Thomas O. | Eindhoven University of Technology |
Lazar, Mircea | Eindhoven University of Technology |
Keywords: Predictive control for nonlinear systems, Data driven control, Machine learning
Abstract: This paper presents a kernelized offset-free data-driven predictive control scheme for nonlinear systems. Traditional model-based and data-driven predictive controllers often struggle with inaccurate predictors or persistent disturbances, especially in the case of nonlinear dynamics, leading to tracking offsets and stability issues. To overcome these limitations, we employ kernel methods to parameterize the nonlinear terms of a velocity model, preserving its structure and efficiently learning unknown parameters through a least squares approach. This results in a offset-free data-driven predictive control scheme formulated as a nonlinear program, but solvable via sequential quadratic programming. We provide a framework for analyzing recursive feasibility and stability of the developed method and we demonstrate its effectiveness through simulations on a nonlinear benchmark example.
|
|
16:45-17:00, Paper ThC01.6 | |
Data-Driven Approach to the Design of Fault Isolation Filter |
|
Gomez Munoz, Daniel | Technical University of Kaiserslautern |
Zhang, Ping | University of Kaiserslautern-Landau |
Keywords: Fault diagnosis, Identification, Subspace methods
Abstract: This paper proposes a data-driven approach to design a fault isolation filter to isolate actuator faults in discrete linear time-invariant systems. It is shown that a fault isolation filter can be obtained directly from the system input and output data without any knowledge about the system model. The subspace identification technique is applied to obtain the fault detectability indices and the fault detectability matrix directly from the data. The design procedure involves a singular value decomposition and an LU factorization with partial pivoting. The identified fault isolation filter is able to isolate multiple faults with only a single filter, which significantly reduces the online computational efforts needed for fault isolation. A simulation example is given to illustrate the proposed datadriven approach.
|
|
ThC02 |
Plaza DE |
Model-Based Reinforcement Learning for High Dimensional Nonlinear Dynamical
Systems |
Tutorial Session |
Chair: Chakravorty, Suman | Texas A&M University |
Co-Chair: Goyal, Raman | Palo Alto Reserach Center, SRI International |
Organizer: Chakravorty, Suman | Texas A&M University |
Organizer: Goyal, Raman | Palo Alto Reserach Center, SRI International |
Organizer: Wang, Ran | Texas A&M University |
Organizer: Gul Mohamed, Mohamed Naveed | Texas A&M University |
Organizer: Sharma, Aayushman | Texas A&M University |
Organizer: Abhijeet, Fnu | Texas A&M University |
|
15:30-16:15, Paper ThC02.1 | |
The Search for Feedback in Reinforcement Learning (I) |
|
Wang, Ran | Texas A&M University |
Sharma, Aayushman | Texas A&M University |
Parunandi, Karthikeya Sharma | Texas A&M University |
Goyal, Raman | Palo Alto Reserach Center, SRI International |
Gul Mohamed, Mohamed Naveed | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Reinforcement learning, Optimal control, Iterative learning control
Abstract: Reinforcement Learning for control of an unknown nonlinear dynamical system is equivalent to the search for an optimal feedback law utilizing only data from the simulations/ rollouts of the dynamical system. Most RL techniques search over a complex global nonlinear feedback parametrization, making them suffer from high training times and variance. Instead, we advocate a model-based reinforcement learning (MBRL) framework that searches over a local feedback representation consisting of an open-loop sequence and an associated optimal linear feedback law. We show that this approach results in highly efficient training, the answers obtained are globally optimum, repeatable with negligible variance, and hence reliable, and the resulting closed performance is superior to the state-of-the-art RL techniques that search for a global nonlinear feedback law. Finally, if we replan whenever required, akin to Model Predictive Control (MPC), which is feasible due to the fast and reliable local solution, it allows us to recover the optimal global feedback law.
|
|
16:15-16:30, Paper ThC02.2 | |
Model Based Reinforcement Learning for Partially Observed Nonlinear Systems (I) |
|
Goyal, Raman | Palo Alto Reserach Center, SRI International |
Gul Mohamed, Mohamed Naveed | Texas A&M University |
Wang, Ran | Texas A&M University |
Sharma, Aayushman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Reinforcement learning, Nonlinear output feedback, Optimal control
Abstract: This talk develops a model-based Reinforcement Learning approach to the closed-loop control of nonlinear dynamical systems with a partial nonlinear observation model. We propose an “information-state” based approach to rigorously transform the partially observed problem into a fully observed problem where the information-state consists of the past several observations and control inputs. We further show the equivalence of the transformed and the initial partially observed optimal control problems and provide the conditions to solve for the deterministic optimal solution. We develop a data-based generalization of the iterative Linear Quadratic Regulator (ILQR) for the RL of partially-observed systems using a local linear time-varying model of the information-state dynamics approximated by an Autoregressive–moving-average (ARMA) model that is generated using only the input-output data. This open-loop trajectory optimization solution is then used to design a local feedback control law, and the composite law then provides an optimum solution to the partially observed feedback design problem. The efficacy of the developed method is shown by controlling complex high dimensional nonlinear dynamical systems in the presence of model and sensing uncertainty.
|
|
16:30-16:45, Paper ThC02.3 | |
An Optimal Solution to Infinite Horizon Nonlinear Control Problems (I) |
|
Gul Mohamed, Mohamed Naveed | Texas A&M University |
Goyal, Raman | Palo Alto Reserach Center, SRI International |
Sharma, Aayushman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Optimal control, Stability of nonlinear systems, Predictive control for nonlinear systems
Abstract: In this talk, we consider the infinite horizon optimal control problem for nonlinear systems. Under the conditions of controllability of the linearized system around the origin, and nonlinear controllability of the system to a terminal set containing the origin, we establish an approximate regularized solution approach consisting of a “finite free final time” optimal transfer problem to the terminal set, and an infinite horizon linear regulation problem within the terminal set, that is shown to render the origin globally asymptotically stable. Further, we show that the approximations converge to the true optimal cost function as the size of the terminal set decreases to zero. The talk also discusses the extension to nonholonomic systems, which are not linearly controllable and show similar guarantees. The proposed data-based approach is empirically evaluated on the robotics problems to show that the finite time transfer is far shorter than the effective horizon required to solve the infinite horizon problem without the proposed regularization. We also do comparisons of our approach with nonlinear Model Predictive Control.
|
|
16:45-17:00, Paper ThC02.4 | |
A Reduced Order MBRL Approach for the Control of Nonlinear Partial Differential Equations (I) |
|
Sharma, Aayushman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Keywords: Large-scale systems, Reduced order modeling, Optimal control
Abstract: This talk will introduce a reduced order model based reinforcement learning (MBRL) approach, utilizing the Iterative Linear Quadratic Regulator (ILQR) algorithm for the optimal control of nonlinear partial differential equations (PDEs). The approach proposes a novel modification of the ILQR technique: it uses the Method of Snapshots to identify a reduced order Linear Time Varying (LTV) approximation of the nonlinear PDE dynamics around a current estimate of the optimal trajectory, utilizes the identified LTV model to solve a time-varying reduced order LQR problem to obtain an improved estimate of the optimal trajectory along with a new reduced basis, and iterates till convergence, in an ‘Iteratively-Reduce-then-Control (IRTC)’ manner. The proposed approach is tested on the viscous Burger’s equation and two phase-field models for microstructure evolution in materials, and the results show that there is a significant reduction in the computational burden over the standard ILQR approach, without significantly sacrificing performance.
|
|
ThC03 |
Plaza CF |
Multiagent Systems |
Regular Session |
Chair: Stockar, Stephanie | The Ohio State University |
Co-Chair: Wang, Yang | Shanghai Technology Unversity |
|
15:30-15:45, Paper ThC03.1 | |
A Novel Plug-And-Play Cooperative Disturbance Compensator for Heterogeneous Uncertain Linear Multi-Agent Systems |
|
Gong, Yizhou | ShanghaiTech University |
Wang, Yang | Shanghai Technology Unversity |
Keywords: Cooperative control, Robust adaptive control, Output regulation
Abstract: Cooperative output regulation (COR) for multi-agent systems (MAS) has garnered significant attention due to its broad applications. This paper offers a fresh perspective on the COR problem for a class of heterogeneous, uncertain, linear SISO MAS facing two major challenges simultaneously: (1) the agents are highly uncertain and heterogeneous, and (2) communication is restricted to a directed spanning tree with only output information exchanged among agents. We propose a novel plug-and-play cooperative feedforward disturbance compensator that requires little prior knowledge of follower agents' dynamics. Unlike traditional methods, our compensator is fully distributed, adaptive, and highly robust to agent heterogeneity. It eliminates the need for system identification and handles large uncertainties without relying on typical assumptions such as minimum phase, identical dimensionality, or uniform relative degree across agents. Moreover, it is designed for scalability, allowing the seamless addition or removal of agents without the need for controller redesign, provided the network maintains a spanning tree. Theoretical analysis and simulations demonstrate the compensator's effectiveness in solving the COR problem across various scenarios.
|
|
15:45-16:00, Paper ThC03.2 | |
Optimality Loss Minimization in Distributed Control: A Multi-Agent Partitioning Approach |
|
Blizard, Audrey | The Ohio State University |
Stockar, Stephanie | The Ohio State University |
Keywords: Distributed control, Agents-based systems, Large-scale systems
Abstract: This paper presents a method for finding the partition that minimizes the total cost at the Nash Equilibrium in a communication-based control of large-scale systems via an evolving tree search. By directly considering the control objectives, including the cost function and constraints, the proposed partitioning method finds the distributed control structure that minimizes the cost increase introduced when converting from a centralized control problem to a distributed one. The performance of the proposed partitioning method is compared to a state-coupling-based method for a set of 30 constrained linear quadratic regulator control problems, where it finds feasible solutions 23% more often and reduces the average total cost by 14.9%.
|
|
16:00-16:15, Paper ThC03.3 | |
Distributed Control for Heterogeneous Multi-Agent Systems in Higher-Order Voronoi Coverage |
|
Zhang, Hang | Zhejiang University |
Zheng, Ronghao | Zhejiang University, ZJU |
Zhang, Senlin | Zhejiang University |
Liu, Meiqin | Zhejiang University |
Keywords: Distributed control, Cooperative control, Autonomous robots
Abstract: This paper presents a distributed coverage control law for heterogeneous agents to achieve higher-order Voronoi coverage. Unlike most existing works, which assume that an event occurring at a certain location within the domain requires only a single homogeneous agent’s response, we are motivated by applications where multiple agents of different types are required to respond to the event. In this paper, we introduce a partitioning method designed for heterogeneous agents in higher-order Voronoi coverage. Additionally, we present various forms of coverage cost functions, each tailored to meet the diverse needs of different applications. Based on these, we develop a controller that enables heterogeneous multi-agent systems to achieve equilibrium in a distributed manner. The convergence of the control law is proved and the performance is evaluated through simulations.
|
|
16:15-16:30, Paper ThC03.4 | |
We Are Legion: High Probability Regret Bound in Adversarial Multiagent Online Learning |
|
Jaladi, Sri | Stanford |
Bistritz, Ilai | Tel Aviv University |
Keywords: Agents-based systems, Large-scale systems, Reinforcement learning
Abstract: We study a large-scale multiagent online learning problem where the number of agents N is significantly larger than the number of arms K. The agents face the same adversarial online learning problem with K arms over T rounds, where the adversary chooses the cost vectors boldsymbol{l}(1),ldots,boldsymbol{l}(T) before the game begins. Each round t, each agent n picks an arm a_n(t) and incurs a cost of l_{a_n(t)}(t). Then, at the end of the round, all agents observe the costs of all arms l_{1}(t),ldots,l_{K}(t). The exponential weights algorithm achieves an order-wise optimal textit{expected} regret of O(sqrt{T}) for each agent. However, the textit{variance} of the sum of regrets scales linearly with the number of agents, which is unacceptable for a large-scale multi-agent system. To mitigate this, we propose a simple fully distributed algorithm that achieves the same optimal expected sum of regrets but reduces the variance of the sum of regrets from O(N) to O(text{min}(N,K)) with no communication required between the agents.
|
|
16:30-16:45, Paper ThC03.5 | |
Optimization of Linear Multi-Agent Dynamical Systems Via Feedback Distributed Gradient Descent Methods |
|
Mehrnoosh, Amir | Universite Catholique De Louvain |
Bianchin, Gianluca | University of Louvain |
Keywords: Optimization algorithms, Distributed control, Networked control systems
Abstract: Feedback optimization is a control paradigm for optimizing dynamical systems at steady-state. Existing methods rely on centralized architectures, limiting scalability and privacy in large-scale systems. We propose a distributed feedback optimization approach inspired by the Distributed Gradient Descent method, where each agent updates its control variable using local gradients and average of neighbors. Under convexity and smoothness assumptions, we establish convergence to a critical optimization point, and under restricted strong convexity, we prove linear convergence to a neighborhood of the optimum, with its size dependent on the stepsize. Simulations corroborate the theoretical results.
|
|
16:45-17:00, Paper ThC03.6 | |
Orthogonal Modal Representation in Long-Term Risk Quantification for Dynamic Multi-Agent Systems |
|
Yasunaga, Ryoma | Keio University |
Nakahira, Yorie | Carnegie Mellon University |
Hori, Yutaka | Keio University |
Keywords: Large-scale systems, Stochastic systems, Autonomous systems
Abstract: Quantifying long-term risk in large-scale multi-agent systems is critical for ensuring safe operation. However, the high dimensionality of these systems and the rarity of risk events can make the required computations prohibitively expensive. To overcome this challenge, we introduce a graph-based representation and efficient risk quantification techniques tailored for stochastic multi-agent systems. A key technical innovation is a systematic approach to decompose the estimation problem of system-wide safety probabilities into smaller, lower-dimensional sub-systems with sub-safe sets. This decomposition leverages the graph Fourier basis of the agent interaction network, providing a natural and scalable representation. The safety probabilities for these sub-systems are derived as solutions to a set of low-dimensional partial differential equations (PDEs). The proposed decomposition enables existing risk quantification approaches but does so without an exponential increase in computational complexity with respect to the number of agents. The proposed PDE characterization allows physics-informed learning to be used to estimate long-term risk probability using short-term samples or without sufficient risk events.
|
|
ThC04 |
Governor's Sq. 15 |
Mechatronics II |
Invited Session |
Chair: Zuo, Shan | University of Connecticut |
Co-Chair: Han, Feng | New York Institute of Technology |
Organizer: Barton, Kira | University of Michigan, Ann Arbor |
Organizer: Su, Hao | North Carolina State University |
Organizer: Mazumdar, Yi | Georgia Institute of Technology |
Organizer: Vikas, Vishesh | University of Alabama |
Organizer: Xia, Fangzhou | The University of Texas at Austin |
Organizer: Zhang, Jun | University of Nevada Reno |
Organizer: He, Binghan | The University of Texas at San Antonio |
Organizer: Zhang, Qiang | The University of Alabama |
Organizer: Han, Feng | New York Institute of Technology |
Organizer: Zuo, Shan | University of Connecticut |
|
15:30-15:45, Paper ThC04.1 | |
End-Effector Position Estimation on Off-Road Vehicles Using IMUs (I) |
|
Daroudi, Sajjad | University of Minnesota |
Gust, Michael J | University of Minnesota |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Fluid power control, Mechanical systems/robotics, Sensor fusion
Abstract: Estimating the position of the bucket or tool on an agricultural/construction vehicle is becoming increasingly important to enable operator assistance such as automation of repetitive movements. Such end-effector position estimation is normally done through measurement of individual actuator’s movements inside kinematic linkage mechanisms that move the end-effectors. This paper develops an alternate inertial measurement unit (IMU) based end-effector position estimation system that offers significant advantages of low cost and easy installation. An IMU located on a rotating linkage in a mechanism is used to estimate the angular motion of the linkage. Key challenges arise from the fact that the accelerometer signals of the IMU experience significant disturbances from dynamic accelerations and from vehicle and terrain-induced vibrations. First, an adaptive feedforward algorithm is used to remove the influence of vibrations on the accelerometer signals. Then a nonlinear observer is utilized to combine accelerometer and gyroscope signals and reject the influence of vehicle accelerations. Experimental results are presented from a laboratory test rig and preliminary experimental results from a full-scale tracked skid steer loader vehicle. The results show that an accuracy better than 1 degree in linkage orientation estimation is achieved in the presence of vibration disturbances.
|
|
15:45-16:00, Paper ThC04.2 | |
Digital Implementation of Tracking and Damping Control Based on Hybrid Integrator-Gain System for a MEMS Force Sensor (I) |
|
Dadkhah, Diyako | University of Texas at Dallas |
Khodabakhshi, Erfan | University of Texas at Dallas |
Moheimani, S.O. Reza | University of Texas at Dallas |
Keywords: MEMs and Nano systems, Mechatronics, Control applications
Abstract: We propose a novel control approach for a lightly damped single-input, single-output microelectromechanical system (MEMS) force sensor. The dual-loop control scheme consists of an inner damping loop and an outer tracking loop. A damping controller improves the closed-loop bandwidth by damping the dominant mode of the force sensor, followed by a tracking controller in the outer loop for accurate reference tracking. A discrete-time Hybrid Integrator-Gain System (HIGS) is utilized for damping, and a discrete-time HIGS-based integrator is tuned for reference tracking. We report a substantial damping of pmb{24.52,mathrm{dB}} at the resonance. The effectiveness of the proposed control strategy is demonstrated experimentally through multiple tests on a single-axis MEMS force sensor used for dynamic force measurement. The force measurement error remains below pmb{18.5,%} for external forces up to a bandwidth of pmb{100,mathrm{Hz}}.
|
|
16:00-16:15, Paper ThC04.3 | |
Physics-Informed Machine Learning-Based Chattering Prediction in Milling Process (I) |
|
Huang, Yi | Rutgers University |
Han, Feng | New York Institute of Technology |
Zheng, Tianyuan | Rutgers University |
Hu, Liwen | Rutgers |
Yi, Jingang | Rutgers University |
Guo, Yuebin | Rutgers University |
Keywords: Manufacturing systems, Mechatronics, Machine learning
Abstract: Chattering is a self-excited vibration phenomenon that results in poor surface quality of the workpiece in machining process. Analytical prediction of chattering initiation requires exact knowledge of milling process dynamics. Machine learning (ML) classification tries to effectively incorporate measurement data for real-time chattering prediction and suppression control. This paper proposes a physics-informed ML-based approach by integrating a deep neural network model for off-line dynamic parameter identification and a long short-term memory model for online real-time chattering prediction. In the offline phase, we extract critical dynamics parameters that inform the subsequent online chattering prediction. Experiments are conducted to validate and demonstrate the chattering prediction design. The comparison with another ML-based chattering prediction method is also presented to confirm the superior performance and reliability of the proposed approach.
|
|
16:15-16:30, Paper ThC04.4 | |
Enhanced Modeling of Twisted String Actuators with Low-Torque Motors Accounting for Strings’ Friction and Opposing Torque (I) |
|
Konda, Revanth | Georgia Institute of Technology |
Zhang, Jun | University of Nevada Reno |
Keywords: Modeling
Abstract: Twisted string actuators (TSAs) have shown strong promise in emerging applications, such as soft robotics and assistive robotics. To construct compact and lightweight TSAs, it is often inevitable to use low-torque and low-speed motors. An accurate TSA model can facilitate the appropriate design of motor-string components and enable TSA’s reliable operation. However, existing models often neglect the twisted strings’ friction and the exerted opposing torque on the motor, which would result in significant discrepancies in predicting the dynamic behaviors of TSAs with low-torque and low-speed motors. This work presents an enhanced model to accurately capture TSA’s dynamics by accounting for the aforementioned phenomena. The theory of torsional closed-wrapped helical springs is used to capture the friction between the twisted strings. The total elastic potential energy of the strings considering string curvature is used to derive the total opposing torque exerted by the twisted strings. The proposed model is experimentally identified, validated, and compared with an existing TSA model to confirm its superior accuracy.
|
|
16:30-16:45, Paper ThC04.5 | |
Dynamic Modeling and Motion Control of a TCA-Actuated Robotic Arm with Elbow and Wrist Joints (I) |
|
Zhang, Yunsong | Peking University |
Zhang, Feitian | Peking University |
Keywords: Robotics, Modeling, Control applications
Abstract: Twisted and coiled actuators (TCAs) have found extensive application in robotic devices due to their inherent compliance, large force generation, and lightweight characteristics. These attributes endow the TCA-driven robotic arm with significant potential in human-robot interaction. This paper presents a TCA-actuated robotic arm system, where one module emulates the motion of the human upper arm with an elbow joint and the other mimics the wrist, resulting in a highly anthropomorphic structural design. For such a robotic arm system, this paper develops both a motion kinematics model and a Lagrangian dynamics model. Additionally, a nonlinear model predictive controller (NMPC) is designed for trajectory tracking while ensuring the safe TCA actuation of the dynamic system. Experiments are systematically conducted, the results of which demonstrate that the robotic arm system offers a wide range of motion, capable of tracking three-dimensional reference movements with relatively high accuracy.
|
|
16:45-17:00, Paper ThC04.6 | |
Optimal Deployment of FMCW Radar for Detection of Car Doors in Multiple Scenarios (I) |
|
Lei, Zike | University of Windsor |
Chen, Xi | Wuhan University of Science and Technology |
Chen, Xiang | University of Windsor |
Tan, Ying | University of Melbourne |
Keywords: Sensor networks, Optimal control, Automotive systems
Abstract: Frequency-modulated continuous wave (FMCW) radars are increasingly installed on car doors to enhance collision avoidance. Determining the optimal placement of these radars to effectively cover the movements of both doors, while accounting for their opening and closing dynamics, presents a significant challenge. This work focuses on developing a deployment strategy for FMCW radars that addresses three worst-case scenarios: both front and rear doors closed, one door open while the other remains closed, and vice versa. For each scenario, corresponding costs are defined to evaluate the radar’s coverage performance. These costs are incorporated into an integrated cost function that assesses overall coverage efficacy across multiple car door scenarios. The cost function is then maximized using the Luus-Jaakola algorithm to derive an optimized deployment solution. The effectiveness of this optimized deployment is validated through simulations, with comparisons to commonly used strategies, demonstrating the improvements achieved by the proposed approach.
|
|
ThC05 |
Governor's Sq. 9 |
Biomedical Systems |
Regular Session |
Chair: Romagnoli, Raffaele | Duquesne University |
Co-Chair: Kumar, Gautam | San Jose State University |
|
15:30-15:45, Paper ThC05.1 | |
Proportional-Integral Controller-Based Deep Brain Stimulation Strategy for Controlling Excitatory-Inhibitory Network Synchronization |
|
Olumuyiwa, Aanuoluwapo | San Jose State University |
Kumar, Gautam | San Jose State University |
Keywords: Biomedical, Emerging control applications
Abstract: Deep brain stimulation (DBS) has emerged as a potential therapy for disrupting pathological synchronous firing patterns of neurons and restoring healthy oscillations in many brain disorders when pharmacological interventions fail. However, existing DBS devices require extensive clinical interventions in tuning stimulation parameters over the treatment period. In our previous works, we developed a novel neurostimulation motif, referred to as ”Forced Temporal Spike-Time Stimulation” (FTSTS), which can reliably and robustly desynchronize the excessively synchronized excitatory-inhibitory (E- I) networks by harnessing E-to-I synaptic plasticity. However, our FTSTS protocol was an open loop and required us to manually titrate the FTSTS parameters, such as amplitude, frequency, and pulse width. In this work, we close the DBS loop by developing a proportional-integral (PI) controller-based FTSTS strategy to tune the stimulation amplitude. Using an E-I network model, we perform the spectral analysis of spiking data to determine the correlation between the network synchrony and the mean population firing rate of E and I neurons. We systematically investigate the stimulation parameter space to investigate the effects of amplitude and frequency on the mean firing rate of E and I neurons. We design a PI controller to tune the stimulation amplitude using the mean firing rate of I neurons as a feedback signal. We demonstrate the feasibility of our approach in controlling the neuronal synchronization in the E-I network consisting of 400 E and 100 I neurons through various case studies.
|
|
15:45-16:00, Paper ThC05.2 | |
Observer-Based Control for a Tumor Growth Model with Delayed Output Measurement |
|
Arezki, Hasni | University of Genova (Italy) University of Lorraine (France) |
Zemouche, Ali | CRAN UMR CNRS 7039 & Université De Lorraine |
Bagnerini, Patrizia | University of Genoa |
Keywords: Nonlinear output feedback, Delay systems, Biomedical
Abstract: This paper addresses the design of control laws for systems with delayed outputs. We first propose a new observer design technique that employs a dynamic extension method to transform a system with delayed nonlinear outputs into one with linear outputs and a delay-dependent integral term in the dynamic process. We then introduce a novel controller-based observer for nonlinear systems with delayed outputs, specifically tackling challenges related to sensor latency and communication delays. Our approach is applied to a tumor growth model, demonstrating its effectiveness in stabilizing the system.
|
|
16:00-16:15, Paper ThC05.3 | |
Expediting Human Motor Learning in High-Dimensional De-Novo Tasks Via Online Curriculum Design |
|
Kamboj, Ankur | Michigan State University |
Ranganathan, Rajiv | Michigan State University |
Tan, Xiaobo | Michigan State University |
Srivastava, Vaibhav | Michigan State University |
Keywords: Human-in-the-loop control, Predictive control for nonlinear systems
Abstract: While recent advancements in motor learning have emphasized the critical role of systematic task scheduling in enhancing task learning, the heuristic design of task schedules remains predominant. Random task scheduling can lead to sub-optimal motor learning, whereas performance-based scheduling might not be adequate for complex motor skill acquisition. This paper addresses these challenges by proposing a model-based approach for online skill estimation and individualized task scheduling in de-novo (novel) motor learning tasks. We introduce a framework utilizing a personalized human motor learning model and particle filter for skill state estimation, coupled with a stochastic nonlinear model predictive control (SNMPC) strategy to optimize curriculum design for a high-dimensional motor task. Simulation results show the effectiveness of our framework in estimating the latent skill state, and the efficacy of the framework in accelerating skill learning. Furthermore, a human subject study shows that the group with the SNMPC-based curriculum design exhibited expedited skill learning and improved task performance. Our contributions offer a pathway towards expedited motor learning across various novel tasks, with implications for enhancing rehabilitation and skill acquisition processes.
|
|
16:15-16:30, Paper ThC05.4 | |
Adaptive Ankle Torque Control for Bipedal Humanoid Walking on Surfaces with Unknown Horizontal and Vertical Motion |
|
Stewart, Jacob | University of Southern California |
Chang, I-Chia | Purdue University |
Gu, Yan | Purdue University |
Ioannou, Petros A. | Univ. of Southern California |
Keywords: Adaptive control, Robotics, Hybrid systems
Abstract: Achieving stable bipedal walking on surfaces with unknown motion remains a challenging control problem due to the hybrid, time-varying, partially unknown dynamics of the robot and the difficulty of accurate state and surface motion estimation. Surface motion imposes uncertainty on both system parameters and non-homogeneous disturbance in the walking robot dynamics. In this paper, we design an adaptive ankle torque controller to simultaneously address these two uncertainties and propose a step-length planner to minimize the required control torque. Typically, an adaptive controller is used for a continuous system. To apply adaptive control on a hybrid system such as a walking robot, an intermediate command profile is introduced to ensure a continuous error system. Simulations on a planar bipedal robot, along with comparisons against a baseline controller, demonstrate that the proposed approach effectively ensures stable walking and accurate tracking under unknown, time-varying disturbances.
|
|
16:30-16:45, Paper ThC05.5 | |
Control-Oriented Models Inform Synthetic Biology Strategies in CAR T Cell Immunotherapy |
|
Romagnoli, Raffaele | Duquesne University |
Keywords: Biological systems, Feedback linearization, Systems biology
Abstract: Chimeric antigen receptor (CAR) T cell therapy is revolutionizing the treatment of blood cancers. Mathematical models that can predict the effectiveness of immunotherapies such as CAR T are of increasing interest due to their ability to reduce the number of experiments performed and to guide the theoretical development of new therapeutic strategies. Following this rationale, we propose the use of control-oriented models to guide the augmentation of CAR T therapy with synthetic gene circuitry. Here we present an initial investigation where we adapt a previously developed CAR T model for control-oriented purposes. We model this enhancement of CAR T as an additional control input and we propose an analysis to show its effectiveness in terms of tumor clearance.
|
|
16:45-17:00, Paper ThC05.6 | |
Body Fluid Estimation During Standard Ultrafiltration in Chronic Kidney Disease |
|
Abohtyra, Rammah | The University of Texas Permian Basin |
Beg, Omar | The University of Texas Permian Basin |
Keywords: Estimation, Biomedical, Biological systems
Abstract: Background: Effective management of body fluid volumes and precise ultrafiltration (UF) prescription are critical challenges in treating Chronic Kidney Disease (CKD) patients undergoing hemodialysis (HD). Current fluid estimation techniques rely on fluid infusion or restricted UF protocols, which are difficult to implement consistently in daily clinical practice. Objective: This work aims to evaluate whether current blood concentration measurement techniques can identify fluid and absolute blood volumes during regular HD treatments with standard ultrafiltration (UF) profiles (constant rates). Methods: The proposed method is independent of any specific hematocrit sensor, UF rate, or volume infusion protocol. It utilizes modeling and prediction algorithms to quantify errors in fluid volume estimations. Results: The method was tested on model-generated data from two patients under constant UF profiles. Extracellular (plasma and interstitial) fluid and absolute blood volumes were accurately estimated. In one case, specific blood volume dropped from 65 mL/kg to 61 mL/kg, while in the other, it remained above the critical threshold of 65 mL/kg. Conclusion: This estimation algorithm can be easily integrated into existing HD machines, potentially improving treatment outcomes for CKD patients.
|
|
ThC06 |
Governor's Sq. 10 |
Optimal Control IV |
Regular Session |
Chair: Kia, Solmaz S. | University of California Irvine (UCI) |
Co-Chair: Liu, Jun | University of Waterloo |
|
15:30-15:45, Paper ThC06.1 | |
Hybrid Feedback for Three-Dimensional Convex Obstacle Avoidance |
|
Sawant, Mayur | Lakehead University |
Polushin, Ilia G. | Western University |
Tayebi, Abdelhamid | Lakehead University |
Keywords: Autonomous robots, Hybrid systems
Abstract: We propose a hybrid feedback control scheme for the autonomous robot navigation problem in three-dimensional environments with arbitrarily-shaped convex obstacles. The proposed hybrid control strategy, which consists in switching between the move-to-target mode and the obstacle-avoidance mode, guarantees global asymptotic stability of the target location in the obstacle-free workspace. We also provide a procedure for the implementation of the proposed hybrid controller in a priori unknown environments and validate its effectiveness through simulation results.
|
|
15:45-16:00, Paper ThC06.2 | |
Signal Temporal Logic Planning with Time-Varying Robustness |
|
Yuan, Yating | University of Waterloo |
Quartz, Thanin | University of Waterloo |
Liu, Jun | University of Waterloo |
Keywords: Optimization, Robotics, Control applications
Abstract: This letter aims to generate a continuous-time trajectory consisting of piecewise Bézier curves that satisfy signal temporal logic (STL) specifications with piecewise time-varying robustness. The time-varying robustness is less conservative than the real-valued robustness, which enables more effective tracking in practical applications. Specifically, the continuous-time trajectories account for dynamic feasibility, leading to smaller tracking errors and ensuring that the STL specifications can be met by the tracking trajectory. Comparative experiments demonstrate the efficiency and effectiveness of the proposed approach.
|
|
16:00-16:15, Paper ThC06.3 | |
Active Perception with Initial-State Uncertainty: A Policy Gradient Method |
|
Shi, Chongyang | University of Florida |
Han, Shuo | University of Illinois Chicago |
Dorothy, Michael | US Army Research Laboratory |
Fu, Jie | University of Florida |
Keywords: Markov processes, Information theory and control
Abstract: This paper studies the synthesis of an active perception policy that maximizes the information leakage of the initial state in a stochastic system modeled as a hidden Markov model (HMM). Specifically, the emission function of the HMM is controllable with a set of perception or sensor query actions. Given the goal is to infer the initial state from partial observations in the HMM, we use Shannon conditional entropy as the planning objective and develop a novel policy gradient method with convergence guarantees. By leveraging a variant of observable operators in HMMs, we prove several important properties of the gradient of the conditional entropy with respect to the policy parameters, which allow efficient computation of the policy gradient and stable and fast convergence. We demonstrate the effectiveness of our solution by applying it to an inference problem in a stochastic grid world environment.
|
|
16:15-16:30, Paper ThC06.4 | |
On Output-Feedback Control of Unknown Nonlinear Systems Via Prescribed Performance Observers |
|
Trakas, Panagiotis | University of Patras |
Verginis, Christos | Uppsala University |
Bechlioulis, Charalampos P. | University of Patras |
Keywords: Nonlinear output feedback, Observers for nonlinear systems, Constrained control
Abstract: In this work, we introduce a low-complexity output-feedback control scheme imposing prescribed performance characteristics for unknown high-order nonlinear systems. We design a novel robust observer with adaptive gains in order to mitigate undesirable high steady-state gains. The controller incorporates an adaptive mechanism to ensure closed-loop signal boundedness, particularly during transient. Furthermore, we provide a nonlinear separation principle to demonstrate the recovery of closed-loop performance under state-feedback. Simulation results validate the theoretical findings and demonstrate the efficacy of the proposed controller.
|
|
16:30-16:45, Paper ThC06.5 | |
FORWARD: Feasibility Oriented Random-Walk Inspired Algorithm for Radial Reconfiguration in Distribution Networks |
|
Vendrell Gallart, Joan | University of California Irvine |
Bent, Russell | Los Alamos National Laboratory |
Kia, Solmaz S. | University of California Irvine (UCI) |
Keywords: Transportation networks, Power systems, Energy systems
Abstract: We consider an optimal flow distribution problem in which the goal is to find a radial configuration that minimizes resistance-induced quadratic distribution costs while ensuring delivery of inputs from multiple sources to all sinks to meet their demands. This problem has critical applications in various distribution systems, such as electricity, where efficient energy flow is crucial for both economic and environmental reasons. Due to its complexity, finding an optimal solution is computationally challenging and NP-hard. In this paper, we propose a novel algorithm called FORWARD, which leverages graph theory to efficiently identify feasible configurations in polynomial time. By drawing parallels with random walk processes on electricity networks, our method simplifies the search space, significantly reducing computational effort while maintaining performance. The FORWARD algorithm employs a combination of network preprocessing, intelligent partitioning, and strategic sampling to construct radial configurations that meet flow requirements, finding a feasible solution in polynomial time. Numerical experiments demonstrate the effectiveness of our approach, highlighting its potential for real-world applications in optimizing distribution networks.
|
|
16:45-17:00, Paper ThC06.6 | |
Accelerated Controller Tuning Using Human Feedback and Multi-Task Preferential Bayesian Optimization |
|
Coutinho, João | University of Coimbra |
Peng, You | Dow |
Rendall, Ricardo | Dow Inc |
Rizzo, Caterina | Dow Chemical |
Ma, Kaiwen | The Dow Chemical Company |
Chin, Swee-Teng | The Dow Chemical |
Castillo, Ivan | The Dow Chemical Company |
Reis, Marco | University of Coimbra |
Keywords: Human-in-the-loop control, Machine learning, Computer-aided control design
Abstract: Controller design often requires balancing multiple conflicting objectives, making it hard to define a single objective function or trade-offs between multiple objectives to achieve desired performance. Users find it easier to express preferences through pairwise comparisons such as 'I prefer A over B'. Preferential Bayesian Optimization (PBO) uses this feedback to optimize a utility function that reflects the user's preferences towards different criteria. However, when a new task is present, PBO typically ignores previous comparison data, leading to learning of preferences for new tasks from scratch. We introduce Multi-Task PBO (MTPBO), which leverages preferential data from related tasks to speed up optimization for new tasks. Applied to both synthetic case studies and a human-in-the-loop controller tuning problem, MTPBO demonstrates superior performance, finding more preferred responses with a lower experimental budget.
|
|
ThC07 |
Governor's Sq. 11 |
Energy Management in Vehicles |
Invited Session |
Chair: Shao, Yunli | University of Georgia |
Co-Chair: Pangborn, Herschel | The Pennsylvania State University |
Organizer: Kwak, Kyoung Hyun | University of Michigan - Dearborn |
Organizer: Pangborn, Herschel | The Pennsylvania State University |
Organizer: Sawodny, Oliver | University of Stuttgart |
Organizer: Nazari, Shima | UC Davis |
|
15:30-15:45, Paper ThC07.1 | |
Energy-Efficient Automated Driving for Everyday Maneuvers: Fundamentals to Experimentation (I) |
|
Ard, Tyler | Argonne National Lab |
Han, Jihun | Argonne National Laboratory |
Gupta, Prakhar | Clemson University |
Karbowski, Dominik | Argonne National Laboratory |
Jia, Yunyi | Clemson Universtiy |
Vahidi, Ardalan | Clemson University |
Keywords: Autonomous vehicles, Optimal control
Abstract: Energy-efficient driving is a key advancement in the deployment of automated vehicles once safety concerns are addressed. This paper formulates the energy-efficient driving problem with constraints and explores various solution methods for common driving scenarios. The findings, rooted in theory of optimal control and Pontryagin's Minimum Principle (PMP), offer fundamental insights into energy-efficient driving strategies in every-day driving scenarios. Analytical insights from PMP coupled with fast analytical solution of respective boundary value problem, enabled implementation in a real-time control system and near-optimal energy savings. The proposed approach was validated through real vehicle testing on the track, with results demonstrating that automated eco-driving can achieve significant energy savings over human drivers in basic daily driving scenarios. This study not only highlights the effectiveness of the proposed approach but also provides practical guidance for integrating energy-efficient driving strategies into real-world automated driving and advanced driver assistance systems.
|
|
15:45-16:00, Paper ThC07.2 | |
Model Predictive Control with AI Based Predictors for Energy Management in Hybrid Vehicles (I) |
|
Cavanini, Luca | Università Politecnica Delle Marche |
Majecki, Pawel | University of Strathclyde |
Grimble, Michael John | University of Strathclyde |
Sasikumar, Lakshmy Vazhayil | NXP Semiconductors |
Hillier, Curt | NXP Semiconductors |
Keywords: Automotive control, Automotive systems, Machine learning
Abstract: Linear Parameter-Varying Model Predictive Control has been shown to provide an effective design approach for developing an Energy Management System for Hybrid Electric Vehicles. However, despite the good performance achieved, modern data-driven Artificial Intelligence methods can improve the performance due to the approximations involved in generating the models. An approach is described for reducing the sub-optimality due to the modelling problem in predictive control using an AI algorithm belonging to the class of data-driven Machine Learning techniques. This provides more effective vehicle speed and driver torque demand predictions that are used within the predictive controller. The proposed combined policy is compared with a baseline control design developed using the well-known Equivalent Consumption Minimization Strategy and an MPC neglecting the use of AI predictors.
|
|
16:00-16:15, Paper ThC07.3 | |
Tri-Level Control Co-Design for Series Electric-Hydraulic Hybrid Vehicles (I) |
|
Taaghi, Amirhossein | Oakland University |
Yoon, Yongsoon | Oakland University |
Keywords: Automotive control, Optimization, Machine learning
Abstract: This paper presents a tri-level control co-design method for series electric-hydraulic hybrid vehicles, aimed at reducing battery energy consumption, thereby extending battery lifetime and driving range, particularly for heavy-duty transportation. The method formulates a tri-level optimization problem to achieve global optimality in system configuration, hydraulic energy storage sizing, and high-level energy control. It employs three nested loops: the outer loop selects the optimal configuration of hydraulic pumps and motors for efficient bi- directional energy conversion; the middle loop uses a genetic algorithm to optimize the volume and pre-charge pressure of the accumulator; and the inner loop applies dynamic programming to optimizes the energy control strategy, generating an unstructured open-loop solution. A single objective function, i.e. battery energy consumption, is used in all loops to find globally optimal solutions offline. Finally, for online implementation of the offline energy control, particularly in the closed-loop structure, a recurrent neural network is trained to replicate the dynamic programming solution. Numerical simulations show that the proposed method outperforms the conventional sequential design method in terms of battery energy saving.
|
|
16:15-16:30, Paper ThC07.4 | |
Energy Consumption in Electric School Buses at Cold Conditions: A Study of Thermal Conditioning Strategies (I) |
|
Ma, Jingchen | University of Michigan |
Tran, Vivian | University of Michigan, Ann Arbor |
Siegel, Jason B. | University of Michigan |
Kim, Youngki | University of Michigan - Dearborn |
Stefanopoulou, Anna G. | University of Michigan |
Keywords: Energy systems, Modeling, Control applications
Abstract: This study investigates the impact of thermal conditioning (TC) on the energy consumption and driving range of electric vehicles in cold weather, especially for vehicles with low utilization and long parking durations, such as electric school buses (ESBs). Two types of TC are considered during parking. The trickle thermal conditioning (TTC) strategy maintains pack temperature above freezing throughout the parking duration. The fast thermal conditioning (FTC), where the pack is heated fast from potentially subfreezing temperatures depending on the parking time and the environmental temperature. The FTC strategy may require a bigger heater than the TTC, but the TTC strategy may consume more grid energy than the FTC, depending on the parking time. We compare TTC and FTC with no thermal conditioning (NoTC) with respect to the driving range and efficiency. The three strategies (TTC, FTC, and NoTC) are analyzed using an electrical-thermal battery pack model coupled with simple control algorithms to capture the effects of a battery thermal management system (BTMS). The simulated model is validated using real-world winter operating data from three ESBs in three Michigan school districts, covering both the transportation of pupils and extended parking periods. To emulate the BTMS two control approaches are designed: (i) a PI controller for the heating during driving and charging to emulate the heating needed at subfreezing conditions because ESBs drive at low speeds and do not generate enough self-heating and (ii) a thermostatic controller for the TTC and FTC applied to two different size heaters. The simulation results show that FTC can recover 40% of the driving range lost when operating at -10 °C compared to operation at 25 °C, and reduce specific energy consumption by 5% and 3% compared to TTC and NoTC, respectively. These thermal conditioning simulations can be transformed to a digital twin that weighs capital (heater size) and operational cost (TC setpoint) for ESB fleet management in cold regions.
|
|
16:30-16:45, Paper ThC07.5 | |
Co-Optimization of Vehicle Dynamics and Powertrain Management for Connected and Automated Electric Vehicles (I) |
|
Li, Zongtan | University of Georgia |
Shao, Yunli | University of Georgia |
Keywords: Automotive control, Automotive systems, Optimal control
Abstract: Connected and automated vehicles (CAVs) represent the future of transportation, utilizing detailed traffic information to enhance control and decision-making. Eco-driving of CAVs has the potential to significantly improve energy efficiency, and the benefits are maximized when both vehicle speed and powertrain operation are optimized. In this paper, we studied the co-optimization of vehicle speed and powertrain management for energy savings in a dual-motor electric vehicle. Control-oriented vehicle dynamics and electric powertrain models were developed to transform the problem into an optimal control problem specifically designed to facilitate real-time computation. Simulation validation was conducted using real-world data calibrated traffic simulation scenarios in Chattanooga, TN. Evaluation results demonstrated a 12.80-24.52% reduction in the vehicle's power consumption under ideal predicted traffic conditions. Energy benefits are maintained with various prediction uncertainties, such as Gaussian process uncertainties on acceleration and time-shift effects on predicted speed. The energy savings of the proposed eco-driving strategy are achieved through effective speed control and optimized torque allocation. The proposed model can be extended to various CAV and electric vehicle applications, with potential adaptability to diverse traffic scenarios.
|
|
16:45-17:00, Paper ThC07.6 | |
Integrated Power and Thermal Management for Reducing Battery Degradation in Electrified Connected and Automated Vehicles |
|
Li, Dongjun | National University of Singapore |
Hu, Qiuhao | University of Michigan |
Dong, Haoxuan | National University of Singapore |
Song, Ziyou | University of Michigan, Ann Arbor |
Keywords: Automotive control, Automotive systems
Abstract: Electrified connected and automated vehicles (CAVs) offer advanced prediction capabilities to revolutionize battery thermal management, thereby reducing energy consumption and battery degradation. The main challenges, however, lie in the complexities of coupled multi-objective optimization and multi-timescale dynamics, including both fast vehicle dynamics and slow battery thermal dynamics. In this study, we introduce an integrated power and thermal management strategy designed to enhance energy efficiency while minimizing battery degradation, all while ensuring thermal and traffic safety for CAVs. Leveraging the multi-horizon model predictive control framework, the objective function is reformulated based on the degradation loss term to better address these challenges. Our findings suggest that an effective management strategy requires reducing peak power and scheduling battery cooling when traction power is low to balance the trade-off between cooling energy efficiency and battery degradation loss. Simulation results show the proposed approach achieves a 5.60% reduction in cooling energy, a 3.02% reduction in traction energy, and more than 12% reduction in degradation loss, ensuring optimal energy efficiency and battery longevity across various driving conditions.
|
|
ThC08 |
Governor's Sq. 12 |
Design and Operation of Energy Systems |
Invited Session |
Chair: Fleming, Paul | National Renewable Energy Laboratory |
Co-Chair: van Wingerden, Jan-Willem | Delft University of Technology |
Organizer: Blizard, Audrey | The Ohio State University |
Organizer: Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Organizer: Deshpande, Vedang M. | Mitsubishi Electric Research Laboratories |
Organizer: Jain, Neera | Purdue University |
Organizer: Docimo, Donald | Texas Tech University |
Organizer: Pangborn, Herschel | The Pennsylvania State University |
Organizer: Mulders, Sebastiaan Paul | Delft University of Technology |
Organizer: Sinner, Michael | National Renewable Energy Laboratory |
Organizer: Bay, Christopher | National Renewable Energy Laboratory |
Organizer: van Wingerden, Jan-Willem | Delft University of Technology |
Organizer: Fleming, Paul | National Renewable Energy Laboratory |
|
15:30-15:45, Paper ThC08.1 | |
Learning-Enhanced Distributed MPC for Optimal Building Control (I) |
|
Wald, Dylan | Colorado School of Mines, National Renewable Energy Laboratory |
Johnson, Kathryn | Colorado School of Mines |
Sinner, Michael | National Renewable Energy Laboratory |
King, Jennifer | National Renewable Energy Laboratory |
Keywords: Distributed control, Machine learning, Energy systems
Abstract: Distributed algorithms have proven successful in the control of large, complex systems such as buildings. However, many distributed control algorithms, such as distributed model predictive control (DMPC), depend on reliable communication between decoupled subsystems to converge to the optimal centralized solution. In practice, these networks are subject to communication losses. This proof-of-concept work proposes a learning-enhanced DMPC method to infer local subsystem values lost due to communication failure. Using the gated recurrent unit (GRU) deep learning architecture, these lost values are inferred from only local subsystem states, then used to update a subsystem’s control action. By analyzing system cost, we show that some performance of DMPC can be recovered even when the lost values are not perfectly predicted by the GRU models, improving system resilience.
|
|
15:45-16:00, Paper ThC08.2 | |
Decomposition-Based Control Co-Design of Energy Systems Using Graph Models (I) |
|
Smith, Kayla | University of Illinois at Urbana-Champaign |
Alleyne, Andrew G. | University of Minnesota |
Keywords: Optimization, Energy systems, Large-scale systems
Abstract: Efficient design and operation of energy systems are crucial for minimizing wasted energy, lowering greenhouse gas emissions, and minimizing energy-related costs for individuals and businesses. Control co-design techniques have been applied to energy systems to optimize them. Control codesign techniques are conventionally formulated to optimize an entire energy system in one problem formulation. However, as energy systems can be composed of many subsystems, this can result in optimization problems that are near intractable and computationally expensive. Therefore, it can be beneficial to decompose the control co-design formulations. A decomposition-based control co-design framework is developed using graph-based dynamic models and augmented Lagrangian coordination techniques. The graph-based modeling framework is based on conservation of energy and can be applied to multiple energy domains, making them versatile and able to capture the interactions among components. This framework is applied to optimize a notional aircraft fuel thermal management system. The decomposition-based control co-design approach is shown to result in a design that performs within 1.3% of the optimized all-at-once approach design. This approach leads to a component-wise system optimization with results near the optimal design.
|
|
16:00-16:15, Paper ThC08.3 | |
Co-Design of Multi-Terminal DC Transmission Systems Topology and Energy Storage for Offshore Wind Farm Grid Interconnection (I) |
|
Wang, Wei | Pacific Northwest National Laboratory |
Sharma, Himanshu | Pacific Northwest National Laboratory |
Huang, Bowen | PNNL |
She, Buxin | Pacific Northwest National Laboratory |
Ramachandran, Thiagarajan | Pacific Northwest National Laboratory |
Adetola, Veronica | Pacific Northwest National Lab |
Keywords: Optimization, Power systems, Energy systems
Abstract: Offshore wind farms (OWFs) are poised to play a crucial role in meeting U.S. clean energy goals. Efficiently integrating OWFs into the onshore grid is vital for both developers and the power grid. For renewable developers, optimizing OWF interconnection topology while considering net revenue is essential. Recent studies highlight the advantages of integrating energy storage with renewables for stable opera- tions, emphasizing correct sizing and operation. To address this, we developed a co-design optimization framework for OWFs. It selects the optimal interconnection topology and determines energy storage sizing and operations, aiming to maximize net profit while considering energy market participation. The framework accounts for uncertainties associated with renewable resources and employs a multi-timescale stochastic optimization approach. We demonstrated the effectiveness of this approach on a proposed U.S. West Coast offshore wind farm site with five OWFs. The co-design solutions with energy storage show a significant improvement in net revenue compared to a baseline case with fixed interconnection topology and static energy storage sizing and operations.
|
|
16:15-16:30, Paper ThC08.4 | |
Optimal and Reinforcement Learning-Based Control of a Motion-Powered Winder for Wave Energy Harvesting (I) |
|
Khan, Arsh | University of California at Berkeley |
Kuo, Ming Hon Evan | UC Berkeley |
Shorri, Arlind | University of California, Berkeley |
Alam, Reza | University of California, Berkeley |
Keywords: Fluid power control, Mechanical systems/robotics, Optimal control
Abstract: Autonomous sailboats for long-term ocean monitoring face significant energy challenges. While solar panels and indirect wind energy capture (e.g., hydroturbines installed below the hull) are common power sources, this work explores wave energy harvesting as a complementary solution. We investigate the use of an unbalanced mass in the sailboat's keel to capture wave energy, employing optimal control and reinforcement learning strategies to maximize power generation. Our results demonstrate that an optimal control strategy can enhance power generation efficiency by 70%, while a real-time reinforcement learning-based control algorithm can achieve a 20% improvement under simulated oceanic conditions. The latter algorithm learns and adapts to the dynamics of motion in changing sea states and varying wave spectra, including both frequency and directional components, to optimize system parameters and maximize energy absorption.
|
|
16:30-16:45, Paper ThC08.5 | |
Optimizing Electrolyzers: Simultaneous Degradation Minimization and Hydrogen Flow Maximization with Mixed-Integer Programming (I) |
|
Vijayshankar, Sanjana | NREL |
Tully, Zachary | Colorado School of Mines |
Koleva, Mariya | NREL |
Reznicek, Evan | National Renewable Energy Laboratory |
Johnson, Kathryn | Colorado School of Mines |
King, Jennifer | National Renewable Energy Laboratory |
Keywords: Energy systems, Optimization, Optimization algorithms
Abstract: We present a comprehensive framework for the simultaneous hydrogen flow maximization and degradation mitigation using mixed integer programming in proton exchange membrane electrolyzers. Our approach begins by modeling the electrolyzer as a linear representation that aligns with available data. We then introduce a novel method for quantifying the costs associated with electrolyzer shutdowns and startups, recognizing their impact on degradation. Finally, we introduce an optimization framework for simultaneous hydrogen flow maximization and degradation minimization using mixed integer linear programming. Our findings offer valuable insights into operational strategies for electrolyzers powered by wind energy across diverse regions within the United States.
|
|
16:45-17:00, Paper ThC08.6 | |
Propagation of Reactive-Power Disturbances in Inverter-Based Microgrids (I) |
|
Roy, Sandip | Washington State University |
Nandanoori, Sai Pushpak | Pacific Northwest National Laboratory |
Kundu, Soumya | Pacific Northwest National Laboratory |
Adetola, Veronica | Pacific Northwest National Lab |
Keywords: Control of networks, Power systems, Network analysis and control
Abstract: The disturbance response of the voltage dynamics for droop-controlled inverter-based microgrids is examined, from a network-theoretic perspective. Specifically, conditions are derived such that disturbances are attenuated away from their source in the network. These conditions are interpreted to give insight into droop-control design and also reactive-power compensation in microgrids. Additionally, when the conditions are not met, the frequency ranges over which disturbance inputs are amplified vs attenuated are determined. Examples are presented to illustrate the formal results, and explore their tightness.
|
|
ThC09 |
Governor's Sq. 14 |
Structure Exploiting Reinforcement Learning for Networked Systems |
Tutorial Session |
Chair: Wierman, Adam | California Institute of Technology |
Co-Chair: Qu, Guannan | Carnegie Mellon University |
Organizer: Qu, Guannan | Carnegie Mellon University |
Organizer: Wierman, Adam | California Institute of Technology |
Organizer: Li, Na | Harvard University |
|
15:30-17:00, Paper ThC09.1 | |
Structure-Exploiting Reinforcement Learning for Networked Systems (I) |
|
Qu, Guannan | Carnegie Mellon University |
Keywords: Reinforcement learning, Distributed control, Large-scale systems
Abstract: Networked systems are ubiquitous and play an indispensable role in advancing our modern society. The control and operation of such systems have long been a tremendous challenge. In the meantime, the recent advancement of Machine Learning (ML), particularly Reinforcement Learning (RL), has achieved tremendous success across different domains, exhibiting impressive capability to learn to control complex and unknown systems. Due to these advantages of RL, it has been recognized to hold great potential for revolutionizing the way we control and operate these large-scale networked systems. However, despite a rich literature on RL and Multi-Agent RL (MARL), (MA)RL algorithms are widely recognized to suffer from scalability, stability, and safety issues when it comes to large-scale networked systems. To address these challenges, there has been recent lines of works in the literature that exploits structural properties of networked systems to design more scalable MARL algorithms. This tutorial provides a holistic overview of these results, covering various types of structural properties and how to integrate these properties into MARL.
|
|
ThC10 |
Governor's Sq. 16 |
Control Applications II |
Regular Session |
Chair: Pourghorban, Arman | University of North Carolina at Charlotte |
Co-Chair: Jagtap, Pushpak | Indian Institute of Science |
|
15:30-15:45, Paper ThC10.1 | |
Towards Mitigating Sim2Real Gaps: A Formal Quantitative Approach |
|
P, Sangeerth | Indian Institute of Science |
Lavaei, Abolfazl | Newcastle University |
Jagtap, Pushpak | Indian Institute of Science |
Keywords: Control applications, Simulation, Robotics
Abstract: In this paper, we introduce the notion of simulation-gap functions to formally quantify the potential gap between an approximate nominal mathematical model and the high-fidelity simulator representation of a real system. Given a nominal mathematical model alongside a quantified simulation gap, the system can be conceptualized as one characterized by bounded states and input-dependent disturbances. This allows us to leverage the existing powerful model-based control algorithms effectively, ensuring the enforcement of desired specifications while guaranteeing a seamless transition from simulation to real-world application. To provide a formal guarantee for quantifying the simulation gap, we develop a data-driven approach. In particular, we collect data using high-fidelity simulators, leveraging recent advancements in Real-to-Sim transfer to ensure close alignment with reality. We demonstrate the effectiveness of the proposed method through experiments conducted on a nonlinear pendulum system and a nonlinear Turtlebot model in simulators.
|
|
15:45-16:00, Paper ThC10.2 | |
Controllability Gramians Make Water Safer: Water Quality and Hydraulic Regulation in Drinking Networks |
|
Elsherif, Salma M. | Vanderbilt University |
Kazma, Mohamad | Vanderbilt University |
Taha, Ahmad | Vanderbilt University |
Keywords: Emerging control applications, Optimization, Control of networks
Abstract: The operation and regulation of water distribution networks is a complex procedure aimed at efficiently supplying consumers with sufficient water while ensuring its safe quality. A critical aspect is the direct impact of the network's hydraulics on the dynamics of water quality; the former is typically needed to inform the latter. Previous studies have tackled hydraulic optimization and water quality regulation as separate problems, although they are naturally coupled. While some attempts have been made to couple these problems into a single one, such studies have neglected control-theoretic virtues of the integrated problems. This paper fills this research gap by formulating a pump control problem that accounts for water quality controllability via classic Gramians. This is achieved by integrating water quality controllability metrics into the control problem, the goal is to enhance both water quality regulation and reducing pumping costs. A case study is presented to illustrate that utilizing controllability Gramians leads to an improved overall system performance.
|
|
16:00-16:15, Paper ThC10.3 | |
Novel Angle-Constrained Guidance with Virtual Velocity Technique |
|
Yang, Luhua | Tsinghua University |
Shi, Heng | Tsinghua University |
Kuang, Minchi | Tsinghua University |
Zhu, Jihong | Tsinghua University |
Keywords: Aerospace, Control applications, Optimization
Abstract: A novel angle-constrained guidance method based on proportional navigation with the virtual velocity of the target is proposed. This method features simplicity without the need for estimating time-to-go or target acceleration information. By introducing a virtual velocity in the desired direction relative to the true target, negative feedback on the line-of-sight angle is established, ensuring the satisfaction of the terminal heading angle constraint. The dynamics of the guidance command with varying parameters are analyzed in a linear engagement scenario, specifically involving a low-speed, non-maneuvering target. Additionally, two parameter configuration approaches are proposed for terminal command convergence and optimization, clarifying their connection with previous work. Comparative numerical simulations verified the effectiveness of the proposed method in satisfying angle constraints and demonstrated its robustness against maneuvering targets.
|
|
16:15-16:30, Paper ThC10.4 | |
Bounded Input and Field-Of-View Constrained Impact Time Guidance |
|
Samrat, Ashok | Indian Institute of Technology Bombay |
Singh, Swati | Indian Institute of Technology Bombay |
Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
Keywords: Aerospace, Control applications, Autonomous systems
Abstract: This paper proposes a novel nonlinear guidance scheme tailored for intercepting stationary targets precisely at a desired impact time in a planar engagement scenario. The strategy addresses the challenges posed by the bounded field-of-view of seeker-equipped interceptors and physical actuator constraints, which can degrade the performance if unaccounted for. By integrating known actuator bounds directly into the design, the proposed guidance scheme enhances the overall effectiveness of the interceptor. The proposed approach employs an input-affine acceleration saturation model within the autopilot to handle the input constraints effectively. The acceleration model is appended to the kinematic equations to derive the guidance command. The seeker's field-of-view limitation is incorporated by utilizing the backstepping concepts using heading angle. The efficacy of the proposed strategies is demonstrated through comprehensive numerical simulations across various scenarios and compared against an existing guidance strategy.
|
|
16:30-16:45, Paper ThC10.5 | |
Cooperative Target Defense under Communication and Sensing Constraints |
|
Maity, Dipankar | University of North Carolina at Charlotte |
Pourghorban, Arman | University of North Carolina at Charlotte |
Keywords: Agents-based systems, Cooperative control
Abstract: We consider a variant of the target defense problems where a group of defenders are tasked to simultaneously capture an intruder. The intruder's objective is to reach a target without being simultaneously captured by the defender team. Some of the defenders are sensing-limited and do not have any information regarding the intruder's position or velocity at any time. The defenders may communicate with each other using a connected communication graph. We propose a decentralized feedback strategy for the defenders, which transforms the simultaneous capture problem into a nonlinear consensus problem. We derive a sufficient condition for simultaneous capture in terms of the agents' speeds, sensing, and communication capabilities. The proposed decentralized controller is evaluated through extensive numerical simulations.
|
|
16:45-17:00, Paper ThC10.6 | |
Statistical Process Monitoring of Cryogenic Air Separation Unit Startups |
|
Hassani, Bahareh | Auburn University |
Wang, Yajun | Linde Plc |
Kumar, Ankur | Praxair Technology Center |
Flores-Cerrillo, Jesus | Linde |
Wang, Jin | Auburn University |
He, Peter | Auburn University |
Keywords: Fault detection, Fault diagnosis, Manufacturing systems
Abstract: The startup phase of cryogenic air separation units (ASUs) includes complex, non-routine events characterized by dynamic, nonlinear, and nonstationary conditions. Developing an effective process monitoring framework, which is crucial for ensuring safe and efficient ASU startups, is a challenging task. This study introduces a statistical process monitoring system based on variable-wise multi-way principal component analysis (MPCA) for monitoring the startup of cryogenic ASUs. By leveraging the multi-stage multi-unit nature of startups, the proposed framework addresses the unique complexities of the system, enabling the prompt detection of slow or faulty startups. The effectiveness of the proposed monitoring approach is demonstrated in an industrial case study, which successfully identifies faulty startups.
|
|
ThC11 |
Governor's Sq. 17 |
Opinion Dynamics |
Regular Session |
Chair: Franci, Alessio | University of Liege |
Co-Chair: Zhang, Fumin | Hong Kong University of Science and Technology |
|
15:30-15:45, Paper ThC11.1 | |
Mixed Opinion Dynamics on the Unit Sphere for Multi-Agent Systems in Social Networks |
|
Zhang, Ziqiao | Purdue University |
Li, Yingke | Massachusetts Institute of Technology |
Al-Abri, Said | Georgia Institute of Technology |
Zhang, Fumin | Hong Kong University of Science and Technology |
Keywords: Stability of nonlinear systems, Networked control systems, Lyapunov methods
Abstract: In this paper, we study a multi-agent system with heterogeneous opinion dynamics, where the opinions of individuals evolve on the unit sphere according to different dynamics. We name the proposed system mixed opinion dynamics, as it comprises more than one type of opinion dynamics. We demonstrate that such a mixed system exhibits novel equilibrium behaviors that cannot be attained by a homogeneous opinion dynamic system, through theoretical analysis and simulation experiments. This work provides new insights into understanding emergent social behaviors in the real world and offers potential developments in modeling opinions' formations.
|
|
15:45-16:00, Paper ThC11.2 | |
Spatially-Invariant Opinion Dynamics on the Circle |
|
Amorim, Giovanna | Princeton University |
Bizyaeva, Anastasia | Cornell University |
Franci, Alessio | University of Liege |
Leonard, Naomi Ehrich | Princeton University |
Keywords: Adaptive systems, Biologically-inspired methods, Robotics
Abstract: We propose and analyze a nonlinear opinion dynamics model for an agent making decisions about a continuous distribution of options in the presence of input. Inspired by perceptual decision-making, we develop new theory for opinion formation in response to inputs about options distributed on the circle. Options on the circle can represent, e.g., the possible directions of perceived objects and resulting heading directions in planar robotic navigation problems. Interactions among options are encoded through a spatially invariant kernel, which we design to ensure that only a small (finite) subset of options can be favored over the continuum. We leverage the spatial invariance of the model linearization to design flexible, distributed opinion-forming behaviors using spatiotemporal frequency domain and bifurcation analysis. We illustrate our model’s versatility with an application to robotic navigation in crowded spaces.
|
|
16:00-16:15, Paper ThC11.3 | |
Opinion Dynamics with Set-Based Confidence: Convergence Criteria and Periodic Solutions |
|
Zabarianska, Iryna | Moscow Institute of Physics and Technology |
Proskurnikov, Anton V. | Politecnico Di Torino |
Keywords: Agents-based systems, Emerging control applications, Control of networks
Abstract: This paper introduces a new multidimensional extension of the Hegselmann-Krause (HK) opinion dynamics model, where opinion proximity is not determined by a norm or metric. Instead, each agent trusts opinions within the Minkowski sum x+O, where x is the agent's current opinion and O is the confidence set defining acceptable deviations. During each iteration, agents update their opinions by simultaneously averaging the trusted opinions. Unlike traditional HK systems, where O is a ball in some norm, our model allows the confidence set to be non-convex and even unbounded. We demonstrate that the new model, referred to as SCOD (Set-based Confidence Opinion Dynamics), can exhibit properties absent in the conventional HK model. Some solutions may converge to non-equilibrium points in the state space, while others oscillate periodically. These ``pathologies'' disappear if the set O is symmetric and contains zero in its interior: similar to the usual HK model, SCOD then converges in a finite number of iterations to one of the equilibrium points. The latter property is also preserved if one agent is "stubborn" and resists changing their opinion, yet still influences the others; however, two stubborn agents can lead to oscillations.
|
|
16:15-16:30, Paper ThC11.4 | |
Spiking Nonlinear Opinion Dynamics (S-NOD) for Agile Decision-Making |
|
Cathcart, Charlotte | Princeton University |
Belaustegui, Ian Xul | Princeton University |
Franci, Alessio | University of Liege |
Leonard, Naomi Ehrich | Princeton University |
Keywords: Adaptive systems, Biologically-inspired methods, Robotics
Abstract: We present, analyze, and illustrate a first-of-its-kind model of two-dimensional excitable (spiking) dynamics for decision-making over two options. The model, Spiking Nonlinear Opinion Dynamics (S-NOD), provides superior agility, characterized by fast, flexible, and adaptive response to rapid and unpredictable changes in context, environment, or information received about available options. S-NOD derives through the introduction of a single extra term to the previously presented Nonlinear Opinion Dynamics (NOD) for fast and flexible multi-agent decision-making behavior. The extra term is inspired by the fast-positive, slow-negative mixed-feedback structure of excitable systems. The agile behaviors brought about by the new excitable nature of decision-making driven by S-NOD are analyzed in a general setting and illustrated in an application to multi-robot navigation around human movers.
|
|
16:30-16:45, Paper ThC11.5 | |
Analysis of Stubborn Opinions on Networked SIS Epidemic Dynamics |
|
Xu, Qiulin | Tokyo Institute of Technology |
Masada, Tatsuya | Tokyo Institute of Technology |
Ishii, Hideaki | University of Tokyo |
Keywords: Modeling, Stability of nonlinear systems, Network analysis and control
Abstract: This paper investigates the spread of infectious diseases within a networked community, integrating the dynamics of both epidemic transmission and public opinion. We propose a novel discrete-time networked Susceptible-Infectious-Susceptible (SIS) epidemic model coupled with opinion dynamics, which takes account of the presence of stubborn agents with extreme views. The model captures the interplay between an individual's perception of epidemic severity and the actual spread of the disease. It is capable to offer a more comprehensive understanding of epidemic dynamics in a socially interconnected environment. We introduce the concept of the SIS-opinion reproduction number to assess the severity of the epidemic and analyze the conditions for natural disease eradication and the global stability of the endemic equilibrium. Numerical examples illustrate our theoretical findings.
|
|
ThC12 |
Plaza Court 1 |
Quantum Information and Control |
Regular Session |
Chair: Zlotnik, Anatoly | Los Alamos National Laboratory |
Co-Chair: Narasimhan, Shilpa | Wayne State University |
|
15:30-15:45, Paper ThC12.1 | |
Tools to Design Algorithms for Implementing Control Over Quantum Computers |
|
Narasimhan, Shilpa | Wayne State University |
Abou Halloun, Jihan | Wayne State University |
Nieman, Kip | Wayne State University |
Durand, Helen | Wayne State University |
Keywords: Quantum information and control, Chemical process control, Optimization
Abstract: Quantum computers (QCs) may find future applications within control systems that operate manufacturing processes. For application within control engineering, quantum algorithm development must be led by control engineers. However, control engineers may face challenges in designing quantum algorithms for control engineering problems. In this work, we provide several path-finding studies that leverage engineering tools such as optimization, encryption, and computational ``short-cuts'' toward making algorithm design for QC easier for control engineers.
|
|
15:45-16:00, Paper ThC12.2 | |
Modeling of Linear Quantum Networks with Frequency Transfer Function |
|
Fujimoto, Aoi | Meiji University |
Ichihara, Hiroyuki | Meiji University |
Keywords: Quantum information and control, Modeling
Abstract: This paper proposes a modeling approach for linear quantum networks composed of cavity systems using frequency transfer functions derived from input-output theory. Our approach enables the systematic modeling of arbitrary quantum networks through the series, parallel, and feedback connections of transfer functions corresponding to individual subsystems. Furthermore, applying the Bloch-Messiah/Euler decomposition to the frequency transfer functions, which approximates the systems to the steady-state gains, gives more flexible and physically meaningful models than conventional models.
|
|
16:00-16:15, Paper ThC12.3 | |
Heuristic Strategies for Process Stabilization Using Proportional Control Implemented by a Noisy Quantum Simulator |
|
Kasturi Rangan, Keshav | Wayne State University |
Durand, Helen | Wayne State University |
Keywords: Quantum information and control, Stability of linear systems
Abstract: Incremental processing and storage demands of industrial processes are causing fields such as optimization, scheduling, and control to assess the effectiveness of quantum devices in their applications. A key objective of control systems is to ensure process safety. This paper focuses on the potential of quantum devices to compute control inputs that maintain system safety despite sources of nondeterminism inherent to currently available quantum devices (quantum noise). In our previous work, we employed a quantum simulator to assess whether a quantum implementation of a proportional (P) control law could stabilize a single-input/single-output system under quantum noise approximated from a real quantum device. While one algorithm achieved this objective, another performed poorly, raising questions about the underlying reasons for their differing performance. This study explores how conventional control engineering techniques for handling nondeterminism (e.g., modifying sampling period lengths or applying only stabilizing control actions based on steady-state tracking) could enhance system response in the presence of noise. Utilizing IBM's Qiskit, a quantum simulator was used to evaluate algorithmic and heuristic control strategies to improve the steady-state tracking performance of a P-control law under quantum noise. The objective here is to provide insight into the feasibility of near-term quantum devices to execute control actions with safety considerations.
|
|
16:15-16:30, Paper ThC12.4 | |
Robust Quantum Gate Preparation in Open Environments |
|
Baker, Luke | Los Alamos National Laboratory |
Shah, Syed Alamdar | Los Alamos National Lab |
Zlotnik, Anatoly | Los Alamos National Laboratory |
Piryatinski, Andrei | Los Alamos National Laboratory |
Keywords: Robust control, Optimal control, Quantum information and control
Abstract: We develop an optimal control algorithm for robust quantum gate preparation in open environments with the state of the quantum system represented using the Lindblad master equation. The algorithm is based on adaptive linearization and iterative quadratic programming to progressively shape the control signal into an optimal form. Robustness is achieved with exponential rates of convergence by introducing uncertain parameters into the master equation and expanding the parameterized state over the basis of Legendre polynomials. We prove that the proposed control algorithm reduces to GRadient Ascent Pulse Engineering (GRAPE) when the robustness portion of the algorithm is bypassed and signal restrictions are relaxed. The control algorithm is applied to prepare Controlled NOT and SWAP gates with high precision. Using only second order Legendre polynomials, the examples showcase unprecedented robustness to 100% parameter uncertainty in the interaction strength between the qubits, while simultaneously compensating for 20% uncertainty in signal intensity. The results could enable new capabilities for robust implementation of quantum gates and circuits subject to harsh environments and hardware limitations.
|
|
16:30-16:45, Paper ThC12.5 | |
Response of Dynamic Processes with Control Implemented on a Noisy Quantum Computer |
|
Narasimhan, Shilpa | Wayne State University |
Messina, Dominic | Wayne State University |
Oyama, Henrique | Wayne State University |
Durand, Helen | Wayne State University |
Keywords: Lyapunov methods, Chemical process control, Quantum information and control
Abstract: A major challenge to determining the applicability (and potential outperformance over classical computers) of a quantum computer (QC) within chemical manufacturing processes is quantum noise. Computations by a QC are error-prone due to the influence of quantum noise inherent to the hardware. Errors in control inputs may destabilize a chemical process and lead to unsafe conditions for manufacturing personnel and the environment. The response of a process with control implemented on a QC to errors due to noise must be investigated thoroughly. In this work, the impacts of control input errors due to quantum noise on a process are modeled as bounded additive exogenous signals. Process and control element dynamics dictate the response of the process due to perturbations from exogenous signals. A theoretical analysis of the influence of process and control element dynamics on the response of a process to errors in control implemented on a QC is presented. Using simulations of a process example, a demonstration of the interplay between process response to noise, process dynamics, and control valve dynamics is illustrated. The results evaluate, for a specific process, aspects of how process and control element dynamics may mitigate the impact of noise.
|
|
16:45-17:00, Paper ThC12.6 | |
An Adaptive Observer Design for State and Parameter Estimation of Quantum Systems Via Averaging Theory |
|
Taslima, Eram | IIT BHU |
Kamal, Shyam | IIT(BHU) Varanasi |
Saket, R K | IIT (BHU) Varanasi |
Dinh, Thach N. | CNAM Paris |
Keywords: Estimation, Lyapunov methods, Quantum information and control
Abstract: This paper presents an adaptive asymptotic observer-based approach for estimating the states and the atom-laser coupling constant (parameter) of a two-level, closed quantum system driven by a constant control input. Using the interaction frame and averaging theory, the proposed observer ensures asymptotic estimation of both the system states and the unknown parameter. In addition to the asymptotic observer, we also propose an adaptive finite-time observer for the estimation of states in finite time, considering the parameter is unknown. The proposed finite-time observer is designed to provide the minimum output horizon required for accurate state and parameter estimation. To ensure the estimation of unknown parameter, the extended Lyapunov method is employed. Through simulations on a two-level quantum dynamical system, the proposed observers are demonstrated to exhibit robust performance in the presence of noise in the control and output measurement.
|
|
ThC13 |
Plaza Court 2 |
Optimization and Control |
Regular Session |
Chair: Liao-McPherson, Dominic | University of British Columbia |
Co-Chair: Koeln, Justin | University of Texas at Dallas |
|
15:30-15:45, Paper ThC13.1 | |
A Log-Domain Interior Point Method for Convex Quadratic Games |
|
Liu, Bingqi | University of British Columbia |
Liao-McPherson, Dominic | University of British Columbia |
Keywords: Numerical algorithms, Game theory
Abstract: We propose an equilibrium-seeking algorithm for finding generalized Nash equilibria of non-cooperative monotone convex quadratic games. Specifically, we recast the Nash equilibrium-seeking problem as variational inequality problem that we solve using a log-domain interior point method. This approach is suitable for general sum games and does not require extensive structural assumptions (e.g., aggregative or potential structure). We demonstrate the efficiency and versatility of the method on a benchmark game and demonstrate it is able to outperform first-order methods and state-of-the-art primal-dual predictor-corrector interior point methods on small to medium scale problems.
|
|
15:45-16:00, Paper ThC13.2 | |
A Fast Optimized Dual-Color Colorimetric Temperature Measurement Method for High-Temperature Surfaces Based on Controllable Error of Temperature Field |
|
Liu, Junyang | School of Automation Engineering, University of Electronic Scien |
Yin, Chun | University of Electronic Science and Technology of China |
Huang, Xuegang | Aerodynamics Institute, China Aerodynamics Research and Developm |
Dadras, Sara | Company |
Yan, Zhongbao | School of Automation Engineering, University of Electronic Scien |
Keywords: Optimization algorithms, Control applications, Computational methods
Abstract: CCD-based colorimetric temperature measurement enables non-contact monitoring of high-temperature fields but faces dynamic range limitations due to improper exposure during calibration. Traditional exposure adjustments compromise measurement accuracy while attempting to expand temperature detection ranges. This study proposes a novel calibration method utilizing an artificial blackbody to bypass emissivity-dependent corrections of measured objects. By acquiring multi-exposure thermal radiation data from the calibration blackbody, the system achieves simultaneous radiometric calibration and temperature field reconstruction. The improved process reduces temperature measurement errors from 4.5383^circ C to 2.1397^circ C, yielding a 52.85% improvement. Experimental validation confirms the method efficacy across 1000^circ C-2000^circ C, significantly enhancing dynamic range, adaptability, and accuracy for color CCD cameras in high-temperature applications.
|
|
16:00-16:15, Paper ThC13.3 | |
Trajectory-Informed versus Physics-Informed Machine Learning Methods for Dynamic Zero-Sum Games |
|
Wadi, Ali | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Game theory, Optimal control, Machine learning
Abstract: In this paper, we introduce a trajectory-informed, machine-learning framework designed to address two-player zero-sum games for uncertain nonlinear systems. Our approach approximates the optimal value function that solves the Hamilton-Jacobi-Isaacs (HJI) equation within the context of infinite-horizon zero-sum games. We employ trajectory-informed machine learning, inspired by the principles of physics-informed neural networks (PINNs). Remarkably, our method does not require explicit knowledge of the drift term in the system dynamics. Additionally, it provides guarantees for a unique solution for the finite-horizon zero-sum game variant, which PINNs cannot theoretically guarantee. We offer rigorous mathematical justification, demonstrating uniform convergence and satisfactory approximation of the saddle point policies for sufficiently large time horizons. This holds true regardless of whether the system dynamics are fully known or only partially known. Our proposed approach is validated through simulations, comparing scenarios where the system dynamics are fully known with those where the saddle point policies are learned through interaction with the system.
|
|
16:15-16:30, Paper ThC13.4 | |
Servo-Controllers for Linear Time-Invariant Systems with Operational Constraints |
|
Lavretsky, Eugene | The Boeing Co |
Menner, Marcel | Aurora Flight Sciences (A Boeing Company) |
Keywords: Constrained control, Optimal control, Linear systems
Abstract: A servo-control design method is proposed for multi-input-multi-output linear time-invariant (LTI) systems with box constraints on the control input and output. The proposed control design is based on the Nagumo Theorem, the Comparison Lemma, the min-norm optimal controllers, and it is directly related to the method of Control Barrier Functions. The Nagumo Theorem ensures forward invariance, while the Comparison Lemma is used to derive operational dynamics constraints that can be enforced by the LTI system for any relative degree. The resulting linear dynamics constraints are embedded into a Quadratic Program (QP) formulation to enforce the designated soft operational constraints. This paper shows that an explicit analytical solution to the QP can be obtained using parameter design choices that allow the Karush-Kuhn-Tucker optimality conditions to become decoupled component-wise. The proposed control design yields a continuous piecewise-linear state feedback policy and as a result, the system stability and robustness metrics can be computed using traditional methods.
|
|
16:30-16:45, Paper ThC13.5 | |
A Nominal Control Structure-Agnostic Model Reference Adaptive Control Framework |
|
Wilcher, Kevin | University of South Florida |
Yucelen, Tansel | University of South Florida |
Kurtoglu, Deniz | University of South Florida |
Hrynuk, John | DEVCOM Army Research Lab |
Keywords: Human-in-the-loop control, Lyapunov methods, Adaptive systems
Abstract: In this paper, we address a limitation in the traditional model reference adaptive control literature concerning the knowledge of the structure of a closed-form nominal control signal. Having this knowledge can be limiting when considering that control signals can be created by human operators, artificial intelligence algorithms, or other approaches that do not produce that closed-form structure. In particular, we propose a nominal control structure-agnostic model reference adaptive control framework to help deal with this limitation. The proposed framework also offers user-defined performance guarantees between the reference model and uncertain dynamical system trajectories. To show the efficacy of this framework, we present a numerical example where a human subject is producing the nominal control signal for a dynamical system.
|
|
16:45-17:00, Paper ThC13.6 | |
Designing Time-Varying Input Sets for Safety and Performance Using Constrained Zonotopes |
|
Vellucci, Alyssa | University of Texas at Dallas |
Koeln, Justin | University of Texas at Dallas |
Ruths, Justin | University of Texas at Dallas |
Keywords: Optimization, Autonomous systems
Abstract: We present a linear program to design admissible input sets for linear dynamic systems that contain inputs that simultaneously guarantee safe system operation and achieve the desired system performance. Given a set of safe, non-dangerous operating states and a target set to be reached, we provide a set-containment optimization problem to design a new admissible input set such that the propagated reachable set of a linear dynamic system remains in the safe set, reaches the target set, and remains within the actual physical limits of the actuators. Because this artificial limitation is imposed in a supervisory manner, it can be employed in tandem with any arbitrary low-level controller. In this paper, we design a horizon of time-varying admissible input sets, which enables greater flexibility in finding solutions in challenging environments.
|
|
ThC14 |
Plaza Court 3 |
Safe and Constrained Spacecraft Control |
Invited Session |
Chair: Phillips, Sean | Air Force Research Laboratory |
Co-Chair: Petersen, Chris | University of Florida |
Organizer: Petersen, Chris | University of Florida |
Organizer: Phillips, Sean | Air Force Research Laboratory |
Organizer: Soderlund, Alexander | The Ohio State University |
|
15:30-15:45, Paper ThC14.1 | |
Attitude Motion Planning with Moving Keep-Out Cones Via Invariant Sets (I) |
|
Jimerson, Trazon | University of New Mexico |
Danielson, Claus | University of New Mexico |
Keywords: Aerospace, Constrained control, Network analysis and control
Abstract: This paper adapts the invariant set motion planner (ISMP) for spacecraft attitude motion planning to avoid moving keep out zones. The ISMP is a motion planning algorithm that uses constraint admissible positive invariant (CAPI) sets of the closed loop dynamics to find paths to a desired orientation that avoids obstacles. We construct a multi stage reachability graph that incorporates time to model moving obstacles. This allows the ISMP to plan safe paths around moving keep out zones. We present three key contributions that enable the construction of the multi stage reachable graph. First, we establish time bounds for maneuvers using the exponential stability of the nonlinear closed loop dynamics. Second, we construct a single stage reachability-graph using the one-step backward reachable sets of the CAPI sets. Finally, we expand this single stage reachability graph into a multi stage reachability graph over a planning horizon, which enables us to certify the safety of nodes during time intervals. We provide simulation results to demonstrate safe attitude motion planning and control in the presence of moving obstacles.
|
|
15:45-16:00, Paper ThC14.2 | |
Safe Vehicle Motion Planning Using Constraint Admissible Positive Invariant Sets on SE(3) (I) |
|
Brandt, Teo | University of New Mexico |
Fierro, Rafael | University of New Mexico |
Danielson, Claus | University of New Mexico |
Keywords: Constrained control, Robust control, LMIs
Abstract: This letter extends the application of the invariant set motion planner (ISMP) to space vehicles operating in SE(3)=SO(3)xR 3, considering the quaternion representation of SO(3). We provide a proof for a collision-free set by extending the concepts of configuration-space bubbles from robotics literature. We derive a constraint admissible positive invariant (CAPI) subset within the configuration-space bubble for a robust linearization of the nonlinear vehicle dynamics. The motion planner constructs a directed graph of position and orientation equilibria covering SE(3). CAPI sets are constructed to verify that equilibria are connected by a feasible trajectory. Graph search is applied to determine a sequence of reference configurations, starting at an initial position-orientation and terminating at a goal position-orientation. Simulation results are included that demonstrate the safe navigation of a vehicle in the presence of an obstacle. The trajectory is shown to maintain the CAPI conditions and is therefore safe under the nonlinear translational and rotational closed-loop vehicle dynamics.
|
|
16:00-16:15, Paper ThC14.3 | |
Learning-Based Shielding for Safe Autonomy under Unknown Dynamics (I) |
|
Reed, Robert | University of Colorado Boulder |
Lahijanian, Morteza | University of Colorado Boulder |
Keywords: Formal verification/synthesis, Markov processes, Stochastic systems
Abstract: Shielding is a common method used to guarantee the safety of a system under a black-box controller, such as a neural network controller from deep reinforcement learning (DRL), with simpler, verified controllers. Existing shielding methods rely on formal verification through Markov Decision Processes (MDPs), assuming either known or finite-state models, which limits their applicability to DRL settings with unknown, continuous-state systems. This paper addresses these limitations by proposing a data-driven shielding methodology that guarantees safety for unknown systems under black-box controllers. The approach leverages Deep Kernel Learning to model the systems' one-step evolution with uncertainty quantification and constructs a finite-state abstraction as an Interval MDP (IMDP). By focusing on safety properties expressed in safe linear temporal logic (safe LTL), we develop an algorithm that computes the maximally permissive set of safe policies on the IMDP, ensuring avoidance of unsafe states. The algorithms soundness and computational complexity are demonstrated through theoretical proofs and experiments on nonlinear systems, including a high-dimensional autonomous spacecraft scenario.
|
|
16:15-16:30, Paper ThC14.4 | |
Geostationary Satellite Station Keeping and Collocation under High-Thrust Impulsive Control (I) |
|
Pavlasek, Natalia | University of Washington |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Weiss, Avishai | Mitsubishi Electric Research Labs |
Keywords: Spacecraft control, Predictive control for nonlinear systems
Abstract: Ensuring that satellites in geostationary Earth orbit (GEO) remain in their allocated station-keeping windows necessitates accurate station-keeping algorithms. Due to the direct relationship between the fuel efficiency of station-keeping trajectories and satellite mass, optimizing propellant consumption can extend satellite lifetime, increase payload capacity, and lower launch costs. In this paper, we propose a nonlinear model predictive control (NMPC) policy for station keeping and collocation of multiple GEO satellites under infrequent high-thrust impulsive control. We develop a sequential convex programming-based approach to find locally fuel-optimal trajectories with enforced separation distances between collocated satellites. Numerical simulations with NASA's General Mission Analysis Tool demonstrate the effectiveness of the proposed NMPC policy for both GEO satellite station keeping and as a collocation strategy for three GEO satellites in a single station-keeping window.
|
|
16:30-16:45, Paper ThC14.5 | |
Hybrid Model Predictive Control Approach for Spacecraft Proximity Maneuvering and Docking Accounting for Collisions (I) |
|
Basu, Himadri | University of California Santa Cruz |
Castroviejo-Fernandez, Miguel | University of Michigan |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Spacecraft control, Hybrid systems, Optimization algorithms
Abstract: In this paper, we present a hybrid Model Predictive Control (MPC)-based strategy for docking and proximity operations of a deputy spacecraft with a passively tumbling chief on a circular orbit. Both spacecraft are modeled as rigid bodies, and the objective is to safely guide the deputy spacecraft to the chief while synchronizing the orientation, translational, and angular motion of these rigid bodies as they approach for docking with each other. Depending on the proximity between these spacecraft, the maneuvers are classified into three phases-1) pre-docking, 2) contact, and 3) docked phase-each with distinct control requirements and constraints. With the proposed approach, collisions between the spacecraft can be accounted for during the contact phase. Simulation results are reported to demonstrate the effectiveness of hybrid MPC-based solution.
|
|
16:45-17:00, Paper ThC14.6 | |
Computational Dynamics for Model Predictive Control Rendezvous and Proximity Operations (I) |
|
Ludden, Channing | University of Florida |
Petersen, Chris | University of Florida |
Keywords: Spacecraft control, Autonomous systems, Optimization
Abstract: This work demonstrates the existence of computational dynamics that evolve temporally when executing Model Predictive Control (MPC). These computational dynamics represent metrics such as CPU usage and power which are consumed by processors when performing the on-board calculations. The metrics, when observed temporally, appear to have dynamics that are asymptotically stable and perturbed when MPC is executed, demonstrating something akin to an input-output relationship. In particular, these magnitude of impact is driven by non-traditional inputs from the optimizations mathematical formulation, such as horizon length, to the solver parameters, such as stopping criteria. This work in particular focuses the analysis on spacecraft Rendezvous and Proximity Operations (RPO) where computation is limited, and demonstrates analytically the existence of these computational dynamics.
|
|
ThC15 |
Plaza Court 6 |
Spreading Processes in Complex Systems: Analysis, Control, and Estimation |
Invited Session |
Chair: Pare, Philip E. | Purdue University |
Co-Chair: Bizyaeva, Anastasia | Cornell University |
Organizer: Walter, Ian | Purdue University |
Organizer: Gracy, Sebin | South Dakota School of Mines and Technology |
Organizer: Pare, Philip E. | Purdue University |
|
15:30-15:45, Paper ThC15.1 | |
Optimal Bayesian Persuasion for Containing SIS Epidemics (I) |
|
Maitra, Urmee | Indian Institute of Technology, Kharagpur |
Hota, Ashish R. | Indian Institute of Technology (IIT), Kharagpur |
Pare, Philip E. | Purdue University |
Keywords: Game theory, Optimal control, Emerging control applications
Abstract: We consider a susceptible-infected-susceptible (SIS) epidemic model in which a large group of individuals decide whether to adopt partially effective protection without being aware of their individual infection status. Each individual receives a signal which conveys noisy information about its infection state, and then decides its action to maximize its expected utility computed using its posterior probability of being infected conditioned on the received signal. We first derive the static signal which minimizes the infection level at the stationary Nash equilibrium under suitable assumptions. We then formulate an optimal control problem to determine the optimal dynamic signal that minimizes the aggregate infection level along the solution trajectory. We compare the performance of the dynamic signaling scheme with the optimal static signaling scheme, and illustrate the advantage of the former through numerical simulations.
|
|
15:45-16:00, Paper ThC15.2 | |
Preventive-Reactive Defense Tradeoffs in Resource Allocation Contests (I) |
|
Paarporn, Keith | University of Colorado, Colorado Springs |
Xu, Shouhuai | University of Colorado Colorado Springs |
Keywords: Game theory, Agents-based systems, Optimization
Abstract: The connectivity enabled by modern computer networking technologies introduces vulnerabilities to adversarial attacks. Although it is ideal to be able to prevent all possible cyber attacks, this is not possible or feasible in practice and society must accept that attacks are inevitable. While many works study optimal security policies to minimize the chance of successful attacks, there are many unexplored territories. In this letter, we formulate and investigate a new problem, namely the tradeoff between the effort or resource that should be spent on preventing attacks (i.e., preventive defense) and the effort or resource that should be spent on recovering from attacks (i.e., reactive defense). We formulate the problem as a resource allocation game between the defender and the attacker, where they decide how to allocate resources to defend and attack a set nodes (e.g., computers), respectively. The game unfolds in two phases. (i) Allocate preventive resources to reduce the probabilities that the nodes are successfully compromised by the attacker. (ii) The compromised nodes undergo a recovery process, which can be sped up with the allocation of more reactive defense resources. Our results completely characterize the Nash equilibria of this game, revealing the defender's optimal allocation of preventive versus reactive resources.
|
|
16:00-16:15, Paper ThC15.3 | |
Resilience to Non-Compliance in Coupled Cooperating Systems (I) |
|
Butler, Brooks A. | University of California, Irvine |
Pare, Philip E. | Purdue University |
Keywords: Networked control systems, Control of networks, Cooperative control
Abstract: This letter explores the implementation of a safe control law for systems of dynamically coupled cooperating agents. Under a CBF-based collaborative safety framework, we examine how the maximum safety capability for a given agent, which is computed using a collaborative safety condition, influences safety requests made to neighbors. We provide conditions under which neighbors may be resilient to non-compliance of neighbors to safety requests, and compute an upper bound for the total amount of non-compliance an agent is resilient to, given its 1-hop neighborhood state and knowledge of the network dynamics. We then illustrate our results via simulations of a networked susceptible-infected-susceptible (SIS) epidemic model.
|
|
16:15-16:30, Paper ThC15.4 | |
Modeling Epidemic Spread: A Gaussian Process Regression Approach (I) |
|
She, Baike | Georgia Institute of Technology |
Xin, Lei | The Chinese University of Hong Kong |
Pare, Philip E. | Purdue University |
Hale, Matthew | Georgia Institute of Technology |
Keywords: Emerging control applications, Healthcare and medical systems, Biological systems
Abstract: Modeling epidemic spread is critical for informing policy decisions aimed at mitigation. Accordingly, in this work we present a new data-driven method based on Gaussian process regression (GPR) to model epidemic spread through the difference on the logarithmic scale of the infected cases. We bound the variance of the predictions made by GPR, which quantifies the impact of epidemic data on the proposed model. Next, we derive a high-probability error bound on the prediction error in terms of the distance between the training points and a testing point, the posterior variance, and the level of change in the spreading process, and we assess how the characteristics of the epidemic spread and infection data influence this error bound. We present examples that use GPR to model and predict epidemic spread by using real world infection data gathered in the UK during the COVID-19 epidemic. These examples illustrate that, under typical conditions, the prediction for the next twenty days has 94.29% of the noisy data located within the 95% confidence interval, validating these predictions. We further compare the modeling and prediction results with other methods, such as polynomial regression, k-nearest neighbors (KNN) regression, and neural networks, to demonstrate the benefits of leveraging GPR in disease spread modeling.
|
|
16:30-16:45, Paper ThC15.5 | |
Hybrid SIS Dynamics for Demand Modeling of Frequently Updated Products (I) |
|
Walter, Ian | Purdue University |
Panchal, Jitesh | Purdue University, School of Mechanical Engineering |
Pare, Philip E. | Purdue University |
Keywords: Model Validation, Nonlinear systems identification, Hybrid systems
Abstract: We propose a hybrid spreading process model to capture the dynamics of demand for software-based products. We introduce discontinuous jumps in the state to model sudden surges in demand that can be seen immediately after a product update is released. After each update, the modeled demand evolves according to a continuous-time susceptible-infected-susceptible (SIS) epidemic model. We identify the necessary and sufficient conditions for estimating the hybrid model's parameters for an arbitrary finite number of sequential updates. We verify the parameter estimation conditions in simulation, and evaluate how the estimation of these parameters is impacted by the presence of observation and process noise. We then validate our model by applying our estimation method to daily user engagement data for a regularly updating software product, the live-service video game `Apex Legends.'
|
|
16:45-17:00, Paper ThC15.6 | |
Opinion-Driven Risk Perception and Reaction in SIS Epidemics (I) |
|
Ordorica Arango, Marcela | Princeton University |
Bizyaeva, Anastasia | Cornell University |
Levin, Simon | Princeton University |
Leonard, Naomi Ehrich | Princeton University |
Keywords: Agents-based systems, Networked control systems
Abstract: We present and analyze a mathematical model to study the feedback between behavior and epidemic spread in a population that is actively assessing and reacting to risk of infection. In our model, a population dynamically forms an opinion that reflects its willingness to engage in risky behavior (e.g., not wearing a mask in a crowded area) or reduce it (e.g., social distancing). We consider SIS epidemic dynamics in which the contact rate within a population adapts as a function of its opinion. For the new coupled model, we prove the existence of two distinct parameter regimes. One regime corresponds to a low baseline infectiousness, and the equilibria of the epidemic spread are identical to those of the standard SIS model. The other regime corresponds to a high baseline infectiousness, and there is a bistability between two new endemic equilibria that reflect an initial preference towards either risk seeking behavior or risk aversion. We prove that risk seeking behavior increases the steady-state infection level in the population compared to the baseline SIS model, whereas risk aversion decreases it. When a population is highly reactive to extreme opinions, we show how risk aversion enables the complete eradication of infection in the population. Extensions of the model to a network of subpopulations are explored numerically.
|
|
ThC16 |
Plaza Court 7 |
LPV and Robust Systems |
Regular Session |
Chair: Zare, Armin | University of Texas at Dallas |
Co-Chair: Bhattacharya, Raktim | Texas A&M |
|
15:30-15:45, Paper ThC16.1 | |
Sparse Actuation for LPV Systems with Full-State Feedback in H2/H∞ Framework |
|
Kumar, Tanay | Texas A&M University |
Bhattacharya, Raktim | Texas A&M |
Keywords: Linear parameter-varying systems, H-infinity control, Robust control
Abstract: This paper addresses the sparse actuation problem for nonlinear systems represented in the Linear Parameter Varying (LPV) form. We propose a convex optimization framework that concurrently determines actuator magnitude limits and the state-feedback law that guarantees a user-specified closed-loop performance in the H2/H∞ sense. We also demonstrate that sparse actuation is achieved when the actuator magnitude-limits are minimized in the l1 sense. This is the first paper that addresses this problem for LPV systems. The formulation is demonstrated in a vibration control problem for a flexible wing.
|
|
15:45-16:00, Paper ThC16.2 | |
Perturbation Analysis of Turbulent Channel Flow Over Riblets |
|
Naseri, Mohammadamin | The University of Texas at Dallas |
Zare, Armin | University of Texas at Dallas |
Keywords: Linear parameter-varying systems, Linear systems, Reduced order modeling
Abstract: Spanwise-periodic surface roughness in the form of riblets has been shown to reduce skin-friction drag by numerous experimental and numerical studies. Recent efforts have demonstrated the efficacy of the linearized Navier-Stokes equations around the mean velocity in capturing the effect of riblets. We address ongoing issues pertain to the accuracy and computational complexity of such reduced-order models by combining turbulence modeling with a domain transformation that translates the spatial periodicity of the corrugated surface onto the differential operators. The latter technique, which yields a spatially periodic system that accounts for harmonic interactions between the mean flow and turbulent fluctuations, enables a perturbation analysis that significantly reduces computational complexity. Our numerical experiments demonstrate the efficacy of this approach in capturing skin-friction drag and kinetic energy in channel flow over semi-circular riblets.
|
|
16:00-16:15, Paper ThC16.3 | |
Finite-Time Stabilization of Continuous-Time Systems with Sampled Control |
|
Mazenc, Frederic | Inria Saclay |
Malisoff, Michael | Louisiana State University |
Keywords: Linear parameter-varying systems, Robust control, Delay systems
Abstract: We prove finite-time stabilization properties for continuous-timetime-varying linear systems, using sampled controls. Our main results yield finite-time input- to-state stability, where the upper bounding supremum of the uncertainty is over a time interval of constant finite length. Our work includes output feedback stabilization and input delays. We use our results to prove novel global exponential input-to-state estimates for nonlinear systems with state delays, including systems with outputs, using a trajectory based approach. We illustrate our work using a pendulum dynamics with poorly known friction.
|
|
16:15-16:30, Paper ThC16.4 | |
Optimal Sensing Precision for Celestial Navigation Systems in Cislunar Space Using LPV Framework |
|
Nychka, Eliot | Texas A&M University - College Station Tx |
Bhattacharya, Raktim | Texas A&M |
Keywords: Linear parameter-varying systems, Robust control, Estimation
Abstract: This paper introduces two innovative convex optimization formulations to simultaneously optimize the H2/Hinf observer gain and sensing precision, and guarantee a specified estimation error bound for nonlinear systems in LPV form. Applied to the design of an onboard celestial navigation system for cislunar operations, these formulations demonstrate the ability to maintain accurate spacecraft positioning with minimal measurements and theoretical performance guarantees by design.
|
|
16:30-16:45, Paper ThC16.5 | |
Direct Data-Driven Design of LPV Controllers and Polytopic Invariant Sets with Cross-Covariance Noise Bounds |
|
Mejari, Manas | University of Applied Sciences and Arts of Southern Switzerland |
Breschi, Valentina | Eindhoven University of Technology |
Keywords: Data driven control, Linear parameter-varying systems, Robust control
Abstract: We propose a direct data-driven method for the concurrent computation of polytopic robust control invariant (RCI) sets and associated invariance-inducing control laws for linear parameter-varying (LPV) system. We present a data-based covariance parameterization of the gain-scheduled controller and the closed-loop dynamics, utilizing a persistently exciting state-input-scheduling trajectory gathered from an LPV system. This parameterization, along with the assumption of bounded cross-covariance noise, allows us to express the invariance condition as a set of data-based LMIs with a number of decision variables independent of the length of the dataset. These LMIs are combined with state-input constraints framed as simple affine inequalities in a convex semi-definite program to maximize the volume of the RCI set. A numerical example demonstrates the computational effectiveness of the proposed method in synthesizing RCI sets even with large datasets.
|
|
16:45-17:00, Paper ThC16.6 | |
Robust Control of an Inverting Buck-Boost Converter under Exogenous Disturbances |
|
Verdín Monzón, Rodolfo Isaac | Centro De Investigaciones En óptica |
Flores, Gerardo | Texas A&M International University |
Keywords: Robust control, Lyapunov methods, Power systems
Abstract: This paper presents the design and implementation of a robust nonlinear control strategy for a DC-DC inverting buck-boost converter. The proposed controller ensures finite-time almost global stability, even in the presence of external disturbances such as voltage fluctuations and noise. To evaluate its performance, software-in-the-loop (SITL) simulations were conducted and compared against a nonlinear PI-type controller from the literature. The results demonstrate significant improvements in stability, convergence speed, and disturbance rejection, confirming the effectiveness of the proposed approach in handling dynamic and uncertain operating conditions.
|
|
ThC17 |
Plaza Court 8 |
Motion Planning and Control |
Regular Session |
Chair: van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Co-Chair: Gaspard, Mallory | Cornell University |
|
15:30-15:45, Paper ThC17.1 | |
Optimality of Motion Camouflage under Escape Uncertainty |
|
Gaspard, Mallory | Cornell University |
Keywords: Optimal control, Optimization, Biological systems
Abstract: This letter proposes a novel continuous-time dynamic programming framework to determine when it is optimal for a pursuer to use motion camouflage (MC) amidst uncertainty in the evader’s escape attempt time. We motivate this framework through the model problem of an energy-optimizing male hover fly pursuing a female hover fly for mating. The time at which the female fly initiates an escape is modeled to occur as the result of a non-homogeneous Poisson point process with a biologically informed rate function, and we obtain and solve two Hamilton-Jacobi-Bellman (HJB) PDEs which encode the pursuer’s optimal trajectories. Our numerical experiments and statistics illustrate when it is optimal to use MC pursuit tactics amidst uncertainty and how MC optimality is affected by certain properties of the evader’s sensing abilities.
|
|
15:45-16:00, Paper ThC17.2 | |
Shortest Dubins Path to a Moving Circle with Free Final Heading |
|
Manyam, Satyanarayana Gupta | DCS Corp., Air Force Research Labs |
Casbeer, David W. | Air Force Research Laboratory |
Von Moll, Alexander | Air Force Research Laboratory |
Weintraub, Isaac | Air Force Research Laboratory |
Keywords: Optimization, Autonomous robots, Optimal control
Abstract: In this paper the shortest time strategy of a turn-constrained vehicle for reaching a circle moving on a straight line is posed and solved. The shortest curvature constrained path to a circle is comprised of a left or right turning arc of minimum turn radius (L/R) and straight-line segment (S). An analysis of each of the candidate Dubins modes is provided. Under the condition that the agent's initial position is not on the path of the moving circle, we prove that the length of the shortest Dubins path to the circle is a continuous function of the position of the target center. Leveraging this continuity, we propose an algorithm that uses the bisection search and finds the time-optimal solution. A lower bound to the solution is obtained when the agent's initial position lies on the path of the moving circle, and a heuristic approach is discussed for such instances.
|
|
16:00-16:15, Paper ThC17.3 | |
Bezier Reachable Polytopes: Efficient Certificates for Robust Motion Planning with Layered Architectures |
|
Csomay-Shanklin, Noel | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Robotics, Hierarchical control, Predictive control for nonlinear systems
Abstract: Control architectures are often implemented in a layered fashion, combining independently designed blocks to achieve complex tasks. Providing guarantees for such hierarchical frameworks requires considering the capabilities and limitations of each layer and their interconnections at design time. To address this holistic design challenge, we introduce the notion of Bezier Reachable Polytopes -- certificates of reachable points in the space of Bezier polynomial reference trajectories. This approach captures the set of trajectories that can be tracked by a low-level controller while satisfying state and input constraints, and leverages the geometric properties of Bezier polynomials to maintain an efficient polytopic representation. As a result, these certificates serve as a constructive tool for layered architectures, enabling long-horizon tasks to be reasoned about in a computationally tractable manner.
|
|
16:15-16:30, Paper ThC17.4 | |
Optimal Motion Planning Using Mixed Bernstein-Fourier Approximants |
|
Mudrik, Liraz | Naval Postgraduate School |
Kragelund, Sean | Naval Postgraduate School |
Kaminer, Isaac | Naval Postgraduate School |
Keywords: Numerical algorithms, Autonomous systems, Optimal control
Abstract: This paper presents a novel numerical method for optimal motion planning by integrating Bernstein polynomials with Fourier series, effectively leveraging the strengths of both to address complex optimal control problems. Our mixed approximation method uses Fourier series to capture periodic behaviors while employing Bernstein polynomials to model non-periodic components, thereby accelerating convergence. We prove that the combined approximants—and their derivatives and integrals—exhibit uniform convergence for any continuous function, ensuring reliable approximation quality. In addition, we establish that the approximated solution is both feasible and consistent with the original optimal control problem, guaranteeing convergence to the true optimal solution as the approximation order increases. Simulation results confirm that our method significantly improves computational efficiency without loss of accuracy compared to the Bernstein-only approach.
|
|
16:30-16:45, Paper ThC17.5 | |
A Continuous Split-Path Integrator with Application to Motion Control |
|
Hoogeveen, Thomas | ASML |
van den Eijnden, Sebastiaan | Eindhoven University of Technology |
Heertjes, Marcel | Eindhoven University of Technology |
Keywords: Mechatronics, Control applications, Lyapunov methods
Abstract: This paper considers a split-path integrator design with continuous output. Compared to existing split-path integrators that have discontinuous outputs, the continuous split-path integrator has less harmonic distortion and its nonlinear frequency response is better explained by (quasi-linear) describing function analysis. Benefits in measured control performance are demonstrated on a fourth-order demonstrator system, e.g., an improved step response with less overshoot and less settling time as compared to a nominal linear design.
|
|
16:45-17:00, Paper ThC17.6 | |
Hysteresis in Motion Control Systems: A Frequency-Domain Analysis on Higher Harmonics |
|
Alferink, Dirk W.T. | University of Technology, Eindhoven |
Fey, Rob H.B. | Eindhoven University of Technology |
Van De Wouw, Nathan | Eindhoven University of Technology |
Heertjes, Marcel | Eindhoven University of Technology |
Keywords: Mechanical systems/robotics, Feedback linearization, Switched systems
Abstract: Motion stages in lithography machines are connected to other components through dynamic links, which exhibit hysteretic behavior that negatively impacts tracking performance. This paper presents a novel frequency-domain framework to analyze the steady-state response of motion control systems affected by hysteresis when subjected to sinusoidal inputs. The framework enables the calculation of quasi-linear frequency response functions for all harmonic components in the closed-loop system. The effectiveness of the approach is demonstrated through an industrially relevant numerical example, showing that higher-order harmonics induced by hysteresis are sufficiently small compared to the fundamental harmonic.
|
|
ThC18 |
Director's Row E |
Identification and Filtering |
Regular Session |
Chair: Wang, Ningshan | University of Michigan |
Co-Chair: Dózsa, Tamás Gábor | HUN-REN Institute for Computer Science and Control |
|
15:30-15:45, Paper ThC18.1 | |
Integrating System Identification and Blind Source Separation for Real-Time Pipeline Monitoring: A Field Study |
|
Maneshkarimi, Shirin | University of Calgary |
Dankers, Arne | University of Calgary |
Westwick, David | Schulich School of Engineering, University of Calgary |
Keywords: Identification for control, Numerical algorithms, Computational methods
Abstract: This work introduces an innovative method for real-time pipeline monitoring using acoustic sensors. Our proposed algorithm is based on decomposing the acoustic measurements into ‘’sources’’ and monitoring the sources for changes that could be attributed to leaks. The source separation (SS) algorithm is implemented using tools from system identification. In contrast to past implementations, our method performs in real time and incorporates a verification step to improve the reliability of source estimations. We explicitly incorporate SS with a cross-correlation test to verify the algorithm's reliability in identifying mutually uncorrelated sources. Furthermore, the algorithm's adaptability improved with the regularized least squares (ReLS) technique and an automatic regularization factor computed from the measured data. This automation not only assures the algorithm's flexibility but also maintains real-time performance under various situations without requiring user intervention, which is critical for online monitoring systems. Real-time monitoring, robustness, and verifiability are the three main points of this paper to guarantee that the system can reliably detect abnormalities as they occur, perform continuously under changing conditions, and produce reliable outcomes. The approach was tested with field data from an operating pipeline, confirming its effectiveness in real-world scenarios. The findings demonstrate considerable increases in both detection accuracy and real-time performance, indicating a major advancement in pipeline monitoring technology.
|
|
15:45-16:00, Paper ThC18.2 | |
System Identification with Generalized Prony Schemes |
|
Dózsa, Tamás Gábor | HUN-REN Institute for Computer Science and Control |
Ungvári, Gergő | Eötvös Loránd Tudományegyetem |
Soumelidis, Alexandros | Computer and Automation Research Inst |
Schipp, Ferenc | Eotvos Lorand University of Budapest |
Bokor, Jozsef | MTA SZTAKI Hungarian Academy of Sciences |
Keywords: Identification, Linear systems, Numerical algorithms
Abstract: We propose a novel method to identify the transfer functions of single-input-single-output linear time invariant (SISO-LTI) dynamic systems. Our approach makes use of the operator based generalization of Prony's method. In particular, the operator based Prony algorithm is used to reconstruct the transfer function of the system as a linear combination of rational basis functions. A considerable benefit of the proposed method is its robustness against the estimated system order. That is, if system order is over estimated, the correct system order can be found naturally. Another important benefit is that the proposed method is shown to be asymptotically robust towards zero expectation noise with the correct choice of certain evaluation functionals. The effectiveness of the proposed method is demonstrated through numerical experiments.
|
|
16:00-16:15, Paper ThC18.3 | |
A Sampling Complexity-Aware Framework for Discrete-Time Fractional-Order Dynamical System Identification |
|
Zhang, Xiaole | University of Southern California |
Gupta, Vijay | Purdue University |
Bogdan, Paul | University of Southern California |
Keywords: Nonlinear systems identification, Identification, Estimation
Abstract: A variety of complex biological, natural and man-made systems exhibit non-Markovian dynamics that can be modeled through fractional order differential equations, yet, we lack sample comlexity aware system identification strategies. Towards this end, we propose an affine discrete-time fractional order dynamical system (FoDS) identification algorithm and provide a detailed sample complexity analysis. The algorithm effectively addresses the challenges of FoDS identification in the presence of noisy data. The proposed algorithm consists of two key steps. Firstly, it avoids solving higher-order polynomial equations, which would otherwise result in multiple potential solutions for the fractional orders. Secondly, the identification problem is reformulated as a least squares estimation, allowing us to infer the system parameters. We derive the expectation and probabilistic bounds for the FoDS parameter estimation error, assuming prior knowledge of the functions ( f ) and ( g ) in the FoDS model. The error decays at a rate of ( N = Oleft( frac{d}{epsilon} right) ), where ( N ) is the number of samples, ( d ) is the dimension of the state variable, and ( epsilon ) represents the desired estimation accuracy. Simulation results demonstrate that our theoretical bounds are tight, validating the accuracy and robustness of this algorithm.
|
|
16:15-16:30, Paper ThC18.4 | |
Kalman Filter for Unobservable Systems and Its Application to Time Scale Generation by Atomic Clock Ensembles |
|
Mochida, Shunsuke | Gunma University |
Kawaguchi, Takahiro | Gunma University |
Yano, Yuichiro | National Institute of Information and Communications Technology |
Hanado, Yuko | National Institute of Information and Communications Technology |
Kurata, Yosuke | Seiko Solutions Inc |
Koike, Masakazu | Tokyo University of Marine Science and Technology |
Ishizaki, Takayuki | Tokyo Institute of Technology |
Keywords: Kalman filtering, Estimation
Abstract: This paper proposes Kalman filter algorithms for unobservable systems. The proposed algorithms are effective in the time scale generation by atomic clock ensembles with undetectable modes. Applying the conventional Kalman filter (CKF) to an undetectable system causes numerical instability due to the divergence of the error covariance matrix. The proposed algorithm can prevent numerical instability while providing the same estimation as that of the CKF if the system has undetectable modes only on the unit circle. In addition, we also propose the steady-state form of the proposed algorithm. Numerical simulations confirm the validity of these algorithms for time scale generation by atomic clock ensembles.
|
|
16:30-16:45, Paper ThC18.5 | |
Sensor Scheduling with Guarantees for Greedy Approximation of Non-Submodular Mean-Squared Error Metric |
|
Cho, Wooyeong | University of California, Los Angeles |
Mehta, Ankur | University of California Los Angeles |
Keywords: Kalman filtering, Linear systems, Optimization
Abstract: We address sensor scheduling in linear time-invariant (LTI) dynamical systems, aiming to optimize the utilization of a limited number of sensors, which is inherently an NP-hard problem. We explore complex scenarios where the metric is a non-submodular mean squared error (MSE) for state estimation. Despite the challenges posed by non-submodular metrics in deriving a performance guarantee for greedy solutions over multiple time steps, we develop a sequential sensor scheduling framework that ensures performance guarantees for the greedy algorithm across the entire sequence while leveraging concepts of the submodularity ratio and the curvature. Thus, we propose a theoretical performance guarantee for the greedy approximation using the sequential analysis for sensor scheduling. Our simulation results demonstrate that our proposed guarantee for sensor scheduling effectively bounds the performance across various system cases.
|
|
16:45-17:00, Paper ThC18.6 | |
Geometric Extended State Observer on TSO(3) in the Presence of Bias in Angular Velocity Measurements |
|
Wang, Ningshan | University of Michigan |
Sanyal, Amit | Syracuse University |
Keywords: Estimation, Aerospace, Stability of nonlinear systems
Abstract: This article presents an estimation scheme for a rotating rigid body in the presence of unknown disturbance torque and unknown bias in angular velocity measurements. The attitude, angular velocity and disturbance torque are estimated from on-board control inputs, landmark vector measurements, and angular velocity measurements. The estimated attitude evolves directly on the special orthogonal group SO(3) of rigid body rotations. A Lyapunov analysis is given to prove that the proposed estimation scheme is almost globally Lyapunov stable in the absence of measurement noise and dynamic disturbance. The estimation scheme is discretized as a geometric integrator for practical implementation. The geometry-preserving properties of this numerical integrator preserve the Lie group structure of the configuration space, and give long time numerical stability. Numerical simulations demonstrate the stability and robustness properties of the proposed scheme.
|
|
ThC19 |
Director's Row H |
Modeling |
Regular Session |
Chair: Kwon, Joseph | Texas A&M University |
Co-Chair: Kant, Nilay | Michigan State University |
|
15:30-15:45, Paper ThC19.1 | |
Investigating Bistable Dynamics of Coupled Oscillators with Similarities to Neural Activity in Epilepsy |
|
Kant, Nilay | Michigan State University |
Mukherjee, Ranjan | Michigan State University |
Keywords: Modeling, Biological systems, Stability of nonlinear systems
Abstract: This paper investigates the dynamics of a nonlinear oscillator with bistable characteristics, where both a stable equilibrium and a limit cycle coexist. We refer to it as a bistable oscillator unit (BOU) and show that two coupled BOUs (CBOUs) exhibit dynamics analogous to neural activity patterns in epilepsy, including healthy, localized, and fully spread epileptic states. By treating each CBOU as an independent system influenced by the state of the other, we establish local input-to-state stability near both the stable equilibrium and the limit cycle, and estimate the ultimate bounds of the trajectories. Our analysis then identifies the domain of the initial conditions and estimates of a coupling constant, which are critical in determining the epileptic behavior of the CBOUs. The actual value of the coupling constant, above which the dynamics transition from localized to fully spread epileptic states, is determined through simulations and closely aligns with the analytical estimate presented in the paper
|
|
15:45-16:00, Paper ThC19.2 | |
Enhancing Predictive Accuracy in Catalysis: A Hybrid Modeling Approach for Dynamic Surface Configuration Analysis |
|
Lee, Chi Ho | Texas A&M University |
Pahari, Silabrata | Texas A&M |
Yesudoss, David Kumar | Texas A&M University |
Djire, Abdoulaye | Texas A&M University |
Kwon, Joseph | Texas A&M University |
Keywords: Modeling, Energy systems, Computational methods
Abstract: One inherent limitation of density functional theory (DFT) is its inability to precisely capture changes in surface configurations and activation energies, both of which are essential for interpreting reaction kinetics effectively. These shortcomings are further exacerbated when combined with the challenges traditional kinetic Monte Carlo (kMC) simulations face in capturing latent reaction mechanisms on electrocatalysts due to complex many-body interactions among adsorbates, reactants, and intermediates. To tackle these issues, we have developed a hybrid model that integrates machine learning (ML) with first-principles model. This model clearly tracks the changes in surface configurations over time, integrating complex physical phenomena often overlooked by conventional DFT models but observable in experiments. Our approach not only boosts the predictive accuracy of these models but also broadens their applicability in testing catalyst performance under diverse conditions. Intriguingly, the ML component of our model does more than fitting empirical data; it identifies critical parameters that refine subsequent DFT calculations to better match real-world surface conditions and coverages.
|
|
16:00-16:15, Paper ThC19.3 | |
Adaptive Passification of Unknown Input-Affine Nonlinear Systems |
|
Miyano, Tatsuya | Toyota Central R&D Labs., Inc |
Shima, Ryotaro | Toyota Central R&D Labs |
Ito, Yuji | Toyota Central R&D Labs., Inc |
Keywords: Modeling, Optimization, Adaptive systems
Abstract: In this letter, we present an adaptive passification framework for unknown input-affine nonlinear systems. In the present framework, a reference system is designed so that the deviation between the reference system and an unknown nominal system is minimized, while ensuring some classes of passivity properties. Based on the passive reference system, we present an adaptive control method that drives the nominal system to the reference system. The performance of the present framework is demonstrated through numerical experiments.
|
|
16:15-16:30, Paper ThC19.4 | |
Dynamic Collision-Inclusive Modeling of a Multi-Rotor Aerial Vehicle Using Linear Complementarity Systems |
|
Abazari, Amirali | Arizona State University |
Kumar, Yogesh | Arizona State University |
Patnaik, Karishma | Arizona State University |
Zhang, Wenlong | Arizona State University |
Keywords: Modeling, Simulation, Mechanical systems/robotics
Abstract: Fast and discontinuous nature of collision has raised many challenges in modeling and control of mobile robots, especially for those with agile dynamic such as multi- rotor Micro Aerial Vehicles (MAVs). In this paper, a linear complementarity systems (LCS) model is developed to model collisions between a multi-rotor MAV and the surrounding environment. The contact properties and constants for the model are obtained by minimizing the error between the simulations and the experimental data. The derived collision model is then verified through a series of collision experiments using a custom-designed setup. Simulation results show that the developed model is capable of estimating the post-collision velocity with average errors lower than 0.1 m/s (for maximum collision velocity of 0.91 m/s and rebound velocity of 0.4 m/s) and the maximum normal contact force errors as low as 0.25 N (≤ 1% of the maximum normal force).
|
|
16:30-16:45, Paper ThC19.5 | |
Equivalent-Circuit Thermal Model for Batteries with One-Shot Parameter Identification |
|
Chowdhury, Myisha Ahmed | Texas Tech University |
Lu, Qiugang (Jay) | Texas Tech University |
Keywords: Energy systems, Modeling, Identification
Abstract: Accurate state of temperature (SOT) estimation for batteries is crucial for regulating their temperature within a desired range to ensure safe operation and optimal performance. The existing measurement-based methods often generate noisy signals and cannot scale up for large-scale battery packs. The electrochemical model-based methods, on the contrary, offer high accuracy but are computationally expensive. To tackle these issues, inspired by the equivalent-circuit voltage model for batteries, this paper presents a novel equivalent-circuit electro-thermal model (ECTM) for modeling battery surface temperature. By approximating the complex heat generation inside batteries with data-driven nonlinear (polynomial) functions of key measurable parameters such as state-of-charge (SOC), current, and terminal voltage, our ECTM is simplified into a linear form that admits rapid solutions. Such simplified ECTM can be readily identified with one single (one-shot) cycle data. The proposed model is extensively validated with benchmark NASA, MIT, and Oxford battery datasets. Simulation results verify the accuracy of the model, despite being identified with one-shot cycle data, in predicting battery temperatures robustly under different battery degradation status and ambient conditions.
|
|
16:45-17:00, Paper ThC19.6 | |
Limitations of Switching Dynamics in the Modeling of Cooperative Slung Load Transportation System |
|
Polese, Fabio | Università Degli Studi Di Roma La Sapienza |
Di Monaco, Giovanni | Sapienza University of Rome |
Zavoli, Alessandro | Sapienza University of Rome |
De Matteis, Guido | Sapienza University of Rome |
Keywords: Switched systems, Multivehicle systems, Flight control
Abstract: The limitations of approaches based on switching dynamics when dealing with physical constraints are investigated by analyzing the dynamics of a formation of aerial drones carrying a load via rectilinear and massless cables. A formulation based on the Udwadia-Kalaba equation is used to account for the constraints of the multiagent slung load system and a criterion on the maximum number of agents for effectively managing the switching dynamics is devised. The switching dynamics of formations consisting of two and four quadrotor agents are analyzed and discussed to assess the validity and suitability of the obtained results.
|
|
ThC20 |
Director's Row I |
Estimation and Filtering III |
Regular Session |
Chair: Chen, YangQuan | University of California, Merced |
Co-Chair: Bridgeman, Leila J. | Duke University |
|
15:30-15:45, Paper ThC20.1 | |
An Almost Globally Uniformly Asymptotically Stabilizing Geometric Nonlinear Filter for Angle and Bias Estimation on the Unit Circle |
|
Aslam, Farooq | Institute of Space Technology |
Haydar, Muhammad Farooq | Animal Dynamics Ltd |
Akhtar, Suhail | Institute of Space Technology |
Keywords: Algebraic/geometric methods, Kalman filtering, Aerospace
Abstract: This paper proposes a novel geometric nonlinear filter for angle and bias estimation on the unit circle S1. In the case of small estimation errors, the proposed filter reduces to the continuous-time deterministic S1-MEKF, that is, to the filter obtained by specializing the equations for the multiplicative extended Kalman filter (MEKF) to the unit circle. Moreover, the proposed generalized S1-MEKF has the distinctive property that it renders the desired equilibrium of the estimation error system to be almost globally uniformly asymptotically stable (AGUAS). More precisely, it drives the angular and bias estimation errors uniformly asymptotically to zero for almost all initial conditions except those in which the initial angular estimation error equals pi radians. Simulation results indicate that the generalized S1-MEKF has improved transient performance compared to the conventional S1-MEKF. Moreover, it guarantees (almost) global convergence through the use of curvature correction terms in both the filter gain and gain update equations.
|
|
15:45-16:00, Paper ThC20.2 | |
Online Learning-Driven Human Intent Estimation and Control for Human-Robot Interaction |
|
Ganie, Irfan Ahmad | Missouri University of Science and Technology Rolla MO 65401 |
Jagannathan, Sarangapani | Missouri Univ of Science & Tech |
Keywords: Human-in-the-loop control, Game theory, Learning
Abstract: This paper presents a novel Stackelberg-game theoretic multilayer-online learning framework for cooperative control of nonlinear Physical Human-Robot Interaction (pHRI), where the human is modeled as the leader guiding a robot follower. This hierarchical interaction is captured as a dynamic Stackelberg game, with the human's intention estimated in real-time through online multilayer neural networks (MNNs). We introduce SVD-based weight update laws for actor-critic MNNs, which approximate value functions and control inputs for both human and robot, eliminating the need for predefined basis functions. In this framework, the human objective is first inferred and used to guide the robot actions by shaping the robot control policy. The robot, acting as the follower, then adjusts its control inputs to optimize its own performance while adhering to the safety constraints and interaction dynamics dictated by the human leader inferred objectives. By applying Karush-Kuhn-Tucker (KKT) conditions to both cost functions, we develop a two-layer control structure that maintains the hierarchical nature of HRI while ensuring safety.
|
|
16:00-16:15, Paper ThC20.3 | |
Which Information Metric Is the Best, CRLB, FIM or EMGR for Optimal Mobile Sensing of a Diffusing Source? |
|
Giri, Sachin | MESA Lab at UC Merced |
Hollenbeck, Derek | UC Merced |
Chen, YangQuan | University of California, Merced |
Keywords: Information theory and control, Emerging control applications, Autonomous robots
Abstract: This study first explores the mathematical relationships between three information metrics: the Cramer-Rao Lower Bound (CRLB), the Fisher Information Matrix (FIM) and the empirical observability gramians (EMGR). Then we applied these three metrics respectively in localizing an emission or a diffusing source task by using a mobile sensor. We focus on maximizing the efficiency of the Maximum Likelihood Estimator (MLE) under the constraints imposed by information metrics, aiming to achieve minimal variance in our estimates. The sensor motion is planned to optimize the localization of emission source using two proposed methods: FIM and EMGR, and compared with CRLB. The MLE is dependent on the sensor measurement model, which further depends on the concentration, bias and sensor noise. Furthermore, the concentration of diffusing source at any point in space is obtained from the solution of the diffusion model. To accurately locate the emission source, the mobile sensor's path is determined by both proposed methods, which leverage estimates from Maximum Likelihood Estimation (MLE). These methods ultimately aim to minimize the time required to reduce the location error to a specified threshold level. We visualize and compare the results between these three methods.
|
|
16:15-16:30, Paper ThC20.4 | |
What Is a Relevant Signal-To-Noise Ratio for Numerical Differentiation? |
|
Verma, Shashank | University of Michigan |
Almuhaihi, Mohammad | University of Michigan |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Numerical algorithms, Estimation, Kalman filtering
Abstract: In applications that involve sensor data, a useful measure of signal-to-noise ratio (SNR) is the ratio of the root-mean-squared (RMS) signal to the RMS sensor noise. The present paper shows that, for numerical differentiation, the traditional SNR is ineffective. In particular, it is shown that, for a harmonic signal with harmonic sensor noise, a natural and relevant SNR is given by the ratio of the RMS of the derivative of the signal to the RMS of the derivative of the sensor noise. For a harmonic signal with white sensor noise, an effective SNR is derived. Implications of these observations for signal processing are discussed.
|
|
16:30-16:45, Paper ThC20.5 | |
Issues with Input-Space Representation in Nonlinear Data-Based Dissipativity Estimation |
|
LoCicero, Ethan | Duke University |
Bridgeman, Leila J. | Duke University |
Penne, Alexander | Duke University |
Keywords: Machine learning, Nonlinear systems identification, Identification for control
Abstract: In data-based control, dissipativity can be a powerful tool for attaining stability guarantees for nonlinear systems if that dissipativity can be inferred from data. This work provides a tutorial on several existing methods for data-based dissipativity estimation of nonlinear systems. The interplay between the underlying assumptions of these methods and their sample complexity is investigated. It is shown that methods based on delta-covering result in an intractable trade-off between sample complexity and robustness. A new method is proposed to quantify the robustness of machine learning-based dissipativity estimation. It is shown that this method achieves a more tractable trade-off between robustness and sample complexity. Several numerical case studies demonstrate the results.
|
|
16:45-17:00, Paper ThC20.6 | |
Initialization of Monocular Visual Navigation for Autonomous Agents Using Modified Structure from Small Motion (I) |
|
Florez, Juan-Diego | Georgia Institute of Technology |
Dor, Mehregan | Georgia Tech |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Estimation, Autonomous robots
Abstract: We propose a standalone monocular visual Simultaneous Localization and Mapping (vSLAM) initialization pipeline for autonomous space robots. Our method, a state-of-the-art factor graph optimization pipeline, extends Structure from Small Motion (SfSM) to robustly initialize a monocular agent in spacecraft inspection trajectories, addressing visual estimation challenges such as weak-perspective projection and center-pointing motion, which exacerbates the bas-relief ambiguity, dominant planar geometry, which causes motion estimation degeneracies in classical Structure from Motion, and dynamic illumination conditions, which reduce the survivability of visual information. We validate our approach on realistic, simulated satellite inspection image sequences with a tumbling spacecraft and demonstrate the method's effectiveness over existing monocular initialization procedures.
|
|
ThC21 |
Director's Row J |
Resiliency and Safety |
Regular Session |
Chair: Molnar, Tamas G. | Wichita State University |
Co-Chair: El-Farra, Nael H. | University of California, Davis |
|
15:30-15:45, Paper ThC21.1 | |
Safety for Time-Delay Systems Using Halanay-Type Conditions |
|
Reynaud, Olayo | GIPSA Lab, Université Grenoble Alpes |
Hably, Ahmad | GIPSA-Lab |
Maghenem, Mohamed Adlene | Gipsa Lab, CNRS, France |
Keywords: Control applications, Constrained control, Delay systems
Abstract: This paper proposes sufficient conditions for safety in the context of time-delay systems. In particular, when the set to render forward invariant is the zero super-level set of a scalar function, inequality constraints in terms of the scalar function and the system’s (delayed) dynamics are shown to guarantee forward invariance. Such conditions are inspired by recentlyestablished Halanay-type conditions for asymptotic stability. Our conditions are novel in the context of safety analysis, and are shown to be weaker to those in existing literature. Finally, we show how a QP-based design can be performed to enforce the proposed conditions in the context of control systems, ensuring forward invariance for the resulting closedloop system. Numerical simulations are included to support our theoretical results.
|
|
15:45-16:00, Paper ThC21.2 | |
Cyberattack-Aware Control Structure Screening for Controller-Actuator False Data Injection Attack Isolation |
|
Gajjar, Aatam | University of California, Davis |
Ellis, Matthew | University of California, Davis |
El-Farra, Nael H. | University of California, Davis |
Keywords: Fault diagnosis, Linear systems, Process Control
Abstract: This work presents a screening methodology that integrates cyberattack isolation capabilities as an additional criterion in the selection of the control system structure for processes subject to controller-actuator link attacks. We focus on a class of attack isolation schemes that utilize a bank of unknown input observers with dedicated residuals to identify false data injection attacks on the controller-actuator links. For this class of isolation schemes, the relationship between the control system structure and the ability of the isolation scheme to isolate specific controller-actuator link attacks is characterized. An attack isolation metric is introduced to quantify the isolation capabilities of different control structure candidates in terms of the number of controller-actuator links for which attacks can be isolated. A screening algorithm that systematically evaluates different control structure candidates using this metric is developed. The screening tool aims to support the selection of cyberattack-aware control structures that enable the isolation of controller-actuator link attacks. The developed screening methodology is applied to a simulated chemical process, and the attack isolation capabilities of a possible cyberattack-aware control system structure are demonstrated.
|
|
16:00-16:15, Paper ThC21.3 | |
Optimism Induction Attack on Deep Reinforcement Learning with Control Barrier Function Safety Filter for Autonomous Driving |
|
Lohrasbi, Saeedeh | University of Waterloo |
Khoshnevisan, Ladan | University of Waterloo |
Narayan, Apurva | University of Western Ontario |
L. Azad, Nasser | University of Waterloo |
Xiong, Pulei | National Research Council Canada |
Keywords: Autonomous systems, Reinforcement learning, Multivehicle systems
Abstract: As autonomous vehicles (AVs) increasingly incorporate Reinforcement Learning (RL) into their decision-making processes, ensuring both security and stability becomes paramount. This paper introduces the Optimism Induction Attack (OIA), a novel adversarial strategy specifically targeting Deep Reinforcement Learning (DRL) agents. In contrast to traditional adversarial attacks—such as the Fast Gradient Sign Method (FGSM), which primarily degrades overall performance—OIA strategically exploits the agent’s misperception of state safety, causing it to overestimate safety margins and consequently make suboptimal decisions in critical scenarios. While OIA is broadly applicable to any actor-critic RL algorithm, we conduct a case study on a Proximal Policy Optimization (PPO)-trained Adaptive Cruise Control (ACC) agent protected by Control Barrier Function (CBF). Our analysis examines system performance using metrics such as collision rate, jerk, and engine torque. The results reveal that OIA significantly undermines both safety and efficiency, emerging as a subtler and more effective adversarial threat than FGSM, as evidenced by increased collision rates despite the nominal safety guarantees provided by CBFs. This work advances the field of adversarial machine learning in AVs by highlighting the urgent need for more robust defense mechanisms capable of countering sophisticated attacks like OIA, particularly in safety-critical applications.
|
|
16:15-16:30, Paper ThC21.4 | |
Actuator-Enabling Attacks in Discrete-Event Systems with Unknown Supervisors |
|
Ma, Ziyue | Xidian University |
Giua, Alessandro (IEEE TAC Senior Editor) | IEEE Transactions on Automatic Control |
Seatzu, Carla | Univ. of Cagliari |
Keywords: Discrete event systems, Supervisory control, Automata
Abstract: In this work we study a property of resiliency in discrete event systems modeled by finite state automata. The closed-loop system consists of a plant and a supervisor which enforces a specification. There exists an external attacker who can eavesdrops the output of the system via a mask. The attacker has the knowledge of the exact plant model and knows the existence of the supervisor as well as the exact closed-loop language, but it does not have any knowledge of the supervisor nor the specification the supervisor is enforcing. The aim of the attacker is to let the system violate the specification by performing actuator-enabling attacks. We prove that the existence of such actuator-enabling harmful attacks can be verified by checking the supremal consistent supervisor.
|
|
16:30-16:45, Paper ThC21.5 | |
Safety-Critical Controller Synthesis with Reduced-Order Models |
|
Cohen, Max | California Institute of Technology |
Csomay-Shanklin, Noel | California Institute of Technology |
Compton, William | California Institute of Technology |
Molnar, Tamas G. | Wichita State University |
Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Lyapunov methods, Hierarchical control
Abstract: Reduced-order models (ROMs) provide lower dimensional representations of complex systems, capturing their salient features while simplifying control design. Building on previous work, this paper presents an overarching framework for the integration of ROMs and control barrier functions, enabling the use of simplified models to construct safety-critical controllers while providing safety guarantees for complex full-order models. To achieve this, we formalize the connection between full and ROMs by defining projection mappings that relate the states and inputs of these models and leverage simulation functions to establish conditions under which safety guarantees may be transferred from a ROM to its corresponding full-order model. The efficacy of our framework is illustrated through simulation results on a drone and hardware demonstrations on ARCHER, a 3D hopping robot.
|
|
16:45-17:00, Paper ThC21.6 | |
Necessary and Sufficient Certificates for Almost Sure Reachability |
|
Majumdar, R | MPI for Software Systems |
Venkatesan Ramesh, Sathiyanarayana | Max Planck Institute for Software Systems |
Soudjani, Sadegh | Max Planck Institute for Software Systems |
Keywords: Stochastic systems, Lyapunov methods
Abstract: We consider the almost sure reachability problem for discrete-time stochastic dynamical systems, which asks if a system reaches a given subset of its state space almost-surely (i.e., with probability one). We show necessary and sufficient conditions for almost sure reachability under suitable regularity assumptions on the system. Our conditions are in the spirit of Lyapunov theory, which reduces the problem of checking a global stability property of a dynamical system to checking properties of appropriate certificates. As certificates, we use supermartingales as estimates of the likelihood of the system's evolution and variant functions as measures of distances to the target set. Given candidate supermartingale and variant functions, our conditions provide locally checkable conditions for almost sure reachability. Our key insight is a converse theorem, which shows how to construct a suitable supermartingale and variant if a system satisfies the almost sure reachability property.
|
| |