| | |
Last updated on May 10, 2026. This conference program is tentative and subject to change
Technical Program for Wednesday May 27, 2026
| |
| WeA03 Tutorial Session, Grand Salon 3 |
Add to My Program |
| COIN: COmmunication-Aware INtegrated Learning, Estimation, and Control |
|
| |
| Chair: Johansson, Karl H. | KTH Royal Institute of Technology |
| Co-Chair: Mamduhi, Mohammad H. | University of Birmingham |
| Organizer: Maity, Dipankar | University of North Carolina at Charlotte |
| Organizer: Goswami, Debdipta | The Ohio State University |
| Organizer: Mamduhi, Mohammad H. | University of Birmingham |
| Organizer: Johansson, Karl H. | KTH Royal Institute of Technology |
| |
| 10:30-12:00, Paper WeA03.1 | Add to My Program |
| COIN: COmmunication-Aware INtegrated Learning, Estimation, and Control (I) |
|
| Maity, Dipankar | University of North Carolina at Charlotte |
| Goswami, Debdipta | The Ohio State University |
| Mamduhi, Mohammad H. | University of Birmingham |
| Johansson, Karl H. | KTH Royal Institute of Technology |
Keywords: Networked control systems, Identification for control, Quantized systems
Abstract: Future autonomous systems ranging from multi-robot networks to distributed satellite constellations must operate reliably under limited, delayed, and lossy communications. Traditional control and learning frameworks, however, assume perfect information flow, leaving a fundamental gap between the theory and the realities of modern cyber-physical systems. This tutorial introduces a unified framework for communication-constrained learning, estimation, and control, where sensing, estimation, learning, and decision-making are tightly coupled through shared, resource-limited communication channels. We begin with the theoretical foundations of system identi- fication under communication constraints, highlighting recent advances in operator-theoretic and learning-enabled modeling that yield provable performance guarantees even with degraded data. The second part focuses on communication-aware control design, covering sequential and joint control–estimation–communication co-design, adaptive quantization, and risk-aware robust control under bandwidth-limited, delay-prone, and unreliable communication. Finally, we demonstrate how these theories enable resilient guidance and control in robotics applications where nonlinear dynamics, limited bandwidth, and high-latency links pose critical challenges to autonomy. The tutorial combines theory, algorithms, and case studies to provide attendees with (i) a principled understanding of the quality–quantity trade-off in data-driven control, (ii) practical tools for implementing communication-aware controllers, and (iii) insights into emerging aerospace, autonomous cars, and robotic applications where these ideas are reshaping design paradigms.
|
| |
| WeA04 Regular Session, Grand Salon 4 |
Add to My Program |
| Aerospace I |
|
| |
| Chair: Bisheban, Mahdis | University of Calgary |
| Co-Chair: Atkins, Ella M. | University of Michigan |
| |
| 10:30-10:45, Paper WeA04.1 | Add to My Program |
| Safety-Critical Input-Constrained Nonlinear Intercept Guidance in Multiple Engagement Zones |
|
| Ranjan, Praveen Kumar | University of Texas at San Antonio |
| Sinha, Abhinav | The University of Cincinnati |
| Cao, Yongcan | University of Texas, San Antonio |
Keywords: Aerospace, Control applications
Abstract: This paper presents an input-constrained nonlinear guidance law to address the problem of intercepting a stationary target in contested environments with multiple defending agents. Contrary to prior approaches that rely on explicit knowledge of defender strategies or utilize conservative safety conditions based on a defender's range, our work characterizes defender threats geometrically through engagement zones that delineate inevitable-interception regions. Outside these engagement zones, the interceptor remains invulnerable. The proposed guidance law switches between a repulsive safety maneuver near these zones and a pursuit maneuver outside their influence. To deal with multiple engagement zones, we employ a smooth minimum function (log-sum-exponent approximation) that aggregates threats from all the zones while prioritizing the most critical threats. Input saturation is modeled and embedded in the non-holonomic vehicle dynamics so the controller respects actuator limits while maintaining stability. Numerical simulations with several defenders demonstrate the proposed method’s ability to avoid engagement zones and achieve interception across diverse initial conditions.
|
| |
| 10:45-11:00, Paper WeA04.2 | Add to My Program |
| Inversion-Free Adaptive Control of a Bicopter with Unknown Mass and Inertia |
|
| Portella Delgado, Jhon Manuel | University of Maryland Baltimore County |
| Goel, Ankit | University of Maryland Baltimore County |
Keywords: Adaptive control, Lyapunov methods, Aerospace
Abstract: This paper develops an inversion-free adaptive controller for stabilizing and tracking the trajectory of a bicopter system. In a bicopter system, the inertial parameters, that is, mass and moment of inertia, parameterize the input map. Since the classical adaptive backstepping technique requires inverting the input map, which would contain the parameter estimates, the stability of the closed-loop system cannot be guaranteed because the inversion of these estimates may result in division by zero. This paper proposes a novel technique to circumvent the inversion of parameter estimates in the control law. The resulting controller requires only the sign of the unknown parameters, which are trivially known to be positive in the case of a bicopter. The proposed controller is validated in simulation for a smooth and nonsmooth trajectory-tracking problem.
|
| |
| 11:00-11:15, Paper WeA04.3 | Add to My Program |
| An Adaptive Lyapunov-Based MPC Framework for Quadcopter Attitude Control with Real-World Experimental Validation |
|
| Protoulis, Teo | University of West Attica |
| Alexandridis, Alex | University of West Attica |
Keywords: Aerospace, Predictive control for nonlinear systems, Lyapunov methods
Abstract: In this work, a nonlinear adaptive Lyapunov-based model predictive control (NALMPC) framework for attitude regulation of quadcopters is presented. Through backstepping, a Lyapunov controller is synthesized that accounts for angular displacement constraints, while considering uncertain moments of inertia. The derived feedback laws are utilized to form a contractive constraint that is integrated into the optimal control problem (OCP) of the MPC protocol. Additionally, the moment of inertia adaptation laws are integrated into the OCP, thus resulting to a predictive model that is updated in real-time, based on the operating conditions. A numerical comparative case study against a standard nonlinear MPC (NMPC) formulation is presented, where NALMPC outperforms NMPC in terms of tracking error and computational load. Finally, experimental validation on a real-world quadcopter demonstrates the efficacy of the proposed framework in terms of tracking and computational efficiency.
|
| |
| 11:15-11:30, Paper WeA04.4 | Add to My Program |
| Trajectory Planning for Contingency Landing Using Optimal Control |
|
| Kim, Heejin | Virginia Tech |
| Atkins, Ella M. | University of Michigan |
Keywords: Optimal control, Aerospace, Flight control
Abstract: This paper presents an optimal control framework for emergency landing trajectory generation under loss-of-thrust conditions. The method incorporates population density maps to minimize ground risk while ensuring feasibility with a six-degree-of-freedom nonlinear dynamics model. Trajectories are initialized using either discrete search or Dubins paths. Time-dependent and time-independent cost functions are evaluated. Using real population density data from the New York City region, results show that excluding a time cost favors longer trajectories that bypass densely populated regions, while including time in the cost function reduces flight duration at the expense of greater population exposure. Closed-loop simulations confirm feasibility and consistently lower costs compared to discrete search and Dubins solutions. The study highlights the importance of accurate initialization for optimal control as small errors in reference trajectory position or yaw can result in an adverse optimal control gradient and in turn infeasible results. Convergence and computational considerations suggest optimal control be deployed in preflight planning where computational and supervisory requirements can be safely accommodated.
|
| |
| 11:30-11:45, Paper WeA04.5 | Add to My Program |
| Realtime Wind Estimation Using Low Cost Quadrotor Uncrewed Aerial Vehicles |
|
| Udagedara, Hiranya | University of Calgary |
| Bisheban, Mahdis | University of Calgary |
Keywords: Kalman filtering, Estimation, Modeling
Abstract: In environmental monitoring as well as emergency response applications such as wildfires, wind velocity measurement is essential. Quadrotor UAVs have become popular platforms for wind velocity estimation due to their maneuverability, compact size, and cost-effectiveness. Numerous studies use the Extended Kalman Filter (EKF) to estimate the wind velocity based on the quadrotor dynamic model. However, EKF performance is constrained by its reliance on linearized approximations of the nonlinear quadrotor dynamics around current states, limiting accuracy in highly nonlinear scenarios, including windy conditions. This study proposes the use of an Unscented Kalman Filter (UKF), a nonlinear estimator to provide accurate wind estimations while maintaining the trajectory of the quadrotor UAV. The quadrotor is modeled on the Special Euclidean group SE(3) and the approach is evaluated through numerical simulations using a geometric controller to maintain quadrotor flight paths, under different wind conditions. The results indicate that the UKF consistently outperforms the EKF, reducing wind velocity estimation error by 11.8% under near-linear flight conditions, and 25.5% under strongly nonlinear conditions. This demonstrates the potential of the UKF as a reliable estimator for highly nonlinear scenarios, capable of maintaining the trajectory with minimal deviation while providing accurate wind velocity estimations.
|
| |
| WeA05 Tutorial Session, Grand Salon 6 |
Add to My Program |
Control Barrier Functions for Flight-Critical Systems in Aerospace
Applications |
|
| |
| Chair: Hussain, Heather | The Boeing Company |
| Co-Chair: Cohen, Max | North Carolina State University |
| Organizer: Cohen, Max | North Carolina State University |
| Organizer: Hussain, Heather | The Boeing Company |
| |
| 10:30-12:00, Paper WeA05.1 | Add to My Program |
| Control Barrier Functions for Flight-Critical Systems in Aerospace Applications (I) |
|
| Cohen, Max | North Carolina State University |
| Hussain, Heather | The Boeing Company |
| Molnar, Tamas G. | Wichita State University |
| van Wijk, David E. J. | California Institute of Technology |
| Gaudio, Joseph | Boeing |
| Fisher, Peter | Massachusetts Institute of Technology |
| Menner, Marcel | Aurora Flight Sciences (A Boeing Company) |
| Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
| Autenrieb, Johannes | German Aerospace Center (DLR) |
| Lavretsky, Eugene | The Boeing Co |
| Wise, Kevin A. | Boeing Defense and Space Systems |
| Ratliff, Ryan T. | The Boeing Company |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Aerospace
Abstract: We present a tutorial focused on the applications of control barrier functions (CBFs) to flight-critical aerospace systems. This tutorial is motivated by the need for mathematically justified approaches to state and input limiting on various aerospace systems, much of which is currently accomplished using heuristic techniques. Here, we illustrate how CBFs -- originally motivated by applications in robotics -- provide a natural framework for constrained control of aerospace systems. Our tutorial encompasses both the theoretical foundations of CBFs as well as real-world aerospace applications of CBFs.
|
| |
| WeA06 Regular Session, Grand Salon 7 |
Add to My Program |
| Game Theory I |
|
| |
| Chair: Vasconcelos, Marcos M. | Florida State University |
| Co-Chair: Shishika, Daigo | George Mason University |
| |
| 10:30-10:45, Paper WeA06.1 | Add to My Program |
| Linear Programming Approach to Deceptive Path Planning Game with Goal Selection |
|
| Rostobaya, Violetta | George Mason University |
| Guan, Yue | Georgia Institute of Technology |
| Berneburg, James | George Mason University |
| Shishika, Daigo | George Mason University |
Keywords: Game theory, Agents-based systems, Information theory and control
Abstract: In adversarial settings, a mobile agent may strategically plan its motion to influence an opponent’s inference about its intended goal. We study deceptive path planning in a scenario where a mobile agent aims to reach a privately selected goal while an adversarial observer allocates limited defensive resources based on the observed trajectory. Unlike classical path-planning and goal-recognition approaches that model observers as passive inference process, our game-theoretic formulation models them as strategic decision-makers. For the resulting dynamic asymmetric-information game, we develop an efficient solution method that combines a linear programming formulation with the Double Oracle algorithm. To evaluate performance, we introduce metrics that quantify both the risk and the effectiveness of deception and provide illustrative numerical examples.
|
| |
| 10:45-11:00, Paper WeA06.2 | Add to My Program |
| Game-To-Real Gap: Quantifying the Effect of Model Misspecification in Network Games |
|
| Ferguson, Bryce L. | Dartmouth College |
| Maheshwari, Chinmay | Johns Hopkins University |
| Wu, Manxi | University of California Berkeley |
| Sastry, Shankar | Univ. of California at Berkeley |
Keywords: Game theory, Agents-based systems, Network analysis and control
Abstract: Game-theoretic models and solution concepts provide rigorous tools for predicting collective behavior in multi-agent systems. In practice, however, different agents may rely on different game-theoretic models to design their strategies. As a result, when these heterogeneous models interact, the realized outcome can deviate substantially from the outcome each agent expects based on its own local model. In this work, we introduce the game-to-real gap, a new metric that quantifies the impact of such model misspecification in multi-agent environments. The game-to-real gap is defined as the difference between the utility an agent actually obtains in the multi-agent environment (where other agents may have misspecified models) and the utility it expects under its own game model. Focusing on quadratic network games, we show that misspecifications in either (i) the external shock or (ii) the player interaction network can lead to arbitrarily large game-to-real gaps. We further develop novel network centrality measures that allow exact evaluation of this gap in quadratic network games. Our analysis reveals that standard network centrality measures fail to capture the effects of model misspecification, underscoring the need for new structural metrics that account for this limitation. Finally, through illustrative numerical experiments, we show that existing centrality measures in network games may provide a counterintuitive understanding of the impact of model misspecification.
|
| |
| 11:00-11:15, Paper WeA06.3 | Add to My Program |
| Privacy-Preserving Nash Equilibrium Synthesis with Partially Ordered Temporal Objectives |
|
| Probine, Caleb | The University of Texas at Austin |
| Kulkarni, Abhishek | Vijil, Inc |
| Topcu, Ufuk | The University of Texas at Austin |
Keywords: Game theory, Autonomous systems, Automata
Abstract: Nash equilibrium is a central solution concept for reasoning about self-interested agents. We study the synthesis of Nash equilibria in two-player deterministic games on graphs, where players have private, partially-ordered preferences on temporal goals. Unlike prior work, which assumes preferences are common knowledge, we develop a communication protocol for equilibrium synthesis in settings where players' preferences are private information. In the protocol, players communicate to synthesize equilibria by exchanging information about when they can force desirable outcomes. We incorporate privacy by ensuring the protocol stops before enough information is revealed to expose players' preferences. We prove completeness by showing that, when no player halts communication, the protocol either returns an equilibrium or certifies that none exist. We then prove privacy by showing that, with stopping, the messages a player sends are always consistent with multiple preference relations and thus do not reveal some given secret regarding a player's true preference ordering. Experiments show that we can synthesize non-trivial equilibria while preserving privacy of preferences, showing the protocol’s potential for applications in strategy synthesis with constrained information sharing.
|
| |
| 11:15-11:30, Paper WeA06.4 | Add to My Program |
| Designing Inferable Signaling Schemes for Bayesian Persuasion |
|
| Probine, Caleb | The University of Texas at Austin |
| Karabag, Mustafa O. | The University of Texas at Austin |
| Topcu, Ufuk | The University of Texas at Austin |
Keywords: Game theory, Learning, Information theory and control
Abstract: In Bayesian persuasion, an informed sender, who observes a state, commits to a randomized signaling scheme that guides a self-interested receiver's actions. Classical models assume the receiver knows the commitment. Instead, we study a setting where the receiver infers the scheme from repeated interactions. We bound the sender’s performance loss relative to the known-commitment case by a term that grows with the signal space size and shrinks as the receiver’s optimal actions become more distinct. We then lower bound the samples required for the sender to approximately achieve their known-commitment utility in the inference setting. We show that the sender in persuasion requires more samples than the leader in a Stackelberg game, which includes commitment but lacks signaling. Motivated by these bounds, we propose two methods for designing inferable signaling schemes, one being stochastic gradient descent (SGD) on the sender’s inference-setting utility, and the other being optimization with a boundedly-rational receiver model. SGD performs best in low-interaction regimes, but applying the boundedly-rational model and tuning the rationality constant remains a flexible method for designing inferable schemes. Applied in a safety-alert example, SGD finds schemes that have fewer signals and make citizens’ optimal actions more distinct than in the known-commitment case.
|
| |
| 11:30-11:45, Paper WeA06.5 | Add to My Program |
| Deceptive Planning Exploiting Inattention Blindness |
|
| Karabag, Mustafa O. | The University of Texas at Austin |
| Milzman, Jesse | DEVCOM Army Research Laboratory |
| Topcu, Ufuk | The University of Texas at Austin |
Keywords: Markov processes, Game theory, Information theory and control
Abstract: We study decision-making with rational inattention in settings where agents have perception constraints. In such settings, inaccurate prior beliefs or models of others may lead to inattention blindness, where an agent is unaware of its incorrect beliefs. We model this phenomenon in two-player zero-sum stochastic games, where Player 1 has perception constraints and Player 2 deceptively deviates from its security policy presumed by Player 1 to gain an advantage. We formulate the perception constraints as an online sensor selection problem, develop a value-weighted objective function for sensor selection capturing rational inattention, and propose the greedy algorithm for selection under this monotone objective function. When Player 2 does not deviate from the presumed policy, this objective function provides an upper bound on the expected value loss compared to the security value where Player 1 has perfect information of the state. We then propose a myopic decision-making algorithm for Player 2 to exploit Player 1's beliefs by deviating from the presumed policy and, thereby, improve upon the security value. Numerical examples illustrate how Player 1 persistently chooses sensors that are consistent with its priors, allowing Player 2 to systematically exploit its inattention.
|
| |
| 11:45-12:00, Paper WeA06.6 | Add to My Program |
| Log-Linear Learning for Coordination with Heterogeneous Bounded Rationalities |
|
| Rajab, Fathy Omar | Florida State University |
| Anubi, Olugbenga Moses | Florida State University |
| Vasconcelos, Marcos M. | Florida State University |
Keywords: Game theory, Networked control systems, Iterative learning control
Abstract: This work investigates a two-player coordination game in which the players exhibit heterogeneous levels of bounded rationality. We analyze the log-linear learning dynamics, where the probability distribution used to select which of the agents gets to revise its strategy is fixed but not necessarily uniform. The stationary distribution of the resulting Markov chain on the strategy profile space is derived in closed-form as a function of the rationalities and the agent selection probabilities. We proceed by showing that adjusting the selection probabilities can be used to bias the stationary distribution toward the potential-maximizing state. However, this optimization comes at the cost of a reduced convergence rate, whereas the uniform selection probabilities uniquely maximizes the convergence speed irrespective of the players’ rationality levels. To address this trade-off, a Pareto-optimal probability selection rule is proposed, balancing the distributional bias with convergence rate. Moreover, it is shown that in coordination games, high levels of rationality sometimes accelerate convergence, whereas in other cases they may paradoxically hinder the convergence rate of the log-linear learning dynamics.
|
| |
| WeA07 Invited Session, Grand Salon 9 |
Add to My Program |
| Autonomous Risk-Aware Perception, Planning, and Control |
|
| |
| Chair: Motee, Nader | Lehigh University |
| Co-Chair: Liu, Guangyi | Amazon Robotics |
| Organizer: Liu, Guangyi | Amazon Robotics |
| Organizer: Li, Na | Harvard University |
| Organizer: Topcu, Ufuk | The University of Texas at Austin |
| Organizer: Zavlanos, Michael M. | Duke University |
| Organizer: Motee, Nader | Lehigh University |
| |
| 10:30-10:45, Paper WeA07.1 | Add to My Program |
| Risk-Aware Safety Filters with Poisson Safety Functions and Laplace Guidance Fields (I) |
|
| Bahati, Gilbert | California Institute of Technology |
| Bena, Ryan | California Institute of Technology |
| Wilkinson, Meg | California Institute of Technology |
| Mestres, Pol | California Institute of Technology |
| Cosner, Ryan | California Institute of Technology |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Vision-based control, Autonomous systems, Constrained control
Abstract: Robotic systems navigating in real-world settings require a semantic understanding of their environment to properly determine safe actions. This work aims to develop the mathematical underpinnings of such a representation---specifically, the goal is to develop safety filters that are risk-aware. To this end, we take a two step approach: encoding an understanding of the environment via Poisson’s equation, and associated risk via Laplace guidance fields. That is, we first solve a Dirichlet problem for Poisson’s equation to generate a safety function that encodes system safety as its 0-superlevel set. We then separately solve a Dirichlet problem for Laplace’s equation to synthesize a safe guidance field that encodes variable levels of caution around obstacles---by enforcing a tunable flux boundary condition. The safety function and guidance fields are then combined to define a safety constraint and used to synthesize a risk-aware safety filter which, given a semantic understanding of an environment with associated risk levels of environmental features, guarantees safety while prioritizing avoidance of higher risk obstacles. We demonstrate this method in simulation and discuss how a priori understandings of obstacle risk can be directly incorporated into the safety filter to generate safe behaviors that are risk-aware.
|
| |
| 10:45-11:00, Paper WeA07.2 | Add to My Program |
| A Formal Gatekeeper Framework for Safe Dual Control with Active Exploration (I) |
|
| Naveed, Kaleb Ben | University of Michigan, |
| Agrawal, Devansh Ramgopal | University of Michigan |
| Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Robust adaptive control, Autonomous robots
Abstract: Planning safe trajectories under model uncertainty is a fundamental challenge. Robust planning ensures safety by considering worst-case realizations, yet ignores uncertainty reduction and leads to overly conservative behavior. Actively reducing uncertainty on-the-fly during a nominal mission defines the dual control problem. Most approaches address this by adding a weighted exploration term to the cost, tuned to trade off the nominal objective and uncertainty reduction, but without formal consideration of when exploration is beneficial. Moreover, safety is enforced in some methods but not in others. We propose a framework that integrates robust planning with active exploration under formal guarantees as follows: The key innovation and contribution is that exploration is pursued only when it provides a verifiable improvement without compromising safety. To achieve this, we utilize our earlier work on gatekeeper as an architecture for safety verification, and extend it so that it generates both safe and informative trajectories that reduce uncertainty and the cost of the mission, or keep it within a user-defined budget. The methodology is evaluated via simulation case studies on the online dual control of a quadrotor under parametric uncertainty.
|
| |
| 11:00-11:15, Paper WeA07.3 | Add to My Program |
| Bayesian Risk-Aware CBFs for Discrete-Time Stochastic Systems with Learned Dynamics (I) |
|
| Hoxha, Bardh | Toyota NA R&D |
| Black, Mitchell | Toyota NA R&D |
| Majd, Keyvan | Toyota NA R&D |
| Okamoto, Hideki | Toyota NA R&D |
| Fainekos, Georgios | Toyota NA R&D |
| Prokhorov, Danil | Toyota NA R&D |
Keywords: Stochastic systems, Statistical learning, Robust control
Abstract: We study safety for stochastic systems under sampled–data control and learned dynamics. We develop Bayesian Risk‑Aware Control Barrier Functions (RA‑CBFs) for discrete time. First, we give two guarantees for barrier crossing over a finite horizon: (i) a time‑uniform martingale bound using Ville’s inequality and a predictable variance budget, and (ii) a tighter pathwise bound that recovers the continuous‑time RA‑CBF margin via DDS time change and the reflection principle. Second, we propagate posterior uncertainty in drift and diffusion into an upper confidence bound (UCB) on the generator and derive a closed-form inter-sample robustness margin. Third, we synthesize a minimally invasive controller via a convex QP that enforces risk and uncertainty thresholds jointly. The result is a high-confidence, finite-horizon safety bound and a practical sampled-data controller that, under the stronger pathwise noise assumption, recovers the continuous-time RA-CBF margin while avoiding supermartingale-based S-CBF conditions.
|
| |
| 11:15-11:30, Paper WeA07.4 | Add to My Program |
| Delay-Independent Safe Control with Neural Networks: Positive Lur’e Certificates for Risk-Aware Autonomy (I) |
|
| Montazeri Hedesh, Hamidreza | Northeastern University |
| Siami, Milad | Northeastern University |
Keywords: Neural networks, Delay systems, Uncertain systems
Abstract: We present a risk-aware safety certification method for autonomous, learning-enabled control systems. Focusing on two realistic risks, state/input delays and interval matrix uncertainty, we model the neural network (NN) controller with local sector bounds and exploit positivity structure to derive linear, delay-independent certificates that guarantee local exponential stability across admissible uncertainties. To benchmark performance, we adopt and implement a state-of-the-art IQC NN-verification pipeline. In representative cases, our positivity-based tests run orders of magnitude faster than SDP-based IQC while certifying regimes the latter cannot, providing scalable safety guarantees that complement risk-aware control.
|
| |
| 11:30-11:45, Paper WeA07.5 | Add to My Program |
| Optimism As Risk-Seeking in Multi-Agent Reinforcement Learning (I) |
|
| Zhang, Runyu | Massachusetts Institute of Technology |
| Li, Na | Harvard University |
| Ozdaglar, Asu | MIT |
| Shamma, Jeff S. | University of Illinois at Urbana-Champaign |
| Zardini, Gioele | Massachusetts Institute of Technology |
Keywords: Reinforcement learning, Decentralized control, Cooperative control
Abstract: Risk sensitivity has become a central theme in reinforcement learning (RL), where convex risk measures and robust formulations provide principled ways to model preferences beyond expected return. Recent extensions to multi-agent RL (MARL) have largely emphasized the risk-averse setting, prioritizing robustness to uncertainty. In cooperative MARL, however, such conservatism often leads to suboptimal equilibria, and a parallel line of work has shown that optimism can promote cooperation. Existing optimistic methods, though effective in practice, are typically heuristic and lack theoretical grounding. Building on the dual representation for convex risk measures, we propose a principled framework that interprets risk-seeking objectives as optimism.We introduce optimistic value functions, which formalize optimism as divergence-penalized risk-seeking evaluations. Building on this foundation, we derive a policy-gradient theorem for optimistic value functions, including explicit formulas for the entropic risk/KLpenalty setting, and develop decentralized optimistic actorcritic algorithms that implement these updates. Empirical results on cooperative benchmarks demonstrate that riskseeking optimism consistently improves coordination over both risk-neutral baselines and heuristic optimistic methods. Our framework thus unifies risk-sensitive learning and optimism, offering a theoretically grounded and practically effective approach to cooperation in MARL.
|
| |
| 11:45-12:00, Paper WeA07.6 | Add to My Program |
| Risk-Budgeted Control Framework for Balanced Performance and Safety in Autonomous Vehicles |
|
| Chang, Pei Yu | The Ohio State University |
| Renganathan, Vishnu | The Ohio State University |
| Ahmed, Qadeer | The Ohio State University |
Keywords: Autonomous systems, Automotive systems, Automotive control
Abstract: This paper presents a hybrid control framework with a risk-budgeted monitor for safety-certified autonomous driving. A sliding-window monitor tracks insufficient barrier residuals and triggers switching from a relaxed control barrier function (R-CBF) to a more conservative conditional value-at-risk CBF (CVaR-CBF) when the safety margin deteriorates. Two real-time triggers are considered: feasibility-triggered (FT), which activates CVaR-CBF when the R-CBF problem is reported infeasible, and quality-triggered (QT), which switches when the residual falls below a prescribed safety margin. The framework is evaluated with model predictive control (MPC) under vehicle localization noise and obstacle position uncertainty across multiple AV-pedestrian interaction scenarios with 1,500 Monte Carlo runs. In the most challenging case with 5 m pedestrian detection uncertainty, the proposed method achieves a 94--96% collision-free success rate over 300 trials while maintaining the lowest mean cross-track error (CTE = 3.2--3.6 m), indicating faster trajectory recovery after obstacle avoidance and a favorable balance between safety and performance.
|
| |
| WeA08 Regular Session, Grand Salon 10-13 |
Add to My Program |
| Distributed Control |
|
| |
| Chair: Li, Jing Shuang (Lisa) | University of Michigan |
| Co-Chair: Matni, Nikolai | University of Pennsylvania |
| |
| 10:30-10:45, Paper WeA08.1 | Add to My Program |
| Distributed Continuous-Time Control Via System Level Synthesis |
|
| Du, Yaozhi | University of Michigan |
| Li, Jing Shuang (Lisa) | University of Michigan |
Keywords: Distributed control, Linear systems, Optimal control
Abstract: This paper designs H2 and H-Infinitydistributed controllers with local communication and local disturbance rejection. We propose a two-step procedure: first, select closed-loop poles; then, optimize over parameterized controllers. We build on the system level synthesis (SLS) parameterization --- primarily used in the discrete-time setting --- and extend it to the general continuous-time setting. We verify our approach in simulation on a 9-node grid governed by linearized swing equations, where our distributed controllers achieve performance comparable to that of optimal centralized controllers while facilitating local disturbance rejection.
|
| |
| 10:45-11:00, Paper WeA08.2 | Add to My Program |
| Scalable Distributed Nonlinear Control under Flatness-Preserving Coupling |
|
| Yang, Fengjun | University of Pennsylvania |
| Welde, Jake | Cornell University |
| Matni, Nikolai | University of Pennsylvania |
Keywords: Distributed control, Feedback linearization, Large-scale systems
Abstract: We study distributed control for a network of nonlinear, differentially flat subsystems subject to dynamic coupling. Although differential flatness simplifies planning and control for isolated subsystems, the presence of coupling can destroy this property for the overall joint system. Focusing on subsystems in pure-feedback form, we identify a class of compatible lower-triangular dynamic couplings that preserve flatness and guarantee that the flat outputs of the subsystems remain the flat outputs of the coupled system. Further, we show that the joint flatness diffeomorphism can be constructed from those of the individual subsystems and, crucially, its sparsity structure reflects that of the coupling. Exploiting this structure, we synthesize a distributed tracking controller that computes control actions from local information only, thereby ensuring scalability. We validate our proposed framework on a simulated example of planar quadrotors dynamically coupled via aerodynamic downwash, and show that the distributed controller achieves accurate trajectory tracking.
|
| |
| 11:00-11:15, Paper WeA08.3 | Add to My Program |
| Explicit Distributed MPC: Reducing Computation and Communication Load by Exploiting Facet Properties |
|
| Brahmbhatt, Parth | University of Wisconsin Madison |
| Ganesh, Hari | Indian Institute of Technology Gandhinagar, |
| Avraamidou, Styliani | University of Wisconsin Madison |
Keywords: Distributed control, Cooperative control, Decentralized control
Abstract: Classical Distributed Model Predictive Control (DiMPC) requires multiple iterations to achieve convergence, leading to high computational and communication burdens. This work focuses on the improvement of an iteration-free distributed MPC methodology that minimizes computational effort and communication load. The aforementioned methodology leverages multiparametric programming to compute explicit control laws offline for each subsystem, enabling real-time control without iterative data exchanges between subsystems. Extending our previous work on iteration-free DiMPC, here we introduce a FAcet-based Critical region Exploration Technique for iteration-free DiMPC (FACET-DiMPC) that further reduces computational complexity by leveraging facet properties to do targeted critical region exploration. Simulation results demonstrate that the developed method achieves comparable control performance to centralized methods, while significantly reducing communication overhead and computation time. In particular, the proposed methodology offers substantial efficiency gains in terms of the average computation time reduction of 98% compared to classic iterative DiMPC methods and 42% compared to iteration-free DiMPC methods, making it well-suited for real-time control applications with tight latency and computation constraints.
|
| |
| 11:15-11:30, Paper WeA08.4 | Add to My Program |
| A Graph-Based Classification Approach for the Adaptive Decomposition of Model Predictive Control Problems |
|
| Cooper, Elliot | University of Michigan |
| Allman, Andrew | University of Michigan |
Keywords: Distributed control, Machine learning, Adaptive control
Abstract: Distributed model predictive control (DMPC) has emerged as a computationally efficient alternative to centralized model predictive control (CMPC) by decomposing the centralized optimal control problem into a collection of constituent subsystem controllers. The subsystem controllers can be solved in parallel and coordinated to reach a consensus solution, significantly reducing computational cost at the potential expense of control quality relative to CMPC. In this paper, we demonstrate that state measurements and operational objectives, which serve as parameters in the optimal control problem of each distributed subsystem controller, are strongly correlated with the quality of control achieved by a particular DMPC architecture. We propose a novel graph-based classifier that selects the most suitable DMPC architecture based on these time-varying parameters and integrate this selection process into the online control loop. This adaptive approach dynamically responds to process variations, enhancing control performance and mitigating the potential loss in control quality typically associated with DMPC.
|
| |
| 11:30-11:45, Paper WeA08.5 | Add to My Program |
| Discrete-Time Robust Cooperative Output Regulation by Dynamic Measurement Output Feedback |
|
| Gul, Kursad Metehan | Utah State University |
| Sarsilmaz, Selahattin Burak | Utah State University |
Keywords: Distributed control, Output regulation, LMIs
Abstract: Recent work [1] by the authors has considered robust cooperative output regulation of discrete-time uncertain heterogeneous (in dimension) multi-agent systems (MASs) using the distributed internal model approach, assuming that each follower has full access to its own state. This paper extends our discrete-time robust cooperative output regulation results to the case wherein followers have partial access to their states. Specifically, a detectability condition is added, and each follower employs a local state observer. We first show that the solvability of the robust cooperative output regulation problem (RCORP) with the internal model-based distributed dynamic measurement output feedback control law of interest reduces to the solvability of stabilization problems, including a structured stabilization problem. Then, we leverage our recent results, which cover the existence and design of the corresponding structured control gain, and extend both global and agent-wise local conditions to solve the RCORP via distributed dynamic measurement output feedback. Finally, we provide a numerical example of a MAS comprising three quadrotors and one uninhabited ground vehicle to illustrate the resulting agent-wise local design method.
|
| |
| 11:45-12:00, Paper WeA08.6 | Add to My Program |
| Geometry-Aware Decentralized Sinkhorn for Wasserstein Barycenters |
|
| Baheri, Ali | Rochester Institute of Technology |
| Millard, David | Rochester Institute of Technology |
| Vahid, Alireza | Rochester Institute of Technology |
Keywords: Decentralized control, Distributed control, Communication networks
Abstract: Distributed systems require fusing heterogeneous local probability distributions into a global summary over sparse and unreliable communication networks. Traditional consensus algorithms, which average distributions in Euclidean space, ignore their inherent geometric structure, leading to misleading results. Wasserstein barycenters offer a geometry-aware alternative by minimizing optimal transport costs, but their entropic approximations via the Sinkhorn algorithm typically require centralized coordination. This paper proposes a fully decentralized Sinkhorn algorithm that reformulates the centralized geometric mean as an arithmetic average in the log-domain, enabling approximation through local gossip protocols. Agents exchange log-messages with neighbors, interleaving consensus phases with local updates to mimic centralized iterations without a coordinator. To optimize bandwidth, we integrate event-triggered transmissions and b-bit quantization, providing tunable trade-offs between accuracy and communication while accommodating asynchrony and packet loss. Under mild assumptions, we prove convergence to a neighborhood of the centralized entropic barycenter, with bias linearly dependent on consensus tolerance, trigger threshold, and quantization error. Complexity scales near-linearly with network size. Simulations confirm near-centralized accuracy with significantly fewer messages, across various topologies and conditions.
|
| |
| WeA09 Invited Session, Grand Salon 12 |
Add to My Program |
| Battery Systems, Estimation & Energy Management |
|
| |
| Chair: Chen, Jun | Oakland University |
| Co-Chair: Gupta, Shobhit | General Motors |
| Organizer: Chen, Jun | Oakland University |
| Organizer: Gupta, Shobhit | General Motors |
| Organizer: Chang, Insu | General Motors LLC |
| Organizer: Wang, Zejiang | The University of Texas at Dallas |
| |
| 10:30-10:45, Paper WeA09.1 | Add to My Program |
| Dense Extended Kalman Filter for Simultaneous Cell State-Parameter Estimation (I) |
|
| Nuculaj, Luke | Oakland University |
| Chen, Jun | Oakland University |
Keywords: Kalman filtering, Estimation, Energy systems
Abstract: This work investigates the use of dense extended Kalman filter (DEKF) to simultaneously estimate both the state-of-charge (SOC) and equivalent-circuit parameters of serial-connected, lithium-ion battery cells under limited voltage measurement. Stability analyses are conducted to certify the reliability of the DEKF for simultaneous state-parameter estimation under certain mild assumptions. To begin, the local asymptotic stability of the expected dense error in the presence of nonlinearity is verified by selection of an appropriate Lyapunov function. Then the expected sparse error is shown to be marginally stable in the absence of nonlinearities by proving that the spectral radius of the sparse expected error dynamics is exactly equal to 1. Finally, a sufficient condition for sparse stability in the presence of nonlinearity is obtained via selection of an appropriate Lyapunov function. Simulation trials under several different scenarios are executed, confirming that the associated error dynamics are indeed stable for the duration of the simulation time.
|
| |
| 10:45-11:00, Paper WeA09.2 | Add to My Program |
| Battery Remaining-Useful-Life Estimation with Probabilistic Transformers (I) |
|
| Chen, Evan | Purdue University |
| Chellapandi, Vishnu Pandi | Cummins Inc |
| Borhan, Hoseinali | Cummins Inc |
| Brinton, Christopher | Purdue University |
Keywords: Neural networks, Optimization, Adaptive control
Abstract: Accurate remaining useful life (RUL) prediction is essential for battery safety and reliability, yet point forecasts can be misleading under nonstationarity and domain shift. We propose QE–RULMamba, a probabilistic, transformer-based state-space forecaster that ensembles multiple experts under a fixed parameter budget and enforces quantile diversity to capture prognostic uncertainty. From capacity forecasts, we compute actionable RUL via a lower-confidence-bound (LCB) search that respects simple temporal and throughput constraints and exposes a tunable risk knob for operations. Evaluated on the NASA and TJU datasets, QE–RULMamba achieves consistently lower MAE/RMSE and higher R^2 than competitive sequence models and a non-quantile ensemble, while yielding calibrated intervals, tight in-distribution and appropriately wider under shift. The resulting confidence-aware RUL curves track oracle behavior in stable regimes and become deliberately conservative when degradation dynamics are uncertain, enabling practical triggers (e.g., early inspection when bounds approach the end-of-life threshold) and underscoring the value of uncertainty-aware forecasting for real-world battery management.
|
| |
| 11:00-11:15, Paper WeA09.3 | Add to My Program |
| A Multi-Objective Optimization Based Energy Management System (EMS) for Improving Battery Health in Battery-Supercapacitor Electric Vehicles (I) |
|
| Upadhyaya, Ashruti | Indian Institute of Technology, Guwahati |
| Mahanta, Chitralekha | Indian Institute of Technology, Guwahati |
Keywords: Control applications, Automotive control, Hybrid systems
Abstract: This paper proposes a novel multi-objective optimization based energy management system (EMS) for battery/ Supercapacitor (SC) Electric Vehicles (EVs). Two conflicting objectives, viz. minimization of battery state of charge (SOC) degradation and reduction of SC power loss are considered for allocating power optimally between battery and SC. The choice of objective functions is based on the fact that battery is the main energy source of the EV and SC is used for supporting the battery during the journey. The primary objective is to minimize battery degradation and reduce overall system losses by minimizing SC power loss. This will prevent overburdening of the auxiliary energy source i.e. SC and improve system performance. The proposed optimal control problem is solved by applying Dynamic Programming (DP) method. A Pareto front is generated displaying the trade-offs between the two objective functions which can play an important role in benchmarking other real-time control techniques. A throughput based dynamic model is also utilized for studying the battery State of Health (SOH) degradation in varying conditions. Simulation study and numerical investigations are conducted by altering the priority of the two objective functions and performance of the proposed technique is evaluated. The proposed Energy Management System (EMS) is compared with the EMS using Filter based Strategy (FBS) and battery only system.
|
| |
| 11:15-11:30, Paper WeA09.4 | Add to My Program |
| High-Fidelity Battery Capacity Estimation Method Using Recurrent Neural Network and Data Mapping Strategy (I) |
|
| Chang, Insu | General Motors LLC |
| Shieh, Su-Yang | General Motors LLC |
| Gupta, Shobhit | General Motors LLC |
| Zanardelli, Wesley | General Motors LLC |
Keywords: Energy systems, Estimation, Neural networks
Abstract: This paper presents a data-driven framework for accurate battery capacity estimation, addressing challenges of large storage requirements and time-scale discrepancies. The proposed method converts high-frequency time-series data into compact snapshots using a histogram-based data mapping strategy and optimal bin-sizing, reducing data storage significantly. A recurrent neural network (RNN) is employed to model capacity evolution, and the network architecture is optimized using a random search method. Evaluation on an open source NASA dataset achieved less than 1 % validation error, and the benefit scales across other battery chemistries. These results confirm the scalability and adaptability of the method for diverse applications.
|
| |
| 11:30-11:45, Paper WeA09.5 | Add to My Program |
| Accelerating Battery Charging Via Bayesian Optimization with Risk-Seeking Sampling |
|
| Zhang, Yi | Georgia Institute of Technology |
| Wang, Xizhe | Tsinghua University |
| Jiang, Benben | Tsinghua University |
Keywords: Energy systems, Optimization algorithms, Machine learning
Abstract: The global transition towards electrified transportation has increased the demand for fast and reliable charging technologies, which are essential for the widespread adoption of electric vehicles. However, enabling rapid charging without accelerating battery degradation remains a major technical challenge. Conventional methods for optimizing charging protocols, such as model-based approaches and grid search, are constrained by inaccurate battery models and the high cost of extensive experimental characterization. Bayesian optimization offers an efficient alternative, as it can learn and optimize unknown objectives with limited sampling. Nevertheless, standard acquisition functions are often designed for general-purpose tasks and may not align with the performance oriented nature of battery fast charging. This research gap motivates the development of the acquisition function better suited to high performance charging applications. We therefore propose a novel acquisition function that prioritizes best case outcomes by identifying and exploring high risk, high reward regions in the objective landscape. Experimental results on the mathematical benchmark function and the PET based battery simulator validate the effectiveness of our proposed approach.
|
| |
| 11:45-12:00, Paper WeA09.6 | Add to My Program |
| A Nonlinear Geometric Approach for Energy Management of a Cascaded Multilevel Inverter Integrating the Battery |
|
| Buenfil Solis, Guillermo | CentraleSupélec |
| Desreveaux, Anatole | CNAM |
| Iovine, Alessio | CNRS |
| Pasillas-Lepine, William | CNRS |
Keywords: Control applications, Power electronics, Automotive systems
Abstract: Multilevel inverters have recently emerged as a promising alternative for traction and stationary applications, offering advantages such as reduced losses and lower current distortion compared to conventional two-level inverters. To fully harness these benefits, suitable control strategies are required to drive the system towards desired operating states, along with a battery management system to preserve the health of the battery pack. In this work, we consider a cascaded multilevel inverter that directly integrates batteries as the energy source and propose a control technique based on a geometric approach. The method simultaneously achieves accurate machine control and balances the state of charge across modules, thereby mitigating non-uniform aging of the battery pack. The effectiveness of the proposed approach is validated through simulation studies.
|
| |
| WeA10 Regular Session, Grand Salon 15 |
Add to My Program |
| Linear Systems I |
|
| |
| Chair: Zare, Armin | University of Texas at Dallas |
| Co-Chair: Zaccarian, Luca | LAAS-CNRS |
| |
| 10:30-10:45, Paper WeA10.1 | Add to My Program |
| H2-Optimal Parameter Tuning Via Structured Feedback-Gain Basis Decomposition |
|
| Hartman, Daniel | University of Texas at Dallas |
| Zare, Armin | University of Texas at Dallas |
Keywords: Linear parameter-varying systems, Optimal control, Optimization algorithms
Abstract: Parameter tuning in structured control design faces inherent challenges due to nonconvex feasible sets, sensitivity to initialization, and the need for stabilizing starting points. We revisit this problem by recasting parameter tuning as a structured state-feedback synthesis that expands the gain over sparsity-preserving bases. We formulate a quadratic objective in the state and augmentation terms and develop two augmented-Lagrangian algorithms that efficiently compute optimal parameters in the presence of application-driven bound constraints. This framework accommodates additional convex constraints when needed, delivers high-quality warm starts for subspace gradient methods, and applies broadly whenever the dynamics depend affinely on the unknown parameters. We provide examples that illustrate the effectiveness of our approach and the proposed algorithms in solving parameter tuning and structured control design problems.
|
| |
| 10:45-11:00, Paper WeA10.2 | Add to My Program |
| Computation of Minimal Kernel Representation of a Discrete LTI Behavior |
|
| Dilip, Sanand | IIT Kharagpur |
| Khare, Swanand | Indian Institute of Technology Kharagpur |
Keywords: Linear systems, Computational methods
Abstract: We propose a novel linear algebraic approach to construct a minimal kernel representation of a discrete LTI behavior from an arbitrary kernel representation. We also present an algorithm to generate data for a restricted behavior from its kernel representation. Using this data, we construct certain Hankel matrices and obtain a minimal kernel representation from left kernels of these Hankel matrices via a modified algorithm of [Markovsky and Dorfler, IEEE Transactions on Automatic Control, vol. 68, no. 3, pp. 1667–1677, 2022]. As a consequence, we give a constructive algorithm to compute a minimal basis of a finitely generated module over a single variable polynomial ring.
|
| |
| 11:00-11:15, Paper WeA10.3 | Add to My Program |
| Identifying Network Structure of Linear Dynamical Systems: Observability and Edge Misclassification |
|
| Gill, Jaidev | University of Michigan, Ann Arbor |
| Li, Jing Shuang (Lisa) | University of Michigan |
Keywords: Linear systems, Networked control systems, Identification
Abstract: This work studies the limitations of uniquely identifying the structure (i.e., topology) of a networked linear system from partial measurements of its nodal dynamics. In general, many networks can be consistent with these measurements; this is a consideration often neglected by standard network inference methods. We show that the space of these networks are related through the nullspace of the observability matrix for the true network. We establish relevant metrics to investigate this space, including an analytic characterization of the most structurally dissimilar network that can be inferred, as well as the possibility of mis-inferring presence or absence of edges. In simulations, we find that when observing over 6% of nodes in random network models (e.g., Erdos-Renyi and Watts-Strogatz), approximately 99% of edges are correctly classified. Extending this discussion, we construct a family of networks that keep measurements epsilon-close to each other, and connect the identifiability of these networks to the spectral properties of an augmented observability Gramian.
|
| |
| 11:15-11:30, Paper WeA10.4 | Add to My Program |
| A Periodic Output Feedback Stabilization Scheme with Non-Simultaneous Sensing and Actuation |
|
| Manca, Giorgio | University of Rome, Tor Vergata |
| Sassano, Mario | University of Rome, Tor Vergata |
| Zaccarian, Luca | LAAS-CNRS |
Keywords: Hybrid systems, Linear systems, Stability of hybrid systems
Abstract: In this work, an output feedback control scheme is proposed for the stabilization of a continuous-time LTI system characterized by non-simultaneous sensing and actuation. Using a Gramian-based framework, a periodically repeating time window is divided into a sensing phase, where the state is reconstructed by filtering the output, and an actuation phase, where the state is driven to a target point balancing the control effort and the stabilization goal. The overall control architecture is cast as a well-posed hybrid dynamical system, providing a convenient tool for proving robust global exponential stability.
|
| |
| 11:30-11:45, Paper WeA10.5 | Add to My Program |
| Robust Controllable Set Computation Using Constrained Convex Generators |
|
| Silvestre, Daniel | NOVA University of Lisbon |
| P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Keywords: Linear systems, Predictive control for linear systems, Autonomous systems
Abstract: Robust Controllable (RC) sets enable safe control of dynamical systems under constraints and uncertainty. Existing approaches typically rely on polytopic representations for the computation of these sets, which suffer from conservativeness and scalability issues. Recently, Constrained Convex Generators (CCGs) were proposed to allow set-based control and analysis in presence of both ellipsoidal and polytopic state-input constraints. However, the computation of RC sets using CCGs is currently hindered because the Pontryagin difference set operation has not been developed for CCGs. In this paper, we provide theory and algorithms to address this challenge, and enable safe control under uncertainty using RC sets and CCGs. Specifically, we propose an inner approximation for the RC set using a CCG description. We show in simulations that the proposed approach improves accuracy and memory usage when compared to computations with polytopic approximations.
|
| |
| WeA11 Regular Session, Grand Salon 16 |
Add to My Program |
| Mechanical Systems |
|
| |
| Chair: Gilbert, Hunter B. | Louisiana State University |
| Co-Chair: Diaz-Mercado, Yancy | University of Maryland |
| |
| 10:30-10:45, Paper WeA11.1 | Add to My Program |
| Sequential Model Calibration of Vapor Compression Cycles Using Approximate Bayesian Computation |
|
| Zinage, Shrenik | Purdue University, West Lafayette |
| Dixit, Vaibhav | Meta |
| Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
| Qiao, Hongtao | Mitsubishi Electric Research Laboratories (MERL) |
| Laughman, Christopher R. | Mitsubishi Electric Research Labs |
| Bilionis, Ilias | Purdue University |
| Deshpande, Vedang M. | Mitsubishi Electric Research Laboratories |
Keywords: Mechanical systems/robotics, Estimation, Differential-algebraic systems
Abstract: Physics-based simulation models are essential for the development of vapor compression cycles (VCCs), the core technology behind most modern refrigeration and air conditioning systems. These models enable robust control, monitoring, fault detection and diagnostics, and digital twin technologies. However, nonlinear dynamics, high-dimensional parameter and state spaces, numerical stiffness, and the limited integration of conventional modeling environments with scientific machine learning workflows create significant challenges for efficient, transferable, and automated calibration. Existing approaches typically rely on deterministic methods and lack mechanisms for principled knowledge transfer across calibration tasks, while also failing to quantify epistemic and aleatoric uncertainty. To address these limitations, we propose a Bayesian calibration framework for VCC systems that explicitly quantifies various sources of uncertainty in model predictions. The framework supports transferability across datasets by sequentially updating informative priors from previously inferred posteriors. We implement and evaluate this approach on a commercially available high-fidelity Julia based dynamic VCC model, demonstrating its ability to successfully estimate key system parameters.
|
| |
| 10:45-11:00, Paper WeA11.2 | Add to My Program |
| Adaptive Motion Planning Via Contact-Based Intent Inference for Human-Robot Collaboration |
|
| Song, Jiurun | Texas A&M University |
| Liang, Xiao | Texas A&M University |
| Zheng, Minghui | Texas A&M University |
Keywords: Mechanical systems/robotics, Robotics, Human-in-the-loop control
Abstract: Human-robot collaboration (HRC) requires robots to adapt their motions to human intent to ensure safe and efficient cooperation in shared spaces. Although large language models (LLMs) provide high-level reasoning for inferring human intent, their application to reliable motion planning in HRC remains challenging. Physical human-robot interaction (pHRI) is intuitive but often relies on continuous kinesthetic guidance, which imposes burdens on operators. To address these challenges, a contact-informed adaptive motion-planning framework is introduced to infer human intent directly from physical contact and employ the inferred intent for online motion correction in HRC. First, an optimization-based force estimation method is proposed to infer human-intended contact forces and locations from joint torque measurements and a robot dynamics model, thereby reducing cost and installation complexity while enabling whole-body sensitivity. Then, a torque-based contact detection mechanism with link-level localization is introduced to reduce the optimization search space and to enable real-time estimation. Subsequently, a contact-informed adaptive motion planner is developed to infer human intent from contacts and to replan robot motion online, while maintaining smoothness and adapting to human corrections. Finally, experiments on a 7-DOF manipulator demonstrate the accuracy of the proposed force estimation method with a mean absolute joint-torque estimation error of 0.665 Nm, and verify the effectiveness of the contact-informed adaptive motion planner under perception uncertainty in HRC.
|
| |
| 11:00-11:15, Paper WeA11.3 | Add to My Program |
| Input Shaping for Point-To-Point Motion with a Continuum Robot Arm |
|
| Hernandez Ibarra, Rodolfo Alejandro | Tecnológico De Monterrey (ITESM) |
| Baker, Karan | Louisiana State University |
| Molaei, Parsa | Louisiana State University |
| Stein, Adrian | Louisiana State University |
| Gilbert, Hunter B. | Louisiana State University |
Keywords: Mechanical systems/robotics, Mechatronics, Flexible structures
Abstract: A cable-driven continuum robot arm is an underactuated mechanism and may suffer residual vibration at the end of a rest-to-rest maneuver. In this work, a time-delay filter is applied as an input shaper to the system to eliminate the excitation of vibratory modes. A non-robust and a robust time-delay filter are designed based on a linear system model and demonstrate improved response compared to a velocity-driven pulse input. Experimental results using the continuum robot validate the application of the input shaper, with reduced overshoot and settling time exemplifying the reduction in oscillation at the end of the maneuver. It is also shown that utilizing the robust shaper further improves the response of the arm in comparison to applying the non-robust shaper. These results are significant towards the precise and robust implementation of continuum robots in applications involving arbitrary end-effector trajectories.
|
| |
| 11:15-11:30, Paper WeA11.4 | Add to My Program |
| A Hierarchical Model-Free Controller for Stable Limit Cycle Generation in Underactuated Mechanical Systems |
|
| Tahir, Muhammad Rafey | National University of Sciences and Technology, Islamabad |
| Murtaza, Muhammad Ali | Information Technology University |
| Hayat, Rameez | National University of Science and Technology, Islamabad |
| Hutchinson, Seth | Northeastern University |
| Ali, Usman | De Montfort University |
Keywords: Mechanical systems/robotics, Hierarchical control, Adaptive control
Abstract: This paper presents a hierarchical model-free control framework for underactuated mechanical systems aimed at generating stable limit cycles. Our proposed control architecture consists of a high-level controller that generates reference trajectories while the low-level controller guarantees accurate tracking of these trajectories on the unactuated coordinates. Our proposed controller has desirable robustness and stability properties and we show its effectiveness through demonstrations on various underactuated mechanical systems such as rotary inverted pendulum (RIP), a segway, an acrobot and a leg-foot model on deformable ground.
|
| |
| 11:30-11:45, Paper WeA11.5 | Add to My Program |
| Task-Space Singularity Avoidance for Control Affine Systems Using Control Barrier Functions |
|
| Forghani, Kimia | University of Maryland College Park |
| Raval, Suraj | University of Maryland, College Park |
| Mair, Lamar | Manipulation and Particle Research, Weinberg Medical Physics, Inc |
| Krieger, Axel | John Hopkins University |
| Diaz-Mercado, Yancy | University of Maryland |
|
|
| |
| 11:45-12:00, Paper WeA11.6 | Add to My Program |
| A Class of Axis–Angle Attitude Control Laws for Rotational Systems |
|
| Gonçalves, Francisco | Washington State University |
| Bena, Ryan | California Institute of Technology |
| Perez Arancibia, Nestor Osvaldo | Washington State University (WSU) |
Keywords: Robotics, Mechatronics, Control applications
Abstract: We introduce a new class of attitude control laws for rotational systems; the proposed framework generalizes the use of the Euler axis–angle representation beyond quaternion-based formulations. Using basic Lyapunov stability theory and the notion of extended class K function, we developed a method for determining and enforcing the global asymptotic stability of the single fixed point of the resulting closed-loop (CL) scheme. In contrast with traditional quaternion-based methods, the introduced generalized axis–angle approach enables greater flexibility in the design of the control law, which is of great utility when employed in combination with a switching scheme whose transition state depends on the angular velocity of the controlled rotational system. Through simulation and real-time experimental results, we demonstrate the effectiveness of the developed formulation. According to the recorded data, in the execution of high-speed tumble-recovery maneuvers, the new method consistently achieves shorter stabilization times and requires lower control effort relative to those corresponding to the quaternion-based and geometric-control methods used as benchmarks.
|
| |
| WeA12 Invited Session, Grand Salon 18 |
Add to My Program |
| Optimization for Energy and Infrastructure Networks |
|
| |
| Chair: Moura, Scott | University of California, Berkeley |
| Co-Chair: De Castro, Ricardo | University of California, Merced |
| Organizer: Wang, Ruiting | University of California, Berkeley |
| Organizer: Moura, Scott | University of California, Berkeley |
| Organizer: De Castro, Ricardo | University of California, Merced |
| |
| 10:30-10:45, Paper WeA12.1 | Add to My Program |
| Joint Price and Power MPC for Peak Power Reduction at Workplace EV Charging Stations (I) |
|
| Cambronne, Thibaud | University of California, Berkeley |
| Bobick, Samuel | University of California, Berkeley |
| Zeng, Wente | TotalEnergies S.E |
| Moura, Scott | University of California, Berkeley |
Keywords: Smart grid, Human-in-the-loop control
Abstract: Demand charge, a utility fee based on an electricity customer's peak power consumption, often constitutes a significant portion of costs for commercial electric vehicle (EV) charging station operators. This paper explores control methods to reduce peak power consumption at workplace EV charging stations in a joint price and power optimization framework. We optimize a menu of price options to incentivize users to select controllable charging service. Using this framework, we propose a model predictive control approach to reduce both demand charge and overall operator costs. Through a Monte Carlo simulation, we find that our algorithm outperforms a state-of-the-art benchmark optimization strategy and can significantly reduce station operator costs.
|
| |
| 10:45-11:00, Paper WeA12.2 | Add to My Program |
| Energy Management Strategies for Electric Aircraft Charging Leveraging Active Landside Vehicle-To-Grid (I) |
|
| Vehlhaber, Finn Niklas | Eindhoven University of Technology |
| Salazar, Mauro | Eindhoven University of Technology |
Keywords: Transportation networks, Energy systems, Smart grid
Abstract: The deployment of medium-range battery electric aircraft is a promising pathway to improve the environmental footprint of air mobility. Yet such a deployment would be accompanied by significant electric power requirements at airports due to aircraft charging. Given the growing prevalence of electric vehicles and their bi-directional charging capabilities—so-called vehicle-to-grid (V2G)—we study energy buffer capabilities of parked electric vehicles to alleviate pressure on grid connections. To this end, we present energy management strategies for airports providing cost-optimal apron and landside V2G charge scheduling. Specifically, we first formulate the optimal energy management problem of joint aircraft charging and landside V2G coordination as a linear program, whereby we use partial differential equations to model the aggregated charging dynamics of the electric vehicle fleet. Second, we consider a shuttle flight network with a single hub of a large Dutch airline, real-world grid prices, and synthetic parking garage occupancy data to test our framework. Our results show that V2G at even a single airport can indeed reduce energy costs to charge the aircraft fleet: Compared to a baseline scenario without V2G, the proposed concept yields cost savings of up to 32 %, depending on the schedule and amount of participating vehicles, and has other potential beneficial effects on the local power grid, e.g., the reduction of potential power peaks.
|
| |
| 11:00-11:15, Paper WeA12.3 | Add to My Program |
| Optimal Surface Power Allocation for Sustainable Urban Electric Vehicle Growth (I) |
|
| Fierro Ulloa, Joel Ignacio | Centre Inria De l'Université Grenoble Alpes |
| Canudas de Wit, Carlos | CNRS, GIPSA-Lab |
Keywords: Optimization, Multivehicle systems, Large-scale systems
Abstract: The rapid transition to electric mobility requires not only the deployment of charging infrastructure but also the strategic allocation of power within urban electricity networks. While most existing studies address charger siting and network coverage, this work focuses on the complementary problem of how much power should be distributed across different areas of a city — a challenge we define as the Surface Power Allocation Problem (SPAP). We introduce a macroscopic modeling framework in which the urban landscape is partitioned into Urban Surface Units (USUs), each treated as a virtual charging node with aggregated demand. Using an origin–destination graph of mobility flows and a dynamic nonlinear representation of electric vehicle state-of-charge evolution, we estimate spatial power requirements consistent with long-term sustainability objectives. The resulting optimization problem is formulated as a large-scale nonlinear program, which we solve through a relaxation and gradient-based approach to provide computationally efficient solutions.
|
| |
| 11:15-11:30, Paper WeA12.4 | Add to My Program |
| Regulating EV Charging Markets for Fairness: Incentives for Pricing and Capacity Decisions (I) |
|
| Wang, Ruiting | University of California, Berkeley |
| Hu, Kita | University of California, Berkeley |
| Yu, Yitong | Nanyang Technological University, Singapore |
| Moura, Scott | University of California, Berkeley |
Keywords: Transportation networks, Optimization algorithms, Game theory
Abstract: The transition to electric mobility calls for charging infrastructure that is both efficient and socially equitable. This paper examines fairness in electric vehicle (EV) charging station pricing and capacity through a game-theoretic perspective. We model a non-cooperative market in which competing charging service providers set prices and capacities while customers choose stations based on generalized cost, leading to a market equilibrium. We then benchmark this decentralized outcome against an idealized planner solution that jointly optimizes efficiency and equity. To align market outcomes with socially desirable goals, we design targeted incentives that guide operators toward more fair charger placement. Case studies demonstrate that unregulated competition tends to exacerbate disparities in charger access across demographic groups, whereas carefully calibrated incentives can reduce inequities without significant efficiency loss. The framework provides insights for policymakers on reconciling free-market dynamics with the broader societal goals of fairness in electrified mobility systems.
|
| |
| 11:30-11:45, Paper WeA12.5 | Add to My Program |
| Energy-Aware Integrated Proactive Maintenance Planning and Production Scheduling (I) |
|
| Li, Hongliang | Pennsylvania State University |
| Pangborn, Herschel | The Pennsylvania State University |
| Kovalenko, Ilya | Pennsylvania State University |
Keywords: Manufacturing systems, Hierarchical control, Optimal control
Abstract: Demand-side energy management, such as the real-time pricing (RTP) program, offers manufacturers opportunities to reduce energy costs by shifting production to low-price hours. However, this strategy is challenging to implement when machine degradation is considered, as degraded machines have decreased processing capacity and increased energy consumption. Proactive maintenance (PM) can restore machine health but requires production downtime, creating a challenging trade-off: scheduling maintenance during low-price periods sacrifices energy savings opportunities, while deferring maintenance leads to capacity losses and higher energy consumption. To address this challenge, we propose a hierarchical bi-level control framework that jointly optimizes PM planning and runtime production scheduling, considering the machine degradation. A higher-level optimization, with the lower-level model predictive control (MPC) embedded as a subproblem, determines PM plans that minimize total operational costs under day-ahead RTP. At runtime, the lower-level MPC executes closed-loop production scheduling to minimize energy costs under realized RTP, meeting delivery targets. Simulation results from a lithium-ion battery pack assembly line case study demonstrate that the framework strategically shifts PM away from bottlenecks and high-price hours, meeting daily production targets while reducing energy costs.
|
| |
| 11:45-12:00, Paper WeA12.6 | Add to My Program |
| Fairness-Aware Management of Electric Vehicle Charging Stations (I) |
|
| Monteiro, Paulo | UC Merced |
| Williams, Chase | University of California, Merced |
| Moyalan, Joseph | University of California, Merced |
| Chen, YangQuan | University of California, Merced |
| De Castro, Ricardo | University of California, Merced |
Keywords: Optimization, Optimization algorithms, Energy systems
Abstract: This paper presents an optimization framework for scheduling electric vehicle (EV) charging at public stations, with the aim of minimizing overall user dissatisfaction while taking into account arrival time, charging duration, and power demand. To promote fairness, the framework differentiates between high-priority users (those without access to home or workplace charging) and low-priority users (those with such access). In addition, it ensures contiguous charging time slots by incorporating trigger functions into the optimization model. The problem is formulated as a Mixed Integer Linear Program (MILP). The effectiveness of the proposed approach is demonstrated through simulations using EV charger data from the UC Merced campus parking lot.
|
| |
| WeA13 Regular Session, Grand Salon 19 |
Add to My Program |
| Autonomous Systems I |
|
| |
| Chair: Chen, Fei | Massachusetts Institute of Technology |
| Co-Chair: Samanipour, Pouya | University of Kentucky |
| |
| 10:30-10:45, Paper WeA13.1 | Add to My Program |
| Verification Framework for the Union of Control Barrier Functions |
|
| Jiang, Chuanrui | Washington University in St. Louis |
| Clark, Andrew | Washington University in St. Louis |
Keywords: Autonomous systems, Constrained control, Switched systems
Abstract: Control Barrier Functions (CBFs) have been proposed to ensure safety of autonomous systems. This paper considers control policies that switch between CBF constraints. Under this approach, we represent a complex non-convex safe region as a union of sets that are computationally tractable to verify. We denote this framework as union-CBFs and make the following contributions. First, considering switching CBF-QP controllers, we propose a sufficient condition that ensures (i) the system undergoes a finite number of switches in any finite time interval and ensures (ii) the forward invariance of the closed-loop system in between switches. Second, we consider two types of switching strategies and propose union-CBFs conditions for each strategy to satisfy (i) and (ii). Third, we formulate Sum-of-Squares (SOS) algorithms to verify the conditions. The experiments show that our union-CBFs framework results in a larger safe region compared to high-degree polynomial CBFs. We also show the efficiency of the verification algorithms using a polynomial system model.
|
| |
| 10:45-11:00, Paper WeA13.2 | Add to My Program |
| ReLU Barrier Functions for Nonlinear Systems with Constrained Control: A Union of Invariant Sets Approach |
|
| Samanipour, Pouya | Univ. of Kentucky |
| Poonawala, Hasan A. | Univ. of Kentucky |
Keywords: Autonomous systems, Formal verification/synthesis, Hybrid systems
Abstract: Certifying safety for nonlinear systems with polytopic input constraints is challenging because control barrier function (CBF) synthesis must ensure control admissibility under input saturation. We propose an approximation-verification pipeline that performs convex barrier synthesis on piecewise-affine (PWA) surrogates and certifies safety for the original nonlinear system through facet-wise verification. To reduce conservatism while preserving tractability, we use a two-slope Leaky ReLU surrogate for the extended class-K function alpha(.) and combine multiple certificates using Union of Invariant Sets (UIS). Counterexamples are handled through local uncertainty updates. Simulations on pendulum and cart-pole systems with input saturation show larger certified invariant sets than linear-alpha designs while maintaining tractable computation time.
|
| |
| 11:00-11:15, Paper WeA13.3 | Add to My Program |
| Cooperative Visual-Inertial Navigation with Inter-Robot Coordinate Alignment |
|
| Liu, Jiayang | The University of Melbourne |
| Zhou, Yizhi | Geroge Mason University |
Keywords: Estimation, Autonomous robots, Kalman filtering
Abstract: This paper focuses on multi-robot navigation in GPS-denied environments, where accurate localization is essential for tasks such as environmental monitoring and search-and-rescue. While Visual–Inertial Navigation Systems (VINS) provide lightweight 6-DoF state estimation for single robots, extending them to multi-robot settings introduces the critical challenge of inter-robot frame alignment. Existing approaches often assume ground-truth initialization or overlook this issue, limiting their practicality. We propose a cooperative multi-robot VINS framework that augments the inter-robot transformation as part of the state and jointly estimates it online together with each robot’s trajectory. Crucially, our method leverages common observations as additional constraints, ensuring consistent frame alignment and enabling effective cross-robot fusion. Monte Carlo simulations on 3-D trajectories demonstrate that the proposed framework achieves lower error and improved consistency compared with single-robot baselines, validating its effectiveness for real-world multi-robot localization.
|
| |
| 11:15-11:30, Paper WeA13.4 | Add to My Program |
| Warm-Starting Optimization-Based Motion Planning for Robotic Manipulators Via Point Cloud-Conditioned Flow Matching |
|
| Tian, Sibo | Texas A&M University |
| Zheng, Minghui | Texas A&M University |
| Liang, Xiao | Texas A&M University |
Keywords: Emerging control applications, Mechanical systems/robotics, Intelligent systems
Abstract: Rapid robot motion generation is critical in Human-Robot Collaboration (HRC) systems, as robots need to respond to dynamic environments in real time by continuously observing their surroundings and replanning their motions to ensure both safe interactions and efficient task execution. Current sampling-based motion planners face challenges in scaling to high-dimensional configuration spaces and often require post-processing to interpolate and smooth the generated paths, resulting in time inefficiency in complex environments. Optimization-based planners, on the other hand, can incorporate multiple constraints and generate smooth trajectories directly, making them potentially more time-efficient. However, optimization-based planners are sensitive to initialization and may get stuck in local minima. In this work, we present a novel learning-based method that utilizes a Flow Matching model conditioned on a single-view point cloud to learn near-optimal solutions for optimization initialization. Our method does not require privileged knowledge of the environment, such as obstacle locations and geometries, and can generate feasible trajectories directly from single-view depth camera input. Simulation studies on a UR5e robotic manipulator in cluttered workspaces demonstrate that the proposed generative initializer achieves a high success rate on its own, significantly improves the success rate of trajectory optimization compared with traditional and learning-based benchmark initializers, requires fewer optimization iterations, and exhibits strong generalization to unseen environments.
|
| |
| 11:30-11:45, Paper WeA13.5 | Add to My Program |
| Context-Aware LLM-Based Safe Control against Latent Risks |
|
| Deng, Xiyu | Carnegie Mellon University |
| Luu, Quan | Purdue University |
| Ho, Van | JAIST |
| Nakahira, Yorie | Carnegie Mellon University |
Keywords: Autonomous systems, Uncertain systems, Optimization
Abstract: Autonomous control systems face significant challenges in executing complex tasks under latent risks—risks arising from occluded agents or partially observable dynamics. To address this, we propose a framework that integrates Large Language Models (LLMs), numerical optimization, and optimization-based control to enable context-aware subtask generation and refinement under latent risks. The framework operates across multiple timescales: (i) in-context learning with LLMs for leveraging past experience, (ii) multi-turn Chain-of-Thought reasoning with numerical optimization for refining subtask parameters, and (iii) real-time feedback control via model predictive control (MPC). Complex tasks are decomposed into subtasks that explicitly account for latent risks, refined through multi-turn optimization in physics-based simulation, and improved over time by accumulating failure-related examples for in-context learning. We validate the framework through simulation-based case studies involving robots and autonomous vehicles using GPT-4o, demonstrating its feasibility and potential to complement MPC with language-driven reasoning for safer decision-making in latent-risk scenarios.
|
| |
| 11:45-12:00, Paper WeA13.6 | Add to My Program |
| AuDeRe: Automated Strategy Decision and Realization in Robot Planning and Control Via LLMs |
|
| Meng, Yue | MIT |
| Chen, Fei | Massachusetts Institute of Technology |
| Chen, Yongchao | Harvard University |
| Fan, Chuchu | Massachusetts Institute of Technology |
Keywords: Autonomous systems, Robotics
Abstract: Recent advancements in large language models (LLMs) have shown significant promise in various domains, especially robotics. However, most prior LLM-based work in robotic applications either directly predicts waypoints or applies LLMs within fixed tool integration frameworks, offering limited flexibility in exploring and configuring solutions best suited to different tasks. In this work, we propose a framework that leverages LLMs to select appropriate planning and control strategies based on task descriptions, environmental constraints, and system dynamics. These strategies are then executed by calling the available comprehensive planning and control APIs. Our approach employs iterative LLM-based reasoning with performance feedback to refine the algorithm selection. We validate our approach through extensive experiments across tasks of varying complexity, from simple tracking to complex planning scenarios involving spatiotemporal constraints. The results demonstrate that using LLMs to determine planning and control strategies from natural language descriptions significantly enhances robotic autonomy while reducing the need for extensive manual tuning and expert knowledge. Furthermore, our framework maintains generalizability across different tasks and notably outperforms baseline methods that rely on LLMs for direct trajectory, control sequence, or code generation.
|
| |
| WeA14 Regular Session, Grand Salon 21 |
Add to My Program |
| Data Driven Control |
|
| |
| Chair: Iwata, Takumi | Hiroshima University |
| Co-Chair: Ibuki, Tatsuya | Meiji University |
| |
| 10:30-10:45, Paper WeA14.1 | Add to My Program |
| Recursive Gaussian Process Based Safety Assurance Exploration Control in Unknown Environments |
|
| Kanou, Kento | Meiji University |
| Ibuki, Tatsuya | Meiji University |
Keywords: Data driven control, Machine learning, Constrained control
Abstract: This article presents a novel method for environmental exploration that takes safety into account in unknown areas by using recursive Gaussian process regression (RGPR). Safety in unknown environments is ensured by an RGPR-based control barrier function, which is constructed non-parametrically from the posterior mean of RGPR. Simultaneously, an exploration function inspired by a control Lyapunov function is presented. This function utilizes the posterior variance of RGPR to achieve efficient exploration. Moreover, the use of RGPR mitigates the cubic growth in computational cost inherent to conventional Gaussian process regression, thereby enabling online learning. To validate the effectiveness of the proposed method, an experiment using differential-drive robots equipped with a LiDAR sensor is conducted.
|
| |
| 10:45-11:00, Paper WeA14.2 | Add to My Program |
| A Small-Gain Look at Cyber-Physical Security |
|
| Chakraborty, Sayan | New York University |
| Jiang, Zhong-Ping | New York University |
Keywords: Data driven control, Optimal control, Networked control systems
Abstract: This paper studies the resilience of cyber-physical systems under denial-of-service attacks. We develop a novel framework for resilient control that avoids the need for detailed information about the system or attacker dynamics by treating the plant–attacker interaction as an interconnected system. Using small-gain analysis and switching systems theory, we derive explicit resilience conditions, and employ reinforcement learning to synthesize an optimal policy directly from input–state data, estimating the required small-gain bounds in a data-driven manner. A numerical example illustrates the effectiveness of the proposed approach.
|
| |
| 11:00-11:15, Paper WeA14.3 | Add to My Program |
| Robust Data-Driven Receding-Horizon Control for LQR with Input Constraints |
|
| Zheng, Jian | Northeastern University |
| Sznaier, Mario | Northeastern University |
Keywords: Data driven control, Optimal control, Predictive control for linear systems
Abstract: This letter presents a robust data-driven receding-horizon control framework for the discrete-time linear quadratic regulator (LQR) with input constraints. Unlike earlier data-driven approaches that design a controller from initial data and apply it unchanged throughout the trajectory, our method exploits all available execution data in a receding-horizon manner, thereby capturing additional information about the unknown system and enabling less conservative performance. Existing data-driven LQR model predictive control methods rely on over-approximations of the consistency set and ell_2 descriptions of noise. In contrast, the proposed approach uses exact descriptions of the consistency set under ell_infty-bounded noise, leveraging duality to recast the problem into a tractable convex optimization. Further, the proposed controller renders the closed-loop system input-to-state stable. Simulation results demonstrate the effectiveness of the method.
|
| |
| 11:15-11:30, Paper WeA14.4 | Add to My Program |
| Gain-Scheduled Data-Enabled Predictive Control: A DeePC Approach for Nonlinear Systems |
|
| Guerrero, Margarita A. | KTH Royal Institute of Technology |
| Lakshminarayanan, Braghadeesh | KTH Royal Institute of Technology |
| Rojas, Cristian R. | KTH Royal Institute of Technology |
Keywords: Data driven control, Predictive control for nonlinear systems, Linear parameter-varying systems
Abstract: Model predictive control is a well established control technology for trajectory tracking. Its use requires the availability of an accurate model of the plant, but obtaining such a model is often time consuming and costly. Data-Enabled Predictive Control (DeePC) addresses this shortcoming in the linear time-invariant setting, by skipping the model building step and instead relying directly on input-output data. Unfortunately, many real systems are nonlinear and exhibit strong operating-point dependence. Building on classical linear parameter-varying control, we introduce DeePC-GS, a gain-scheduled extension of DeePC for unknown, regime-varying systems. The key idea is to allow DeePC to switch between different local Hankel matrices—selected online via a measurable scheduling variable—thereby uniting classical gain scheduling tools with identification-free, data-driven MPC. We test the effectiveness of our DeePC-GS formulation on a nonlinear ship-steering benchmark, demonstrating that it outperforms state-of-the-art data-driven MPC while maintaining tractable computation.
|
| |
| 11:30-11:45, Paper WeA14.5 | Add to My Program |
| Data-Fused MPC with Guarantees: Application to Flying Humanoid Robots |
|
| Gorbani, Davide | Italian Institute of Technology |
| Elobaid, Mohamed | King Abdullah University of Science and Technology |
| L'Erario, Giuseppe | Istituto Italiano Di Tecnologia |
| Mohamed, Hosameldin Awadalla Omer | Italian Institute of Technology |
| Pucci, Daniele | Istituto Italiano Di Tecnologia |
Keywords: Data driven control, Robotics, Predictive control for linear systems
Abstract: This paper introduces a Data-Fused Model Predictive Control (DFMPC) framework that combines physics-based models with data-driven representations of unknown dynamics. Leveraging Willems’ Fundamental Lemma and an artificial equilibrium formulation, the method enables tracking of changing, potentially unreachable setpoints while explicitly handling measurement noise through slack variables and regularization. We provide guarantees of recursive feasibility and practical stability under input–output constraints for a specific class of reference signals. The approach is validated on the iRonCub flying humanoid robot, integrating analytical momentum models with data-driven turbine dynamics. Simulations show improved tracking and robustness compared to a purely model-based MPC, while maintaining real-time feasibility.
|
| |
| 11:45-12:00, Paper WeA14.6 | Add to My Program |
| Data Informativity for Analysis and Design of Positive Systems |
|
| Iwata, Takumi | Hiroshima University |
| Azuma, Shun-ichi | Kyoto University |
| Nagahara, Masaaki | Hiroshima University |
| Peaucelle, Dimitri | LAAS-CNRS, Université De Toulouse |
| Ebihara, Yoshio | Kyushu University |
Keywords: Compartmental and Positive systems, Data driven control, Constrained control
Abstract: This paper investigates data informativity of positive systems using linear programming (LP). The concept called data informativity represents the sufficiency of a given dataset to solve analysis/design problems. In this paper, we provide the necessary and sufficient conditions for the data-driven analysis and design problems of positive systems to be solvable. Moreover, we clarify that these conditions are characterized by LP problems.
|
| |
| WeA15 Regular Session, Grand Salon 22 |
Add to My Program |
| Lyapunov Methods |
|
| |
| Chair: Goel, Ankit | University of Maryland Baltimore County |
| Co-Chair: Allen, Brendon C. | Auburn University |
| |
| 10:30-10:45, Paper WeA15.1 | Add to My Program |
| Sampling-Based Efficient Receding Horizon Control Barrier Functions |
|
| Li, Anni | The University of North Carolina at Charlotte |
| Xiao, Wei | WPI |
Keywords: Lyapunov methods, Constrained control, Optimal control
Abstract: Control Barrier Functions (CBFs) are widely used in guaranteeing safety for nonlinear systems. They can conservatively map a constrained optimal control problem into a sequence of point-wise Quadratic Programs (QPs) for affine control systems. One of the challenges with the CBF-based point-wise optimization is its myopic property: optimality and safety are only considered at the current time instant. This could make the system overly aggressive and susceptible to infeasibility in the optimization. To address this issue, we propose a sampling method for CBFs to formulate a receding horizon QP. Specifically, at each iteration, we first sample a trajectory based on the current state by a tractable planning method, and then incorporate it into nonlinear dynamics and constraints to obtain piecewise linear approximations that are then enforced by CBFs within the planning horizon. The eventual optimization problem is reformulated into a sequence of receding horizon QPs instead of complex nonlinear programs. We show that the proposed framework can still guarantee safety while staying close to the (optimal) planning trajectory with minimum deviation. We illustrate our approach on traffic merging and robot obstacle avoidance, and compare it with other methods to show its advantage in efficiency and guarantees.
|
| |
| 10:45-11:00, Paper WeA15.2 | Add to My Program |
| Accelerating Lyapunov-Stable Neural Control Using Fulfillment Priority Logic |
|
| Abdelgawad, Abdelrahman | Boston University |
| El Mabsout, Bassel | Boston University |
| Wang, Zili | Boston University |
| Mancuso, Renato | Boston University |
| Andersson, Sean B. | Boston University |
| Tron, Roberto | Boston University |
Keywords: Lyapunov methods, Formal verification/synthesis, Neural networks
Abstract: We present a two-stage approach for learning stability-certified neural controllers that achieves a reduction of up to ~95% in training time compared to the state-of-the-art baseline, which introduced monotonic neural Lyapunov architectures. Our method combines monotonic neural Lyapunov functions with fulfillment priority logic (FPL) to efficiently initialize controllers before formal verification. Traditional approaches for jointly learning controllers and neural Lyapunov functions require computationally expensive mixed-integer linear programming (MILP) or satisfiability modulo theory (SMT) solvers at each training iteration, often taking several hours to converge. We address this bottleneck by leveraging FPL to perform early joint initialization of the controller and Lyapunov networks. Building on the monotonic neural network architecture from the baseline, which guarantees non-negativity and a unique global minimum by construction, our method focuses on efficiently satisfying the remaining property of decreasing along trajectories. Existing works focus on maximizing the region of attraction/convergence of the learned controller. In contrast, leveraging FPL allows us to (1) increase learning efficiency substantially and (2) focus on complementary performance metrics, such as convergence rate and control effort minimization, thereby adding significant specification flexibility. In this paper, we encode an approximate Lyapunov-decrease condition in FPL to pre-train the controller and Lyapunov networks, then apply a MILP-based verification/refinement step. This decouples efficient learning from certificate enforcement and allows the FPL specification to include auxiliary objectives (e.g., convergence rate and control effort), whose influence persists through the final MILP pass. The resulting controllers converge rapidly while admitting formal Lyapunov certificates on standard nonlinear control benchmarks.
|
| |
| 11:00-11:15, Paper WeA15.3 | Add to My Program |
| Augmented Lyapunov-Net: Incorporating Adversarial Verifier for the Construction of Lyapunov Functions Using Neural Networks |
|
| Zhao, Yuxuan | Hong Kong University of Science and Technology |
| Wang, Junkai | Georgia Institute of Technology |
| Zhang, Fumin | Hong Kong University of Science and Technology |
Keywords: Lyapunov methods, Neural networks, Learning
Abstract: Lyapunov functions are pivotal for analyzing the stability of dynamical systems. While neural networks have emerged as effective approximators, existing methods often struggle with hard-to-fit regions and scale poorly to high-dimensional systems. To address these limitations, this paper introduces the Augmented Lyapunov-Net, a novel framework that integrates a modified CounterExample Guided Inductive Synthesis (CEGIS) architecture. Our framework features a Learner, implemented by the Lyapunov-Net, and an Adversarial Verifier, which actively employs gradient-free optimization to pinpoint states that violate the Lie derivative condition. These strategically discovered counterexamples then guide a focused sampling procedure, enabling the Learner to efficiently refine the candidate function in critical regions. Numerical experiments on several benchmark systems demonstrate that the proposed method achieves higher approximation efficiency to higher-dimensional systems compared to the original Lyapunov-Net.
|
| |
| 11:15-11:30, Paper WeA15.4 | Add to My Program |
| Stability Preserving Safe Control of a Bicopter |
|
| Portella Delgado, Jhon Manuel | University of Maryland Baltimore County |
| Goel, Ankit | University of Maryland Baltimore County |
Keywords: Lyapunov methods, Stability of nonlinear systems, Constrained control
Abstract: This paper presents a control law for stabilization and trajectory tracking of a multicopter subject to safety constraints. The proposed approach guarantees forward invariance of a prescribed safety set while ensuring smooth tracking performance. Unlike conventional control barrier function methods, the constrained control problem is transformed into an unconstrained one using state-dependent mappings together with carefully constructed Lyapunov functions. This approach enables explicit synthesis of the control law, instead of requiring a solution of constrained optimization at each step. The transformation also enables the controller to enforce safety without sacrificing stability or performance. Simulation results for a polytopic reference trajectory confined within a designated safe region demonstrate the effectiveness of the proposed method.
|
| |
| 11:30-11:45, Paper WeA15.5 | Add to My Program |
| A Lyapunov and Concurrent Learning-Based Approach to Train the Weights of a DNN-Based Controller in Real-Time |
|
| Basyal, Sujata | Auburn University |
| Ting, Jonathan | Auburn University |
| Mishra, Kislaya | Auburn University |
| Allen, Brendon C. | Auburn University |
Keywords: Lyapunov methods, Stability of nonlinear systems, Neural networks
Abstract: Deep neural networks (DNNs) are a function approximation tool that are capable of approximating complex dynamic functions by learning complex relationships between the input-output data through training and optimization. Classical DNN training methods utilize numerical optimization tools to improve the function approximation performance. However, in most cases, DNN-based controllers are implemented in an openloop manner, meaning that the DNN model in the controller does not adapt its weights online and therefore fails to take into consideration any unexpected behavior that could hinder system stability and robustness. To tackle this issue, the authors recently leveraged Lyapunov-based methods to develop innovative realtime update laws that train a DNN’s weights in real-time, ensuring stability and improved performance. In this paper, the performance and stability result for DNN-based controllers are further improved by incorporating a Lyapunov-based concurrent learning (CL) inspired term within the real-time DNN update law. A rigorous Lyapunov-based stability analysis was performed to ensure that the proposed DNN-based controller and CL augmented update law results in global exponential convergence of the tracking errors and the DNN weight estimation errors towards an ultimate bound. Thus, for the first time, the proposed approach will ensure both trajectory tracking and convergence of the DNN weights towards their optimal values. Furthermore, simulations were performed that demonstrate the significant potential of the developed DNN-based control system.
|
| |
| 11:45-12:00, Paper WeA15.6 | Add to My Program |
| A Nonlinear Robust Controller Formulation for a Class of Systems Subject to Hysteresis |
|
| Hindistan, Cagri | Ege University |
| Selim, Erman | Ege University |
| Tatlicioglu, Enver | Ege University |
| Zergeroglu, Erkan | Gebze Technical University |
Keywords: Lyapunov methods
Abstract: This work presents the design and corresponding analysis of a robust tracking controller for a class of nonlinear systems subject to hysteresis. Specifically, the hysteresis effects are assumed to be in the form of the Bouc-Wen model and despite the lack of accurate knowledge of system dynamics and hysteresis model parameters, a continuous robust controller is designed via the use of the robustness of integral of sign of the error feedback. The stability of the closed--loop system and the convergence of the tracking error are ensured via Lyapunov based arguments. The tracking performance of the proposed control framework is investigated by numerical simulation studies for a second order system.
|
| |
| WeA16 Regular Session, Grand Salon 24 |
Add to My Program |
| Reinforcement Learning I |
|
| |
| Chair: Mitra, Aritra | North Carolina State University |
| Co-Chair: Talebi, Shahriar | University of California, Los Angeles |
| |
| 10:30-10:45, Paper WeA16.1 | Add to My Program |
| Training Task Reasoning LLM Agents for Multi-Turn Task Planning Via Single-Turn Reinforcement Learning |
|
| Hu, Hanjiang | Carnegie Mellon University |
| Liu, Changliu | Carnegie Mellon University |
| Li, Na | Harvard University |
| Wang, Yebin | Mitsubishi Electric Research Labs |
Keywords: Reinforcement learning, Machine learning, Emerging control applications
Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in knowledge acquisition, reasoning, and tool use, making them promising candidates for autonomous agent applications. However, training LLM agents for complex multi-turn task planning faces significant challenges, including sparse episode-wise rewards, credit assignment across long horizons, and the computational overhead of reinforcement learning in multi-turn interaction settings. To this end, this paper introduces a novel approach that transforms multi-turn task planning into single-turn task reasoning problems, enabling efficient policy optimization through Group Relative Policy Optimization (GRPO) with dense and verifiable reward from expert trajectories. Our theoretical analysis shows that GRPO improvement on single-turn task reasoning results in higher multi-turn success probability under the minimal turns, as well as the generalization to subtasks with shorter horizons. Experimental evaluation on the complex task planning benchmark demonstrates that our 1.5B parameter model trained with single-turn GRPO achieves superior performance compared to larger baseline models up to 14B parameters, with success rates of 70% for long-horizon planning tasks with over 30 steps. We also theoretically and empirically validate the strong cross-task generalizability that the models trained on complex tasks can lead to the successful completion of all simpler subtasks.
|
| |
| 10:45-11:00, Paper WeA16.2 | Add to My Program |
| Robust Federated Q-Learning with Almost No Communication |
|
| Maity, Sreejeet | North Carolina State University, Raleigh |
| Mitra, Aritra | North Carolina State University |
Keywords: Reinforcement learning, Machine learning, Optimization algorithms
Abstract: We consider a federated reinforcement learning setting involving M agents, all of whom interact with a common Markov Decision Process (MDP). The agents exchange information via a central server to learn the optimal value function. Our goal is to understand to what extent one can hope for collaborative sample-complexity speedups in such a setting, when a small fraction of the agents are adversarial and can act arbitrarily. To that end, we propose Robust Fed-Q, a federated Q-learning algorithm that blends ideas from both model-based and model-free RL, along with the median-of-means device from robust statistics. We prove that despite corruption, with high-probability, Robust Fed-Q (i) guarantees emph{exact} convergence to the optimal value function in the limit of infinite samples, and (ii) enjoys near-optimal finite-time rates that benefit from collaboration. In addition, our approach requires just tilde{O}(1) rounds of communication to achieve each of the above guarantees, a feature of independent interest in FL where communication is the major bottleneck.
|
| |
| 11:00-11:15, Paper WeA16.3 | Add to My Program |
| The Deception Magic: Using Reward Machines for Privacy-Preserving Reinforcement Learning |
|
| Meshkat Alsadat, Shayan | Arizona State University |
| Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning, Markov processes
Abstract: In reinforcement learning (RL) scenarios where agents must preserve privacy about their objectives, deceptive behavior becomes essential to prevent adversarial observers from inferring the agent’s true intentions. Unlike existing approaches that often assume Markovian reward functions or require model-based planning, we propose a model-free deceptive reinforcement learning framework using reward machines (RMs) that operates under adversarial settings. We introduce a belief-induced reward mechanism for deceptive RL using RMs, where agents balance ground truth task completion with entropy maximization over observer beliefs about candidate reward functions. RMs allow us to encode complex temporal dependencies in non-Markovian reward structures and guide the deceptive learning process through structured task representation. Our methods introduce Q-learning and actor-critic algorithms with reward machines to learn optimal deceptive policies that distract observers while maintaining task completion objectives, ensuring effective privacy preservation without compromising primary goals. We establish theoretical guarantees, demonstrating that our algorithms converge to optimal deceptive policies with privacy preservation properties. We further evaluate our methods against baselines in experimental case studies to demonstrate their effectiveness in balancing deception and task performance.
|
| |
| 11:15-11:30, Paper WeA16.4 | Add to My Program |
| Hereditary Geometric Meta-RL: Non-Local Generalization Via Task Symmetries |
|
| Nitschke, Paul | Harvard University |
| Talebi, Shahriar | University of California, Los Angeles |
Keywords: Reinforcement learning, Mechanical systems/robotics, Robotics
Abstract: Meta-Reinforcement Learning (Meta-RL) commonly generalizes via smoothness in the task encoding. While this enables local generalization around each training task, it requires dense coverage of the task space and leaves richer task space structure untapped. In response, we develop a geometric perspective that endows the task space with a "hereditary geometry" induced by the inherent symmetries of the underlying system. Concretely, the agent reuses a policy learned at the train time by transforming states and actions through actions of a Lie group. This converts Meta-RL into symmetry discovery rather than smooth extrapolation, enabling the agent to generalize to wider regions of the task space. We show that when the task space is inherited from the symmetries of the underlying system, the task space embeds into a subgroup of those symmetries whose actions are linearizable, connected, and compact--properties that enable efficient learning and inference at the test time. To learn these structures, we develop a differential symmetry discovery method. This collapses functional invariance constraints and thereby improves numerical stability and sample efficiency over functional approaches. Empirically, on a two-dimensional navigation task, our method efficiently recovers the ground-truth symmetry and generalizes across the entire task space, while a common baseline generalizes only near training tasks.
|
| |
| 11:30-11:45, Paper WeA16.5 | Add to My Program |
| Nash Q-Learning with Inferring Causal Signal Temporal Logic: A Study of Competitive Multi-Agent Systems |
|
| Partovi Aria, Hadi | Arizona State University |
| Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning
Abstract: Multi-agent reinforcement learning with temporal logic presents unique challenges, particularly in competitive settings where agents pursue conflicting objectives. This paper presents NASTL-CIRL (Nash Signal Temporal Logic for Causal Inference in Reinforcement Learning), a novel approach that combines Nash Q-Learning with Causal Signal Temporal Logic (Causal STL) inference to guide agent behavior in competitive environments. Our approach enables agents to infer causal relationships through STL formulas that explain agent behavior and environmental dynamics, providing strategic advantages. Pacman Game scenario and Factory Assembly scenario are conducted to show the superior performance of the proposed NASTL-CIRL algorithm compared to baseline methods. Our results show that integrating causal STL inference with Nash equilibrium concepts leads to more structured and interpretable agent behaviors while maintaining competitive performance.
|
| |
| 11:45-12:00, Paper WeA16.6 | Add to My Program |
| Lipschitz-Aware Exploration for Safety in Reinforcement Learning |
|
| Jha, Mayank Shekhar | University of Lorraine |
| Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
| Marthi, Satya Vinay Chavan | CRAN |
| Theilliol, Didier | Universite De Lorraine |
Keywords: Reinforcement learning, Neural networks, Optimal control
Abstract: This paper develops a novel Lipschitz-aware safe exploration framework for reinforcement learning in environments with abrupt, unmodeled safety variations. Local Lipschitz constants of the safety function are estimated online using kernel density estimation (KDE), providing a data-driven measure of rapid changes in the safety landscape. These estimates are incorporated into a robust quadratic program (QP) with Lipschitz-aware control barrier function (CBF) constraints, yielding a safe exploration law that guarantees forward invariance of an enlarged safe set under probing noise. The exploration mechanism is then coupled with a safety-aware learning stage to obtain a unified safe RL framework. Simulations on an inverted pendulum illustrate the efficacy of the proposed approach.
|
| |
| WeA17 Invited Session, Churchill A1 |
Add to My Program |
| Healthcare and Medical Systems (I) |
|
| |
| Chair: Pereira, Emily | Texas Tech University |
| Co-Chair: Hahn, Jin-Oh | University of Maryland |
| Organizer: Menezes, Amor A. | University of Florida |
| Organizer: Hahn, Jin-Oh | University of Maryland |
| Organizer: Medvedev, Alexander V. | Uppsala University |
| Organizer: Mesbah, Ali | University of California, Berkeley |
| Organizer: Pereira, Emily | Texas Tech University |
| Organizer: Zhang, Wenlong | Arizona State University |
| |
| 10:30-10:45, Paper WeA17.1 | Add to My Program |
| Analytical Characterization of Inter-Spike Interval Statistics in Excitatory–inhibitory Neuronal Networks (I) |
|
| Gambrell, Oliver | University of Delaware |
| Bokes, Pavol | Comenius University |
| Singh, Abhyudai | University of Delaware |
Keywords: Biological systems
Abstract: A key component of intraneuronal communication is the modulation of postsynaptic firing frequencies by stochastic transmitter release from presynaptic neurons. The time interval between successive postsynaptic firings is called the inter-spike interval (ISI), and understanding its statistics is integral to neural information processing. We start with a model of an excitatory chemical synapse with postsynaptic neuron firing governed as per a classical integrate-and-fire model. Using a first-passage time framework, we derive exact analytical results for the ISI statistical moments, revealing parameter regimes driving precision in postsynaptic action potential timing. Next, we extended this analysis to include both an excitatory and an inhibitory presynaptic connection onto the same postsynaptic neuron. We consider both a fixed postsynaptic-firing threshold and a threshold that adapts based on the postsynaptic membrane potential history. Our analysis shows that the latter adaptive threshold can result in scenarios where increasing the inhibitory input frequency increases the postsynaptic firing frequency. Moreover, we characterize parameter regimes where ISI noise is hypo-exponential or hyper-exponential based on its coefficient of variation being less than or higher than one, respectively.
|
| |
| 10:45-11:00, Paper WeA17.2 | Add to My Program |
| Toward Skill-Informed Haptic Feedback for Human Motor Learning in High-Dimensional De-Novo Tasks (I) |
|
| Kamboj, Ankur | Michigan State University |
| Ranganathan, Rajiv | Michigan State University |
| Tan, Xiaobo | Michigan State University |
| Srivastava, Vaibhav | Michigan State University |
Keywords: Human-in-the-loop control, Markov processes, Biomedical
Abstract: This work addresses the challenge of designing haptic feedback for high-dimensional motor learning tasks, which is difficult due to the latent evolution of motor skill and the redundancy inherent in high-dimensional control spaces. We propose to formulate the feedback design problem as a Partially Observable Markov Decision Process (POMDP) and to compute an optimal nudging policy. In our framework, the POMDP's latent states represent the evolution of the learner's internal motor skill during a high-dimensional, de-novo (novel) motor learning task performed with a hand exoskeleton. Results from a human study (N=10) on novel motor skill acquisition show faster improvement in task performance in the participant group trained with our POMDP-derived nudging policy than in a group trained with no feedback. These results demonstrate the potential of skill-informed feedback design to accelerate learning and performance.
|
| |
| 11:00-11:15, Paper WeA17.3 | Add to My Program |
| Reduced-Input Model Predictive Control for Electrical Stimulation in Multi-Electrode Systems (I) |
|
| Hakam, Noor | North Carolina State University |
| Singh, Mayank | North Carolina State Univeristy |
| Xue, Xiangming | North Carolina State University |
| Lambeth, Krysten | North Carolina State University |
| Favorov, Oleg | University of North Carolina at Chapel Hill |
| Sharma, Nitin | North Carolina State University |
Keywords: Biomedical, Model Validation, Human-in-the-loop control
Abstract: Abstract--- Functional electrical stimulation (FES) with multi-electrode systems enables more flexible control of muscle activation but is complicated by nonlinear, redundant input–output relationships that often lead to unintended co-activation. We address this as a control allocation problem and introduce a selectivity-aware framework that integrates Dynamic Active Subspaces (DAS) with Model Predictive Control (MPC). By reducing the input space to dominant, task-relevant directions, the method simplifies optimization, reducing the computational intensity of the control problem, while preserving biological richness. We demonstrate the approach using simulations from an invasive sciatic nerve cuff in a pig model.
|
| |
| 11:15-11:30, Paper WeA17.4 | Add to My Program |
| Ergodic Quasilinearization and Control for Brain Dynamics (I) |
|
| Nasiriziba, Ilia | Department of Mechanical Engineering, University of Illinois at Urbana-Champaign |
| Singh, Matthew | University of Illinois, Urbana-Champaign; Beckman Institute for Advanced Science & Technology |
Keywords: Control of networks, Large-scale systems, Stochastic optimal control
Abstract: Controlling complex, nonlinear systems, such as the brain, presents a fundamental challenge that requires simplified models for practical controller design. Traditional approaches often fail when these systems operate far from steady states under noise and changing inputs. Different control strategies drive systems into distinct behavioral regimes, each requiring a specific approximation. Rather than imposing a single approximation, this work introduces ergodic quasilinearization (EQL), which automatically identifies the appropriate linear model for each operating scenario. EQL generates adaptive linear models whose parameters adjust based on the system's long-term statistical behavior under varying inputs and noise levels. These statistics are derived analytically from the steady-state equalities, eliminating the need for repeated computation of the full nonlinear dynamics. The effectiveness of EQL is demonstrated on large-scale brain network models, where traditional methods encounter difficulties due to complex nonlinearities and high dimensionality. Conventional linearization methods perform well under fixed conditions but lose accuracy when control strategies change the operating regime. In contrast, EQL maintains accuracy across diverse operating scenarios, supporting robust controller design for systems that rarely reach simple steady states. We demonstrate the power of EQL in predicting brain-model responses to complex stimulation protocols and in identifying an optimal open-loop control for reproducing target brain-activity patterns.
|
| |
| 11:30-11:45, Paper WeA17.5 | Add to My Program |
| Blood Pressure Prediction During Blood Transfusion: A Population-Informed Multi-Modal Sequential Inference Approach |
|
| Kao, Yi-Ming | University of Maryland |
| Rezaei, Parham | University of Maryland, College Park |
| Masoumi Shahrbabak, Sina | University of Maryland |
| Pepino, Jeremy | Massachusetts General Hospital |
| Shogren, Ian | Massachusetts General Hospital |
| Wang, Yang | Massachusetts General Hospital |
| Reisner, Andrew | Harvard Medical School |
| Hahn, Jin-Oh | University of Maryland |
Keywords: Healthcare and medical systems, Biomedical
Abstract: Blood pressure (BP) management is a critical component of blood transfusion, but no mature technology capable of predicting BP response to blood transfusion exists. This paper concerns the development and preliminary in vivo testing of a BP prediction method applicable to hemorrhage and blood transfusion. Key obstacles are (i) large inter-individual variability in the BP response to blood transfusion, (ii) unknown hemorrhage, and (iii) input/state-dependent observability. To cope with these challenges, we developed a multi-modal sequential inference-enabled BP prediction method built upon a mathematical model of patient physiology parameterized by population-informed prior. The method infers patient-specific physiological state and hemorrhage, and uses them to predict future BP in a patient receiving blood transfusion. The in vivo testing of the method using the data collected from large animals undergoing hemorrhage and blood transfusion showed that it could adequately predict mean arterial BP with median absolute errors for 5-min and 15-min predictions of 3.1 mmHg and 7.4 mmHg as well as adequately infer physiological state and hemorrhage: all the hemorrhage events were detected with <3.5 min delay, with median F1 score of 85%. In sum, the prediction of BP response to blood transfusion may be feasible, even in the presence of unknown hemorrhage.
|
| |
| 11:45-12:00, Paper WeA17.6 | Add to My Program |
| Symptom-Driven Personalized Proton Pump Inhibitors Therapy Using Bayesian Neural Networks and Model Predictive Control |
|
| Li, Yutong | University of Michigan, Ann Arbor |
| Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Constrained control, Biomedical, Neural networks
Abstract: Proton Pump Inhibitors (PPIs) are the standard of care for gastric acid disorders but carry significant risks when administered chronically at high doses. Precise long-term control of gastric acidity is challenged by the impracticality of invasive gastric acid monitoring beyond 72 hours and wide inter-patient variability. We propose a noninvasive, symptom-based framework that tailors PPI dosing solely on patient-reported reflux and digestive symptom patterns. A Bayesian Neural Network prediction model learns to predict patient symptoms and quantifies its uncertainty from historical symptom scores, meal, and PPIs intake data. These probabilistic forecasts feed a chance-constrained Model Predictive Control (MPC) algorithm that dynamically computes future PPI doses to minimize drug usage while enforcing acid suppression with high confidence—without any direct acid measurement. In silico studies over diverse dietary schedules and virtual patient profiles demonstrate that our learning-augmented MPC reduces total PPI consumption by 65% compared to standard fixed regimens, while maintaining acid suppression with at least 95% probability. The proposed approach offers a practical path to personalized PPI therapy, minimizing treatment burden and overdose risk without invasive sensors.
|
| |
| WeA18 Regular Session, Churchill A2 |
Add to My Program |
| Constrained Control I |
|
| |
| Chair: Kiumarsi, Bahare | Michigan State University |
| Co-Chair: Molnar, Tamas G. | Wichita State University |
| |
| 10:30-10:45, Paper WeA18.1 | Add to My Program |
| Safety-Certified Planning and Control in Dynamic Environments Via Model Predictive Control |
|
| Khaledi, Marjan | Michigan State University |
| Kiumarsi, Bahare | Michigan State University |
Keywords: Constrained control, Optimization
Abstract: This paper proposes a novel approach to safe navigation in environments with static and dynamic obstacles by embedding control barrier functions (CBFs) within the model predictive control (MPC) framework. Unlike conventional methods that rely on unbounded additive slack variables, the proposed approach enforces each CBF constraint separately, allowing individual flexibility through dedicated slack variables with bounded relaxation weights. These weights modulate the permissible degree of constraint relaxation, ensuring that any safety softening remains quantitatively bounded, systematically tunable, and theoretically consistent with the CBF-based safety guarantees. Furthermore, the feasibility of the proposed approach is guaranteed, and the effectiveness of our method is demonstrated through the simulation results.
|
| |
| 10:45-11:00, Paper WeA18.2 | Add to My Program |
| Lag-Compensating Control Barrier Functions for Feasible Safety-Critical Control |
|
| Chen, Yuchen | University of Michigan |
| Molnar, Tamas G. | Wichita State University |
| Orosz, Gabor | University of Michigan |
Keywords: Constrained control, Lyapunov methods, Nonlinear output feedback
Abstract: First-order lags are commonly used in control system design to approximate system delays, which can significantly compromise system safety. To maintain safety under lags, this paper introduces the concept of lag-compensating control barrier function (LCCBF). First, we propose a lag-eliminating state transformation that removes first-order lag dynamics from linear systems. Then, we use this transformation to construct an LCCBF from the CBF designed for the lag-free system. We prove that controllers synthesized via the LCCBF ensure safety under actuator lags, while preserving feasibility for the same input bounds that are used to synthesize controllers via the original CBF. Finally, we simulate a double integrator with actuator limits to validate that the LCCBF maintains both safety and feasibility under lags.
|
| |
| 11:00-11:15, Paper WeA18.3 | Add to My Program |
| Safety-Critical Control with Guaranteed Lipschitz Continuity Via Filtered Control Barrier Functions |
|
| Liu, Shuo | Boston University |
| Xiao, Wei | WPI |
| Belta, Calin | University of Maryland |
Keywords: Constrained control, Lyapunov methods, Optimal control
Abstract: In safety-critical control systems, ensuring both system safety and smooth control input is essential for practical deployment. Existing Control Barrier Function (CBF) frameworks, especially High-Order CBFs (HOCBFs), effectively enforce safety constraints, but also raise concerns about the smoothness of the resulting control inputs. While smoothness typically refers to continuity and differentiability, it does not by itself ensure bounded input variation. In contrast, Lipschitz continuity is a stronger form of continuity that not only is necessary for the theoretical guarantee of safety, but also bounds the rate of variation and eliminates abrupt changes in the control input. Such abrupt changes can degrade system performance or even violate actuator limitations, yet current CBF-based methods do not provide Lipschitz continuity guarantees. This paper introduces Filtered Control Barrier Functions (FCBFs), which extend HOCBFs by incorporating an auxiliary dynamic system—referred to as an input regularization filter—to produce Lipschitz continuous control inputs. The proposed framework ensures safety, control bounds, and Lipschitz continuity of the control inputs simultaneously by integrating FCBFs and HOCBFs within a unified quadratic program (QP). Theoretical guarantees are provided and simulations on a unicycle model demonstrate the effectiveness of the proposed method compared to standard and smoothness-penalized HOCBF approaches.
|
| |
| 11:15-11:30, Paper WeA18.4 | Add to My Program |
| Input-To-State Safe Backstepping: Robust Safety-Critical Control with Unmatched Uncertainties |
|
| Cohen, Max | North Carolina State University |
| Ong, Pio | California Institute of Technology |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Lyapunov methods
Abstract: Guaranteeing safety in the presence of unmatched disturbances---uncertainties that cannot be directly canceled by the control input---remains a key challenge in nonlinear control. This paper presents a constructive approach to safety-critical control of nonlinear systems with unmatched disturbances. We first present a generalization of the input-to-state safety (ISSf) framework for systems with these uncertainties using the recently developed notion of an Optimal Decay CBF, which provides more flexibility for satisfying the associated Lyapunov-like conditions for safety. From there, we outline a procedure for constructing ISSf-CBFs for two relevant classes of systems with unmatched uncertainties: i) strict-feedback systems; ii) dual-relative-degree systems, which are similar to differentially flat systems. Our theoretical results are illustrated via numerical simulations of an inverted pendulum and planar quadrotor.
|
| |
| 11:30-11:45, Paper WeA18.5 | Add to My Program |
| Combinatorial Control Barrier Functions: Nested Boolean and P-Choose-R Compositions of Safety Constraints |
|
| Ong, Pio | California Institute of Technology |
| Lee, Haejoon | University of Michigan |
| Molnar, Tamas G. | Wichita State University |
| Panagou, Dimitra | University of Michigan, Ann Arbor |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Lyapunov methods
Abstract: This paper investigates the problem of composing multiple control barrier functions (CBFs)---and matrix control barrier functions (MCBFs)---through logical and combinatorial operations. Standard CBF formulations naturally enable conjunctive (AND) combinations, but disjunctive (OR) and more general logical structures introduce nonsmoothness and possibly a combinatorial blow-up in the number of logical combinations. We introduce the framework of combinatorial CBFs that addresses p-choose- r safety specifications and their nested composition. The proposed framework ensures safety for the exact safe set in a scalable way, using the original number of primitive constraints. We establish theoretical guarantees on safety under these compositions, and we demonstrate their use on a patrolling problem in a multi-agent system.
|
| |
| 11:45-12:00, Paper WeA18.6 | Add to My Program |
| Reformulations of Quadratic Programs for Lipschitz Continuity |
|
| Agrawal, Devansh Ramgopal | University of Michigan |
| Lee, Haejoon | University of Michigan |
| Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Optimization
Abstract: Optimization-based controllers often lack regularity guarantees, such as Lipschitz continuity, when multiple constraints are present. When used to control a dynamical system, these conditions are essential to ensure the existence and uniqueness of the system's trajectory. Here we propose a general method to convert a Quadratic Program (QP) into a Second Order Cone Program (SOCP), which is shown to be Lipschitz continuous. Key features of our approach are that (i) the regularity of the resulting formulation does not depend on the structural properties of the constraints, such as the linear independence of their gradients; and (ii) it admits a closed-form solution under some assumptions, which is not available for general QPs with multiple constraints, enabling faster computation. We support our method with rigorous analysis and examples.
|
| |
| WeA20 Invited Session, Churchill B2 |
Add to My Program |
| Set-Based Methods in Dynamic Systems and Control I |
|
| |
| Chair: Pangborn, Herschel | The Pennsylvania State University |
| Co-Chair: Ruths, Justin | University of Texas at Dallas |
| Organizer: Koeln, Justin | University of Texas at Dallas |
| Organizer: Pangborn, Herschel | The Pennsylvania State University |
| Organizer: Jain, Neera | Purdue University |
| Organizer: Ruths, Justin | University of Texas at Dallas |
| |
| 10:30-10:45, Paper WeA20.1 | Add to My Program |
| A Set-Based Approach for Stable MPC with Minkowski Cost Functions (I) |
|
| Leung, Jordan | Mitsubishi Electric Research Laboratories |
| P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Keywords: Predictive control for linear systems, Constrained control, Optimal control
Abstract: This paper considers the formulation and implementation of model predictive control (MPC) laws with cost functions defined as Minkowski functions of convex, compact sets. We propose constructive procedures for selecting stabilizing terminal elements based on a lambda-contractive set. In addition, we describe a set-based implementation of the exact MPC policy based on constrained zonotopes and a suboptimal MPC policy based on a polytopic set approximation. Our numerical simulations demonstrate that the proposed suboptimal set-based policy achieves performance comparable to the standard trajectory-based optimal control formulation, while requiring less computational effort for low-dimensional systems.
|
| |
| 10:45-11:00, Paper WeA20.2 | Add to My Program |
| Safe Control of Sampled-Data Systems Via Discretized Parametric Contracting Dynamics (I) |
|
| Marvi, Zahra | University of Minnesota |
| Bullo, Francesco | Univ of California at Santa Barbara |
| Alleyne, Andrew G. | University of Minnesota |
Keywords: Constrained control, Hybrid systems, Lyapunov methods
Abstract: This paper presents discrete control barrier proximal dynamics (D-CBPD), a computationally-light safe control method for sampled-data systems. Ensuring safety in such systems requires bridging the gap between continuous-time control design and discrete implementation. D-CBPD is a discretized contracting dynamics that solves a control barrier function (CBF)-based quadratic program (QP). We first characterize how discretized parametric contracting dynamics track their fixed point and derive an explicit upper bound on the tracking error. We then analyze their use as discrete controllers for continuous-time systems. Leveraging Lipschitz properties, we show that trajectories remain bounded under constant inputs at each interval and exhibit linear growth in passive and weakly contracting systems. This allows us to bound the deviation of the discrete control signal from the optimal continuous-time controller, both at sampling instants and during inter-sample evolution. Interpreting this deviation as an input disturbance enables analysis of the discrete-time controller as a continuous-time system subject to disturbance. Building on this, we introduce D-CBPD, a safe and scalable discrete controller that guarantees safety of a continuously evolving system with a bounded and adjustable violation margin, provide explicit condition for step size to ensure convergence, and demonstrate its effectiveness in thermal management of a simplified lithium-ion battery.
|
| |
| 11:00-11:15, Paper WeA20.3 | Add to My Program |
| Online Constraint Tightening for MPC Using Constrained Zonotope Reachability Analysis and Zonotope Over-Approximations (I) |
|
| Robbins, Joshua | The Pennsylvania State University |
| Glunt, Jonah | The Pennsylvania State University |
| Thompson, Andrew | The Pennsylvania State University |
| Pangborn, Herschel | The Pennsylvania State University |
Keywords: Predictive control for nonlinear systems, Constrained control, Formal verification/synthesis
Abstract: Robust model predictive control (MPC) generally relies on Pontryagin difference calculations to tighten constraints. In cases where the dynamics model or disturbances are not known a priori, it is desirable to perform these constraint tightening calculations online. This paper presents a constraint tightening method based on reachability analysis of error dynamics that is tractable to implement online. Constrained zonotopes are used to bound the error dynamics using polyhedral envelopes to account for dynamic nonlinearities. To leverage a recently developed method for inner-approximating Pontryagin differences where the subtrahend is a zonotope, a novel optimization-free method for over-approximating constrained zontopes as zonotopes is presented. The method is shown to consistently produce tighter over-approximations than existing optimization-free methods. The developed online constraint tightening procedure is evaluated in a numerical example where the MPC uses a linear time-varying model.
|
| |
| 11:15-11:30, Paper WeA20.4 | Add to My Program |
| Viscosity CBFs: Bridging the Control Barrier Function and Hamilton-Jacobi Reachability Frameworks in Safe Control Theory (I) |
|
| Hirsch, Dylan | UC San Diego (UCSD) |
| Fernández Fisac, Jaime | Princeton University |
| Herbert, Sylvia | UC San Diego (UCSD) |
Keywords: Constrained control, Optimal control, Lyapunov methods
Abstract: Control barrier functions (CBFs) and Hamilton-Jacobi reachability (HJR) are central frameworks in safe control. Traditionally, these frameworks have been viewed as distinct, with the former focusing on optimally safe controller design and the latter providing sufficient conditions for safety. A previous work introduced the notion of a control barrier value function (CB-VF), which is defined similarly to the other value functions studied in HJR but has certain CBF-like properties. In this work, we proceed the other direction by generalizing CBFs to non-differentiable ``viscosity'' CBFs. We show the connection between viscosity CBFs and CB-VFs, bridging the CBF and HJR frameworks. Through this bridge, we characterize the viscosity CBFs as precisely those functions which provide CBF-like safety guarantees (control invariance and smooth approach to the boundary). We then further show nice theoretical properties of viscosity CBFs, including their desirable closure under maximum and limit operations. In the process, we also extend CB-VFs to non-exponential anti-discounting and update the corresponding theory for CB-VFs along these lines.
|
| |
| 11:30-11:45, Paper WeA20.5 | Add to My Program |
| Synthesizing Provably Invariant Sets Via Stochastically Sampled Data (I) |
|
| Strong, Amy | Duke University |
| Kashani, Ali | University of New Mexico |
| Danielson, Claus | University of New Mexico |
| Bridgeman, Leila J. | Duke University |
Keywords: Constrained control, Identification, Pattern recognition and classification
Abstract: Positive invariant (PI) sets are essential for ensuring safety, i.e. constraint adherence, of dynamical systems. With the increasing availability of sampled data from complex (and often unmodeled) systems, it is advantageous to leverage these data sets for PI set synthesis. This paper uses data driven geometric conditions of invariance to synthesize PI sets from data. Where previous data driven, set-based approaches to PI set synthesis used deterministic sampling schemes, this work instead synthesizes PI sets from any pre-collected data set. Beyond a data set and Lipschitz continuity, no additional information about the system is needed. A tree data structure is used to partition the space and select samples used to construct the PI set, while Lipschitz continuity is used to provide deterministic guarantees of invariance. Finally, probabilistic bounds are given on the number of samples needed for the algorithm to determine a PI set of a certain volume.
|
| |
| 11:45-12:00, Paper WeA20.6 | Add to My Program |
| Guaranteed Privacy-Preserving Control of Discrete-Time Systems (I) |
|
| Khajenejad, Mohammad | The University of Tulsa |
Keywords: Nonlinear output feedback, Observers for nonlinear systems, Uncertain systems
Abstract: We introduce a guaranteed privacy-preserving controller for nonlinear discrete-time systems with bounded uncertainties. Moving beyond stochastic differential privacy, our design offers deterministic privacy through hard bounds on the proximity of set-valued estimates. The solution involves synthesizing a stabilizing controller for a perturbed framer system, where control gains and a privacy-inducing noise factor are co-optimized via semi-definite programming. This integrated approach ensures both input-to-state stable closed-loop dynamics and certified privacy. We also formalize the inherent performance-privacy trade-off by quantifying the accuracy loss due to privacy constraints. Simulations confirm that our method outperforms standard differential privacy techniques.
|
| |
| WeA21 Regular Session, Churchill C1 |
Add to My Program |
| Optimization I |
|
| |
| Chair: Kalaimani, Rachel Kalpana | Indian Institute of Technology Madras |
| Co-Chair: Paternain, Santiago | Rensselaer Polytechnic Institute |
| |
| 10:30-10:45, Paper WeA21.1 | Add to My Program |
| Resource Allocation under Stochastic Demands Using Shrinking Horizon Optimization |
|
| Tzikas, Alexandros | Stanford University |
| Ure, Nazim Kemal | Stanford University |
| Arief, Mansur | Stanford University |
| Kochenderfer, Mykel | Stanford University |
| Boyd, Stephen | Stanford University |
Keywords: Optimization, Finance, Stochastic optimal control
Abstract: We consider the problem of optimally allocating a limited number of resources across time to maximize revenue under stochastic demands. This formulation is relevant in various areas of control, such as supply chain, ticket revenue maximization, healthcare operations, and energy allocation in power grids. We propose a bisection method to solve the static optimization problem and extend our approach to a shrinking horizon algorithm for the sequential problem. The shrinking horizon algorithm computes future allocations after updating the distribution of future demands by conditioning on the observed values of demand. We illustrate the method on a simple synthetic example with jointly log-normal demands, showing that it achieves performance close to a bound obtained by solving the prescient problem.
|
| |
| 10:45-11:00, Paper WeA21.2 | Add to My Program |
| A Hybrid Systems Model of Feedback Optimization for Linear Systems: Convergence and Robustness |
|
| Chuy, Oscar Jed | Georgia Institute of Technology |
| Hale, Matthew | Georgia Institute of Technology |
| Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Optimization, Hybrid systems, Linear systems
Abstract: Feedback optimization algorithms compute inputs to a system using real-time output measurements, which helps mitigate the effects of disturbances. However, existing work often models both system dynamics and computations in either discrete or continuous time, which may not accurately model some applications. In this work, we model linear system dynamics in continuous time, and we model the computations of inputs in discrete time. Therefore, we present a novel hybrid systems model of feedback optimization. We first establish the well-posedness of this hybrid model and establish completeness of solutions while ruling out Zeno behavior. Then we show the state of the system converges exponentially fast to a ball of known radius about a desired goal state. Next we analytically show that this system is robust to perturbations in (i) the values of measured outputs, (ii) the matrices that model the linear time-invariant system, and (iii) the times at which inputs are applied to the system. Simulation results confirm that this approach successfully mitigates the effects of disturbances.
|
| |
| 11:00-11:15, Paper WeA21.3 | Add to My Program |
| Resource-Aware Greedy Sensor Scheduling with Guarantees under Sequence Supermodular Objectives |
|
| Cho, Wooyeong | University of California, Los Angeles |
| Bhanot, Isean | University of California, Los Angeles (UCLA) |
| Mehta, Ankur | University of California Los Angeles |
Keywords: Optimization, Linear systems, Large-scale systems
Abstract: We address resource-aware optimal sensor scheduling for linear gaussian systems over a finite horizon, aiming to minimize the total number of sensor activations while ensuring a target estimation accuracy, measured by the log determinant of the posterior error covariance. Due to the NP hard nature of the problem and the temporal coupling over the time step, prior work has mainly focused on a time-invariant approximation of optimal sensor selection while comparing with greedy scheduling. In this paper, we extend this to time varying sensor scheduling in order to approximate the total number of sensor activations over the finite time horizon. We build a sequential analysis aligned with Kalman recursion and derive a theoretical utilization bound using greedy scheduling under a sequence-supermodular objective. Our simulations across diverse system settings demonstrate that the proposed bound reliably captures sensor usage while ensuring the desired accuracy.
|
| |
| 11:15-11:30, Paper WeA21.4 | Add to My Program |
| Proximal Dynamic Framework for Equilibrium Problems with Prescribed-Time Guarantees |
|
| Abdel Aal, Osama Fuad | MESA Lab at UC Merced |
| Ozbek, Necdet Sinan | TOBB University of Economics and Technology |
| Viola, Jairo | University of California, Merced |
| Chen, YangQuan | University of California, Merced |
Keywords: Optimization, Optimization algorithms, Lyapunov methods
Abstract: Fixed-point dynamical systems provide a powerful framework for representing and analyzing a wide variety of problems in control, optimization, and learning. Many fundamental tasks—such as solving nonlinear equations, minimizing objective functions, enforcing constraints, and coordinating distributed agents—can be formulated as fixed-point problems. Traditional methods often rely on asymptotic convergence and may not guarantee performance within a finite horizon. In this work, we first establish a framework for analyzing prescribed-time convergence of fixed-point dynamical systems, ensuring that trajectories reach a fixed point within a user-defined and uniformly bounded time, independent of initial conditions. Building on this foundation, we extend the framework to proximal dynamics for equilibrium problems, thereby unifying prescribed-time fixed-point analysis with proximal operator-based methods. Rigorous analysis confirms prescribed-time stability under smooth assumptions, and numerical experiments illustrate the effectiveness of the proposed approach.
|
| |
| 11:30-11:45, Paper WeA21.5 | Add to My Program |
| Zeroth Order Gradient Descent for Low-Rank Functions |
|
| Thakur, Aman | Indian Institute of Technology Madras |
| Kalaimani, Rachel Kalpana | Indian Institute of Technology Madras |
Keywords: Optimization, Optimization algorithms, Computational methods
Abstract: In this paper we propose a Zeroth-order gradient descent algorithm for cost functions with Low-rank structure, i.e., the gradients of the cost function exists in a lower-dimensional space. Low-rank structures have emerged as a recurring phenomenon in modern machine learning, deep learning, and recommendation systems. In higher-dimensional problems with costly gradient computations, such Low-rank structures have been exploited to design first-order algorithms with reduced computational cost. We propose a Zeroth-order algorithm to cater for scenarios where the cost function is not explicitly available and only function evaluations are possible by querying a noisy oracle. The algorithm first identifies the lower dimensional subspace where the gradients exists mostly. Then in order to compute an approximate gradient the queries to the oracle are done only in the directions corresponding to this lower dimensional subspace thereby reducing the oracle complexity. We provide convergence analysis for the case of strongly convex and non-convex functions.
|
| |
| 11:45-12:00, Paper WeA21.6 | Add to My Program |
| Online Optimization on Hadamard Manifolds: Curvature Independent Regret Bounds on Horospherically Convex Objectives |
|
| Sahinoglu, Emre | Northeastern University |
| Shahrampour, Shahin | Northeastern University |
Keywords: Optimization algorithms, Optimization
Abstract: We study online Riemannian optimization on Hadamard manifolds under the framework of horospherical convexity (h-convexity). Prior work mostly relies on the geodesic convexity (g-convexity), often leading to regret bounds scaling poorly with the manifold curvature. To address this limitation, we analyze Riemannian online gradient descent for h-convex and strongly h-convex functions and establish O(sqrt{T}) and O(log(T)) regret guarantees, respectively. These bounds are curvature-independent and match the results in the Euclidean setting. We validate our approach with experiments on the manifold of symmetric positive-definite (SPD) matrices equipped with the affine-invariant metric. In particular, we investigate online Tyler's M-estimation and online Fréchet mean computation, showing the application of h-convexity in practice.
|
| |
| WeA22 Invited Session, Churchill C2 |
Add to My Program |
| Estimation and Control of Distributed Parameter Systems I |
|
| |
| Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
| Co-Chair: Hu, Weiwei | University of Georgia |
| Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
| Organizer: Hu, Weiwei | University of Georgia |
| |
| 10:30-10:45, Paper WeA22.1 | Add to My Program |
| Delay Compensation of Multi-Input Distinct Delay Nonlinear Systems Via Neural Operators (I) |
|
| Bajraktari, Filip | University of Belgrade, Serbia |
| Bhan, Luke | University of California, San Diego |
| Krstic, Miroslav | University of California, San Diego |
| Shi, Yuanyuan | University of California San Diego |
Keywords: Delay systems, Machine learning, Lyapunov methods
Abstract: In this work, we present the first stability results for approximate predictors in multi-input non- linear systems with distinct actuation delays. We show that if the predictor approximation satisfies a uniform (in time) error bound, semi-global practical stability is correspondingly achieved. For such approximators, the required uniform error bound depends on the desired region of attraction and the number of control inputs in the system. The result is achieved through transforming the delay into a transport PDE and con- ducting analysis on the coupled ODE-PDE cascade. To highlight the viability of such error bounds, we demonstrate our results on a class of approximators - neural operators - showcasing sufficiency for satisfying such a universal bound both theoretically and in simulation on a mobile robot experiment.
|
| |
| 10:45-11:00, Paper WeA22.2 | Add to My Program |
| Reduction of Traffic-Related Pollution through Green Infrastructure-Solutions (I) |
|
| Rarita', Luigi | University of Salerno |
| D'Apice, Ciro | University of Salerno |
| Manzo, Rosanna | University of Salerno |
| Piccoli, Benedetto | Rutgers University - Camden |
Keywords: Differential-algebraic systems, Numerical algorithms, Simulation
Abstract: This paper investigates the effect of green infrastructures in mitigating air pollution in urban environments. This complex issue needs a thorough examination of traffic-related emissions and the atmospheric dispersion of chemical species. A macroscopic second-order model is employed to simulate traffic conditions, followed by a microscopic approach to estimate emission levels. Finally, partial differential equations-based atmospheric dispersion, driven by chemistry ordinary differential equations for particulate/nitrogen oxides/ozone interactions, handles vertical transport under varying conditions. Numerical results show that different configurations of green barriers yield varying pollutant concentrations in metropolitan regions.
|
| |
| 11:00-11:15, Paper WeA22.3 | Add to My Program |
| Equation-Free Coarse Control of Distributed Parameter Systems Via Local Neural Operators (I) |
|
| Fabiani, Gianluca | Johns Hopkins University |
| Siettos, Constantinos | Università Degli Studi Di Napoli Federico II |
| Kevrekidis, Ioannis G. | Johns Hopkins University |
Keywords: Distributed parameter systems, Machine learning, Computer-aided control design
Abstract: The control of high-dimensional distributed parameter systems (DPS) remains a challenge when explicit coarse-grained equations are unavailable. Classical equation-free (EF) approaches rely on fine-scale simulators treated as black-box timesteppers. However, repeated simulations for steady-state computation, linearization, and control design are often computationally prohibitive, or the microscopic timestepper may not even be available, leaving us with data as the only resource. We propose a data-driven alternative that uses local neural operators, trained on spatiotemporal microscopic/mesoscopic data, to obtain efficient short-time solution operators. These surrogates are employed within Krylov subspace methods to compute coarse steady and unsteady-states, while also providing Jacobian information in a matrix-free manner. Krylov–Arnoldi iterations then approximate the dominant eigenspectrum, yielding reduced models that capture the open-loop slow dynamics without explicit Jacobian assembly. Both discrete-time Linear Quadratic Regulator (dLQR) and pole-placement (PP) controllers are based on this reduced system and lifted back to the full nonlinear dynamics, thereby closing the feedback loop. The framework is validated by stabilizing an unstable steady-state of the Liouville–Bratu PDE, demonstrating consistent performance between the learned surrogate and the true system, with quantified degradation under plant-model mismatch.
|
| |
| 11:15-11:30, Paper WeA22.4 | Add to My Program |
| Covariate-Based Localized Conditioning of Random Optimal Parameters in an Abstract Parabolic Model for the Transdermal Transport of Ethanol and a Transdermal Alcohol Biosensor (I) |
|
| Tan, Qinyang | University of Southern California |
| Chen, Hongyi | University of Southern California |
| Luczak, Susan | University of Southern California |
| Rosen, I. Gary | University of Southern California |
Keywords: Distributed parameter systems, Identification, Uncertain systems
Abstract: We study an inverse problem arising in the estimation of breath alcohol concentration (BrAC) from transdermal alcohol concentration (TAC) measured by wearable biosensors. The forward model is a parabolic PDE–ODE system describing ethanol diffusion through the skin, with random parameters reflecting inter-individual and environmental variability. To reduce uncertainty in BrAC estimation, we develop a framework that conditions the distribution of skin parameters on subject-specific covariates by exploiting the dependence of BrAC model parameters on those covariates. The key analytical tools are Banach space versions of the Implicit Function Theorem and sensitivity results expressed in terms of Frechet derivatives of parameter dependent semigroups. A central contribution is the rigorous proof of convergence of second derivatives under Galerkin approximation, ensuring that sensitivity and uncertainty quantification computed in finite dimensions faithfully approximate the infinite-dimensional setting. Together, these results establish a mathematically rigorous basis for population calibration and real-time BrAC reconstruction from TAC, with applications to clinical monitoring, addiction treatment, and digital health.
|
| |
| 11:30-11:45, Paper WeA22.5 | Add to My Program |
| Optimizing Pricing and Performance of Sensor Network for Optimal Estimation of Parabolic PDEs (I) |
|
| Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems, Kalman filtering
Abstract: This paper introduces economic aspects in the state estimator design for a class of infinite dimensional systems. At one end a spatially distributed sensor providing the best possible filter performance and associated with a highly reliable sensor is identified as a very expensive choice. At the other end, a network of inexpensive sensors with pointwise spatial distribution and characterized by reduced reliability are utilized as an alternative to the expensive sensor. Using a modification to centroidal Voronoi Tessellations to arrive at a network of pointwise sensors that collectively approximate the spatially varying expensive sensor, both the total price and the associated filter performance are examined to provide a joint economic and filter performance metric for sensor design in spatially distributed systems.
|
| |
| 11:45-12:00, Paper WeA22.6 | Add to My Program |
| Grad-Div Stabilized Mixed Finite Element Method for Control of Incompressible Flows (I) |
|
| Ravindran, S.S. | University of Alabama in Huntsville |
Keywords: Control applications, Computational methods, Fluid flow systems
Abstract: This paper studies inf-sup stable finite element discretizations for control of viscous incompressible flows with a grad-div stabilization. The proposed grad-div stabilization method augments the mixed Galerkin finite element with a term enhancing mass conservation and improving the performance of the algorithm. We analyse a fully discrete optimality system with grad-div stabilized finite element method. An optimal order error estimate of the approximations of the optimality system is derived. We formulate and solve computationally a control problem that involves velocity matching in a closed cavity. Numerical experiments show the feasibility and applicability of the grad-div stabilized finite element method for control of incompressible flows.
|
| |
| WeB03 Tutorial Session, Grand Salon 3 |
Add to My Program |
| Bridging Control with Neural Network Verifier alpha, Beta-CROWN |
|
| |
| Chair: Hu, Bin | University of Illinois at Urbana-Champaign |
| Co-Chair: Zhang, Huan | UIUC |
| Organizer: Li, Haoyu | University of Illinois, Urbana-Champaign |
| Organizer: Zhong, Xiangru | University of Illinois Urbana-Champaign |
| Organizer: Cheng, Hao | University of Illinois Urbana-Champaign |
| Organizer: Hu, Bin | University of Illinois at Urbana-Champaign |
| Organizer: Zhang, Huan | UIUC |
| |
| 13:30-15:00, Paper WeB03.1 | Add to My Program |
| Bridging Control with Neural Network Verifier Alpha-Beta-CROWN: A Tutorial (I) |
|
| Li, Haoyu | University of Illinois, Urbana-Champaign |
| Zhong, Xiangru | University of Illinois Urbana-Champaign |
| Cheng, Hao | University of Illinois Urbana-Champaign |
| Hu, Bin | University of Illinois at Urbana-Champaign |
| Zhang, Huan | UIUC |
Keywords: Neural networks, Machine learning, Reinforcement learning
Abstract: Learning-based methods for synthesizing controllers have gained popularity due to their high expressiveness and strong empirical performance. However, in safety-critical scenarios such as autonomous driving, robotics, and power systems, empirical performance alone is insufficient, and formal verification of controller properties such as stability and safety is highly desirable. Unfortunately, many prior verification approaches are either tied to specific structural assumptions on the system or certificate, making them difficult to transfer across settings, or suffer from poor scalability on higher-dimensional neural network systems. In this tutorial, we present a unified framework that aims at mitigating this gap via bridging control with the state-of-the-art neural network verifier α,β-CROWN (alpha-beta-CROWN). At its core, α,β-CROWN is a general-purpose bounding engine for general computation graphs: given an input domain, it can automatically produce certified bounds and explicit linear relaxation of the objective function. These certified bounds are useful on their own for tasks such as reachability analysis and local linear approximation, and they also provide the foundation for more complex routines that perform verification, satisfiability checking, and optimization. More specifically, many control problems reduce to verifying real-valued inequalities over a state domain (e.g., Lyapunov theory and barrier functions). Consequently, α,β-CROWN enables scalable verification of such conditions by computing tight bounds on these conditions and recursively partitioning and pruning subdomains based on the bounds. Thanks to GPU parallelization, this pipeline demonstrates superior scalability on verification and optimization problems that are challenging for traditional approaches. In this tutorial, we discuss the basics of α,β-CROWN, and introduce its application to various control-related tasks. Finally, we will discuss training frameworks for co-synthesizing controllers and corresponding verifier-friendly certificates.
|
| |
| WeB04 Regular Session, Grand Salon 4 |
Add to My Program |
| Aerospace II |
|
| |
| Chair: Nguyen, Tam W. | Kyoto University |
| Co-Chair: Wan, Yan | University of Texas at Arlington |
| |
| 13:30-13:45, Paper WeB04.1 | Add to My Program |
| Reliable Air-To-Air Communication for UAVs Using Directional Antennas |
|
| Zheng, Xinquan | San Diego State University |
| Zhang, Haomeng | San Diego State University |
| Xie, Junfei | San Diego State University |
| Chang, Jiajian | University of Texas at Arlington |
| Wan, Yan | University of Texas at Arlington |
Keywords: Communication networks, Flight control, Stability of nonlinear systems
Abstract: In unmanned aerial vehicle (UAV) air-to-air (A2A) communication with rigidly mounted directional antennas, flight-induced attitude variations can cause beam deflections, leading to degraded link quality or even complete signal loss. Existing studies typically assume level flight, thereby overlooking this critical effect. In this paper, we formulate communication reliability as a yaw alignment and pitch-constrained control problem, where yaw regulation ensures beam pointing and pitch is restricted within the half-power beamwidth (HPBW) to preserve link reliability. To address this formulation, we design a time-varying gain command-filtered backstepping controller that achieves accurate trajectory tracking while rigorously enforcing attitude constraints. Theoretical analysis shows that the closed-loop signals converge within a prescribed time and ultimately decay to zero, thus avoiding large transient responses. Simulation results demonstrate that the proposed method sustains stable and reliable communication under representative flight scenarios.
|
| |
| 13:45-14:00, Paper WeB04.2 | Add to My Program |
| Flight Control of a Fixed-Wing Aircraft with Angle-Of-Attack Sensor Failure |
|
| Richards, Riley J. | University of Michigan |
| Vander Schaaf, Jacob | University of Michigan |
| Islam, Syed Aseem Ul | University of Michigan |
| Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Flight control, Robust adaptive control, Fault tolerant systems
Abstract: Autopilots for fixed-wing aircraft typically use a combination of inertial sensors, which include accelerometers and rate gyros, and non-inertial sensors, which include GPS and air-data sensors. By measuring airspeed, angle of attack, and sideslip angle, air-data sensors determine the magnitude and direction of the aircraft velocity vector in the aircraft frame. Since lift and drag are aerodynamic effects, air-data sensors are essential for estimating the aerodynamic forces and moments applied to the vehicle. Unfortunately, air-data sensors are susceptible to failure, with potentially catastrophic consequences. For a fixed-wing flight control system based on adaptive model predictive control, this paper assesses the performance degradation due to the failure of an angle-of-attack sensor.
|
| |
| 14:00-14:15, Paper WeB04.3 | Add to My Program |
| A RRT+B-Spline Planner Guaranteeing Safe Obstacle Navigation for UAVs |
|
| Anderson, Dean B. | Brigham Young University |
| Beard, Randal W. | Brigham Young Univ |
Keywords: Flight control, Robotics
Abstract: This paper addresses the problem of obstacle nav- igation and path planning for UAVs. This is accomplished by finding a path from start to end points through an obstacle field. To address this problem, a modified Rapidly Exploring Random Trees (RRT) algorithm is used to randomly create a tree of rectangular Safe Flight Corridors (SFCs). When a valid series of SFCs has been found from the start to the goal, a flight path that stays within that series is generated using a Basis Spline (B-Spline), which provides continuous position, velocity, and acceleration control. The path is generated by adjusting the B-Spline control points to minimize the path length objective function with additional constraints on the curvature of the B- Spline. Simulation results demonstrate the effectiveness of this algorithm for UAV path planning.
|
| |
| 14:15-14:30, Paper WeB04.4 | Add to My Program |
| A Quaternion-Based Attitude Tracking Control of UAVs Using Model Predictive Control and Prescribed Performance Approaches |
|
| Mohammadzamani, Fatemeh | Carleton University |
| Shen, Chao | Carleton University |
| Hashim, Hashim A | Carleton University |
Keywords: Flight control, Predictive control for nonlinear systems, Robotics
Abstract: This paper presents an attitude tracking control method for unmanned aerial vehicles (UAVs) using a unit-quaternion representation. Initially, a prescribed performance controller (PPC) is developed to ensure closed-loop stability, demonstrating that tracking errors are exponentially converging within predefined bounds. Subsequently, the control performance is enhanced through the application of the Lyapunov-based model predictive control (LMPC). Compared with existing methods, the combination of the LMPC and PPC (i) optimizes the tracking performance, (ii) accounts for design constraints, and (iii) addresses the nonlinearities of the system's dynamics. The recursive feasibility of LMPC is rigorously proved, and the asymptotic stability of the closed-loop system can be shown using Lyapunov analysis. Finally, the applicability and validity of the theoretical results are demonstrated in simulations. To highlight the performance of the LMPC controller, it compares to two baseline controllers, PPC and an output feedback controller (OFC) that is widely studied for UAVs. The simulation results demonstrate the advantages of LMPC in attitude tracking applications. In particular, the combination of LMPC and PPC reduces the transient tracking error by 60.0% compared to OFC and 8.2% compared to PPC and improves the steady-state error by 92.0% compared to OFC and 46.4% compared to PPC.
|
| |
| 14:30-14:45, Paper WeB04.5 | Add to My Program |
| Fast RLS Identification Leveraging the Linearized System Sparsity: Predictive Cost Adaptive Control for Quadrotors |
|
| Nguyen, Tam W. | Kyoto University |
Keywords: Flight control, Indirect adaptive control, Optimal control
Abstract: This paper presents a centralized predictive cost adaptive control (PCAC) strategy for the position and attitude control of quadrotors. PCAC is an optimal, prediction-based control method that uses recursive least squares (RLS) to identify model parameters online, enabling adaptability in dynamic environments. Addressing challenges with black-box approaches in systems with complex couplings and fast dynamics, this study leverages the unique sparsity of quadrotor models linearized around hover points. By identifying only essential parameters related to nonlinear couplings and dynamics, this approach reduces the number of parameters to estimate, accelerates identification, and enhances stability during transients. Furthermore, the proposed control scheme removes the need for an attitude setpoint, typically required in conventional cascaded control designs.
|
| |
| 14:45-15:00, Paper WeB04.6 | Add to My Program |
| Coordinated UAV Beamforming and Control for Directional Jamming and Nulling |
|
| Fotiadis, Filippos | The University of Texas at Austin |
| Sadler, Brian M. | The University of Texas at Austin |
| Topcu, Ufuk | The University of Texas at Austin |
Keywords: Control over communications, Communication networks, Optimal control
Abstract: Efficient mobile jamming against eavesdroppers in wireless networks necessitates accurate coordination between mobility and antenna beamforming. We study the coordinated beamforming and control problem for a UAV that carries two omnidirectional antennas, and which uses them to jam an eavesdropper while leaving a friendly client unaffected. The UAV can shape its jamming beampattern by controlling its position, its antennas' orientation, and the relative phasing for each antenna. We derive a closed-form expression for the antennas' phases that guarantees zero jamming impact on the client. In addition, we determine the antennas’ orientation and the UAV’s position that maximizes jamming impact on the eavesdropper through an optimal control problem, optimizing the orientation pointwise and the position through the UAV’s control input. Simulations show how this coordinated beamforming and control scheme enables directional GPS denial while guaranteeing zero interference towards a friendly direction.
|
| |
| WeB05 Tutorial Session, Grand Salon 6 |
Add to My Program |
Redefining End-Of-Life: Intelligent Automation for Electronics
Remanufacturing |
|
| |
| Chair: Tian, Sibo | Texas A&M University |
| Co-Chair: Zheng, Minghui | Texas A&M University |
| Organizer: Zheng, Minghui | Texas A&M University |
| Organizer: Behdad, Sara | University of Florida |
| |
| 13:30-15:00, Paper WeB05.1 | Add to My Program |
| Redefining End-Of-Life: Intelligent Automation for Electronics Remanufacturing Systems (I) |
|
| Tian, Sibo | Texas A&M University |
| Liang, Xiao | Texas A&M University |
| Behdad, Sara | University of Florida |
| Zheng, Minghui | Texas A&M University |
Keywords: Manufacturing systems, Robotics, Machine learning
Abstract: Remanufacturing is fundamentally more challenging than traditional manufacturing due to the significant uncertainty, variability, and incompleteness inherent in end-of-life (EoL) products. At the same time, it has become increasingly essential and urgent for facilitating a circular economy, driven by the growing volume of discarded electronic products and the escalating scarcity of critical materials. In this paper, we review the existing literature and examine the key challenges as well as emerging opportunities in intelligent automation for EoL electronics remanufacturing, providing a comprehensive overview of how robotics, control, and artificial intelligence (AI) can jointly enable scalable, safe, and intelligent remanufacturing systems. This paper starts with the definition, scope, and motivation of remanufacturing within the context of a circular economy, highlighting its societal and environmental significance. Then it delves into intelligent automation approaches for disassembly, inspection, sorting, and component reprocessing in this domain, covering advanced methods for multimodal perception, decision-making under uncertainty, flexible planning algorithms, and force-aware manipulation. The paper further reviews several emerging techniques, including large foundation models, human-in-the-loop integration, and digital twins that have the potential to support future research in this area. By integrating these topics, we aim to illustrate how next-generation remanufacturing systems can achieve robust, adaptable, and efficient operation in the face of complex real-world challenges.
|
| |
| WeB06 Regular Session, Grand Salon 7 |
Add to My Program |
| Game Theory II |
|
| |
| Chair: Akyol, Emrah | SUNY Binghamton |
| Co-Chair: Nazari, Shima | UC Davis |
| |
| 13:30-13:45, Paper WeB06.1 | Add to My Program |
| Bayesian Holonic Systems: Equilibrium, Uniqueness, and Computation |
|
| Pan, Yunian | New York University |
| Zhu, Quanyan | New York University |
Keywords: Game theory, Networked control systems, Stochastic systems
Abstract: This paper addresses the challenge of modeling and control in hierarchical, multi-agent systems, known as holonic systems, where local agent decisions are coupled with global systemic outcomes. We introduce the Bayesian Holonic Equilibrium (BHE), a concept that ensures consistency between agent-level rationality and system-wide emergent behavior. We establish the theoretical soundness of the BHE by showing its existence and, under stronger regularity conditions, its uniqueness. We propose a two-time scale learning algorithm to compute such an equilibrium. This algorithm mirrors the system's structure, with a fast timescale for intra-holon strategy coordination and a slow timescale for inter-holon belief adaptation about external risks. The convergence of the algorithm to the theoretical equilibrium is validated through a numerical experiment on a continuous public good game. This work provides a complete theoretical and algorithmic framework for the principled design and analysis of strategic risk in complex, coupled control systems.
|
| |
| 13:45-14:00, Paper WeB06.2 | Add to My Program |
| A Generalized Potential Game Approach of UAV Swarm Coordination for Hidden Target Localization |
|
| Guckert, Mathis | Inria Centre at the University of Lille |
| Le Cadre, Helene | Inria Centre at the University of Lille |
| Le Hénaff, Jean | École Nationale Des Ponts Et Chaussées |
Keywords: Game theory, Optimization, Emerging control applications
Abstract: Considering a swarm of Unmanned Aerial Vehicles (UAVs) carrying sensors with nondeterministic detection and noisy localization measurements, while sharing observations with neighboring UAVs, we address the problem of localization of a hidden target in continuous space and discrete time. The goal is to coordinate UAVs to maximize the information gathered while minimizing their individual energy costs. We formulate the problem as a time-varying non-cooperative game with coupling constraints. We show that the target is localized in finite time with probability one and that the game has a generalized potential structure. Further, we provide an exact best-response algorithm for UAVs to iteratively compute their trajectories. Finally, we numerically compare the potential game to the team-based approach, demonstrating comparable performance under different communication graph structures and assessing the impact of the swarm size on various metrics.
|
| |
| 14:00-14:15, Paper WeB06.3 | Add to My Program |
| Optimal Modified Feedback Strategies in LQ Games under Control Imperfections |
|
| Rabbani, Mahdis | University of California, Davis |
| Mojahed Baghbadorani, Navid | UC Davis |
| Nazari, Shima | UC Davis |
Keywords: Game theory, Optimal control, Adaptive control
Abstract: Game-theoretic approaches and Nash equilibrium have been widely applied across various engineering domains. However, practical challenges such as disturbances, delays, and actuator limitations can hinder the precise execution of Nash equilibrium strategies. This work investigates the impact of such implementation imperfections on game trajectories and players' costs in the context of a two-player finite-horizon linear quadratic (LQ) nonzero-sum game. Specifically, we analyze how small deviations by one player, measured or estimated at each stage affect the state trajectory and the other player’s cost. To mitigate these effects, we construct a compensation law for the influenced player by augmenting the nominal game with the measurable deviation dynamics. The resulting policy is shown to be optimal within a causal affine policy class, and, for sufficiently small deviations, it locally outperforms the uncompensated equilibrium-derived feedback. Rigorous analysis and proofs are provided, and the effectiveness of the proposed approach is demonstrated through a representative numerical example.
|
| |
| 14:15-14:30, Paper WeB06.4 | Add to My Program |
| Game-Theory-Assisted Reinforcement Learning for Border Defense: Early Termination Based on Analytical Solutions |
|
| Das, Goutam | Purdue University |
| Dorothy, Michael | US Army Research Laboratory |
| Volle, Kyle | University of Florida |
| Shishika, Daigo | George Mason University |
Keywords: Game theory, Reinforcement learning, Cooperative control
Abstract: Game theory provides the gold standard for analyzing adversarial engagements, offering strong optimality guarantees. However, these guarantees often become brittle when assumptions such as perfect information are violated. Reinforcement learning (RL), by contrast, is adaptive but can be sample-inefficient in large, complex domains. This paper introduces a hybrid approach that leverages game-theoretic insights to improve RL training efficiency. We study a border defense game with limited perceptual range, where defender performance depends on both search and pursuit strategies, making classical differential game solutions inapplicable. Our method employs the Apollonius Circle (AC) to compute equilibrium in the post-detection phase, enabling early termination of RL episodes without learning pursuit dynamics. This allows RL to concentrate on learning search strategies while guaranteeing optimal continuation after detection. Across single- and multi-defender settings, this early termination method yields 10–20% higher rewards, faster convergence, and more efficient search trajectories.
|
| |
| 14:30-14:45, Paper WeB06.5 | Add to My Program |
| Multi-Agent Guided Policy Search for Non-Cooperative Dynamic Games |
|
| Li, Jingqi | University of Texas at Austin |
| Qu, Gechen | University of California, Berkeley |
| Choi, Jason J. | UCLA |
| Sojoudi, Somayeh | UC Berkeley |
| Tomlin, Claire J. | UC Berkeley |
Keywords: Game theory, Reinforcement learning, Iterative learning control
Abstract: Multi-agent reinforcement learning (MARL) optimizes strategic interactions in non-cooperative dynamic games, where agents have misaligned objectives. However, data-driven methods such as multi-agent policy gradients (MA-PG) often suffer from instability and limit-cycle behaviors. Prior stabilization techniques typically rely on entropy-based exploration, which slows learning and increases variance. We propose a model-based approach that incorporates approximate priors into the reward function as regularization. In linear quadratic (LQ) games, we prove that such priors stabilize policy gradients and guarantee local exponential convergence to an approximate Nash equilibrium. We then extend this idea to infinite-horizon nonlinear games by introducing Multi-agent Guided Policy Search (MA-GPS), which constructs short-horizon local LQ approximations from trajectories of current policies to guide training. Experiments on nonlinear vehicle platooning and a six-player strategic basketball formation show that MA-GPS achieves faster convergence and more stable learning than existing MARL methods.
|
| |
| 14:45-15:00, Paper WeB06.6 | Add to My Program |
| Revisiting Dynamic Pricing Games: Nash Equilibrium under Fairness |
|
| Anand, Anju | Binghamton University |
| Roberts, Cassidy | Binghamton University |
| Akyol, Emrah | SUNY Binghamton |
Keywords: Game theory, Smart grid, Optimization
Abstract: This paper studies the Nash equilibrium of a timeof- use electricity pricing game. Existing formulations in this literature are typically solved as Stackelberg problems, with the company posting prices and the user responding by adjusting load. In our prior work, we studied both Stackelberg orders of play and introduced a fairness penalty to prevent opportunistic price increases when the company acts as the follower. Here we formulate the simultaneous-move game directly, establish existence and uniqueness of the Nash equilibrium under the adopted model, derive the players’ best responses, and compare the resulting equilibrium numerically with the two Stackelberg benchmarks.
|
| |
| WeB07 Invited Session, Grand Salon 9 |
Add to My Program |
| Safety-Critical Control of Cyber-Physical Systems |
|
| |
| Chair: Pare, Philip E. | Purdue University |
| Co-Chair: Butler, Brooks A. | University of California, Irvine |
| Organizer: Samadi, Saba | Purdue University |
| Organizer: Butler, Brooks A. | University of California, Irvine |
| Organizer: Pare, Philip E. | Purdue University |
| |
| 13:30-13:45, Paper WeB07.1 | Add to My Program |
| Safe Event-Triggered Learning for Sampled-Data Systems (I) |
|
| Begzadić, Azra | University of California, San Diego |
| Lederer, Armin | National University of Singapore |
| Hirche, Sandra | Technische Universität München |
| Herbert, Sylvia | UC San Diego (UCSD) |
| Cortes, Jorge | UC San Diego |
Keywords: Uncertain systems, Sampled-data control, Machine learning
Abstract: Control Barrier Functions provide a principled framework for ensuring safety, yet their reliance on accurate system models is a critical limitation for practical deployment. Moreover, the majority of theory relies on continuous-time guarantees that may not hold for discrete controllers. To address these challenges, we employ Gaussian processes to learn the components of the CBF safety condition, which depend on the unknown system dynamics, directly from data. Since controllers typically need to be implemented in a sampled-data fashion, we develop robust CBF conditions that account for inter-sampling effects to ensure the safety of unknown control-affine systems. The Gaussian Process model is updated online under a bimodal control framework that switches between control-focused and learning-focused phases, and its uncertainty is leveraged to guarantee satisfaction of the CBF conditions with high probability. The effectiveness of the proposed framework is demonstrated in simulations.
|
| |
| 13:45-14:00, Paper WeB07.2 | Add to My Program |
| Data-Driven Robust Control under Input-Output Stealthy Attacks (I) |
|
| Bhowmik, Arijit | Michigan State University |
| Bopardikar, Shaunak D. | Michigan State University |
| Hespanha, Joao P. | Univ. of California, Santa Barbara |
Keywords: Data driven control, Robust control, Game theory
Abstract: We consider the problem of robust control of an unknown but minimal linear time-invariant system under input and output disturbances and adversarial manipulation. An attacker can (i) corrupt sensor measurements (deception attacks) and (ii) perturb the control channel (actuation attacks). To address the lack of model knowledge, we adapt the Data-enabled Predictive Control (DeePC) framework, which constructs predictors directly from input–output data with bounded disturbance. We formulate a finite-horizon open-loop control problem as a two-player zero-sum game with asymmetric information: the defender selects control inputs based on measured data and only knows an upper bound on disturbances, whereas the attacker has access to the true disturbance realization and can remain stealthy by hiding within this uncertainty set. The main contributions are (i) sufficient conditions for the existence of a Nash equilibrium corresponding to saddle-point policies for this game, and (ii) an analysis of the defender’s security strategy against deception and actuation attacks. Simulation studies on finite-horizon control demonstrate the effectiveness of the proposed approach.
|
| |
| 14:00-14:15, Paper WeB07.3 | Add to My Program |
| Partial Resilient Leader-Follower Consensus in Time-Varying Graphs (I) |
|
| Lee, Haejoon | University of Michigan |
| Panagou, Dimitra | University of Michigan |
Keywords: Network analysis and control, Control of networks, Fault tolerant systems
Abstract: This work studies resilient leader-follower consensus with a bounded number of adversaries. Existing approaches typically require robustness conditions of the entire network to guarantee resilient consensus. However, the behavior of such systems when these conditions are not fully met remains unexplored. To address this gap, we introduce the notion of partial leader-follower consensus, in which a subset of non-adversarial followers successfully tracks the leader’s reference state despite insufficient robustness. We propose a novel distributed algorithm - the Bootstrap Percolation and Mean Subsequence Reduced (BP-MSR) algorithm - and establish sufficient conditions for individual followers to achieve consensus via the BP-MSR algorithm in arbitrary time-varying graphs. We validate our findings through simulations, demonstrating that our method guarantees partial leader-follower consensus, even when standard resilient consensus algorithms fail.
|
| |
| 14:15-14:30, Paper WeB07.4 | Add to My Program |
| Collaborative Altruistic Safety in Coupled Multi-Agent Systems (I) |
|
| Butler, Brooks A. | University of California, Irvine |
| Tan, Xiao | Beihang University |
| Ames, Aaron D. | California Institute of Technology |
| Egerstedt, Magnus | University of North Carolina, Chapel Hill |
Keywords: Cooperative control, Networked control systems, Agents-based systems
Abstract: This paper presents a novel framework for ensuring safety in dynamically coupled multi-agent systems through collaborative control. Drawing inspiration from ecological models of altruism, we develop collaborative control barrier functions that allow agents to cooperatively enforce individual safety constraints under coupling dynamics. We introduce an altruistic safety condition based on the so-called Hamilton’s rule, enabling agents to trade off their own safety to support higher-priority neighbors. By incorporating these conditions into a distributed optimization framework, we demonstrate increased feasibility and robustness in maintaining system-wide safety. The effectiveness of the proposed approach is illustrated through simulation in a simplified formation control scenario.
|
| |
| 14:30-14:45, Paper WeB07.5 | Add to My Program |
| Robust Safety-Critical Control of Networked SIR Dynamics (I) |
|
| Samadi, Saba | Purdue University |
| Butler, Brooks A. | University of California, Irvine |
| Pare, Philip E. | Purdue University |
Keywords: Networked control systems, Network analysis and control, Control applications
Abstract: We develop a robust safety-critical control framework for networked susceptible–infected–recovered (SIR) dynamics using control barrier functions (CBFs). Each node applies a local intervention to keep infections below a prescribed threshold despite coupling with neighbors and uncertainty in epidemic parameters and measurements. We derive a closed-form nominal CBF controller and extend it to robust CBFs via compensation terms. We study two uncertainty models: a constant bounded disturbance and a low-prevalence–amplified model that increases conservatism when infections are small. Simulations on a small network show nominal safety under low uncertainty and formal robustness under higher uncertainty, with a clear safety–effort trade-off.
|
| |
| 14:45-15:00, Paper WeB07.6 | Add to My Program |
| Safe Bayesian Optimization across Noise Models Via Scenario Programming (I) |
|
| Tokmak, Abdullah | Aalto University |
| Schön, Thomas (Bo) | Uppsala University |
| Baumann, Dominik | Aalto University |
Keywords: Statistical learning, Optimization algorithms, Reinforcement learning
Abstract: Safe Bayesian optimization (BO) with Gaussian processes is an effective tool for tuning control policies in safety-critical real-world systems, specifically due to its sample efficiency and safety guarantees. However, most safe BO algorithms assume homoscedastic sub-Gaussian measurement noise, an assumption that does not hold in many relevant applications. In this article, we propose a straightforward yet rigorous approach for safe BO across noise models, including homoscedastic sub-Gaussian and heteroscedastic heavy-tailed distributions. We provide a high-probability bound on the measurement noise via the scenario approach, integrate these bounds into high probability confidence intervals, and prove safety and optimality for our proposed safe BO algorithm. We deploy our algorithm in synthetic examples and in tuning a controller for the Franka Emika manipulator in simulation.
|
| |
| WeB08 Invited Session, Grand Salon 10-13 |
Add to My Program |
| Multi-Agent Control and Coordination I |
|
| |
| Chair: Von Moll, Alexander | Air Force Research Laboratory |
| Co-Chair: Weintraub, Isaac | Air Force Research Laboratory |
| Organizer: Weintraub, Isaac | Air Force Research Laboratory |
| Organizer: Von Moll, Alexander | Air Force Research Laboratory |
| Organizer: Sinha, Abhinav | The University of Cincinnati |
| |
| 13:30-13:45, Paper WeB08.1 | Add to My Program |
| Trajectory Encryption Cooperative Salvo Guidance (I) |
|
| Gopikannan, Lohitvel | Indian Institute of Technology Bombay |
| Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
| Sinha, Abhinav | The University of Cincinnati |
Keywords: Aerospace, Multivehicle systems, Cooperative control
Abstract: This paper introduces the concept of trajectory encryption in cooperative simultaneous target interception, wherein heterogeneity in guidance principles across a team of unmanned autonomous systems is leveraged as a strategic design feature. By employing a mix of heterogeneous time-to-go formulations leading to a cooperative guidance strategy, the swarm of vehicles is able to generate diverse trajectory families. This diversity expands the feasible solution space for simultaneous target interception, enhances robustness under disturbances, and enables flexible time-to-go adjustments without predictable detouring. From an adversarial perspective, heterogeneity obscures the collective interception intent by preventing straightforward prediction of swarm dynamics, effectively acting as an encryption layer in the trajectory domain. Simulations demonstrate that the swarm of heterogeneous vehicles is able to intercept a moving target simultaneously from a diverse set of initial engagement configurations.
|
| |
| 13:45-14:00, Paper WeB08.2 | Add to My Program |
| On Robustness of Consensus Over Pseudo-Undirected Path Graphs (I) |
|
| Sinha, Abhinav | The University of Cincinnati |
| Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
| Kumar, Shashi Ranjan | Indian Institute of Technology Bombay |
Keywords: Aerospace, Cooperative control
Abstract: Consensus over networked agents is typically studied using undirected or directed communication graphs. Undirected graphs enforce symmetry in information exchange, leading to convergence to the average of initial states, while directed graphs permit asymmetry but make consensus dependent on root nodes and their influence. Both paradigms impose inherent restrictions on achievable consensus values and network robustness. This paper introduces a theoretical framework for achieving consensus over a class of network topologies, termed pseudo-undirected graphs, which retains bidirectional connectivity between node pairs but allows the corresponding edge weights to differ, including the possibility of negative values under bounded conditions. The resulting Laplacian is generally non-symmetric, yet it guarantees consensus under connectivity assumptions, to expand the solution space, which enables the system to achieve a stable consensus value that can lie outside the convex hull of the initial state set. We derive admissibility bounds for negative weights for a pseudo-undirected path graph, and show an application in the simultaneous interception of a moving target.
|
| |
| 14:00-14:15, Paper WeB08.3 | Add to My Program |
| Nonlinear Guidance for a Pursuer-Turret Team for Maneuvering Target Engagement (I) |
|
| Bajpai, Shivam | University of Cincinnati |
| Sinha, Abhinav | The University of Cincinnati |
| Weintraub, Isaac | Air Force Research Laboratory |
| Sharma, Rajnikant | IS4S |
| Casbeer, David W. | Air Force Research Laboratory |
Keywords: Aerospace, Control applications
Abstract: This paper addresses the problem of engaging a maneuvering target using a mobile pursuer equipped with a rotating turret. Unlike conventional point-capture formulations, the objective is to maintain a prescribed standoff distance between the pursuer and the target, while ensuring that the turret’s heading aligns with the instantaneous line of sight (LOS) with respect to the mobile target. The pursuer is modeled as a non-holonomic vehicle with lateral acceleration capability, whereas the turret can have free angular maneuverability. We develop a nonlinear guidance strategy that coordinates the pursuer’s motion with the turret’s rotation to neutralize the target. The proposed method explicitly accounts for the nonlinear coupled kinematics of the pursuer-turret team, ensuring stability and convergence without linearization. Simulation results demonstrate the effectiveness of the guidance strategy against a general maneuvering target.
|
| |
| 14:15-14:30, Paper WeB08.4 | Add to My Program |
| Adaptive Event-Triggered Policy Gradient for Multi-Agent Reinforcement Learning (I) |
|
| Umer, Muhammad | The University of Texas at San Antonio |
| Sinha, Abhinav | The University of Cincinnati |
| Cao, Yongcan | University of Texas, San Antonio |
Keywords: Reinforcement learning, Networked control systems
Abstract: Conventional multi-agent reinforcement learning (MARL) methods rely on time-triggered execution, where agents sample and communicate actions at fixed intervals. This approach is often computationally expensive and communication-intensive. To address this limitation, we propose ET-MAPG (Event-Triggered Multi-Agent Policy Gradient reinforcement learning)-- a framework that jointly learns an agent's control policy and its event-triggering policy. Unlike prior work that decouples these mechanisms, ET-MAPG integrates them into a unified learning process, enabling agents to learn what action to take and when to execute it. For scenarios with inter-agent communication, we introduce AET-MAPG, an attention-based variant that leverages a self-attention mechanism to learn selective communication patterns. AET-MAPG empowers agents to determine when to trigger an action, and also with whom to communicate and what information to exchange, thereby optimizing coordination. Both methods can be integrated with any policy gradient MARL algorithm. Extensive experiments across diverse MARL benchmarks demonstrate that our approaches achieve performance comparable to state-of-the-art, time-triggered baselines while significantly reducing both computational load and communication overhead.
|
| |
| 14:30-14:45, Paper WeB08.5 | Add to My Program |
| Cooperative Relay-Assisted Traveling Salesman Problem Using Graphs of Convex Sets (I) |
|
| Manyam, Satyanarayana Gupta | DCS Corporation (Air Force Research Labs Cont.) |
| Casbeer, David W. | Air Force Research Laboratory |
| Weintraub, Isaac | Air Force Research Laboratory |
Keywords: Optimization, Optimization algorithms, Autonomous systems
Abstract: A cooperative routing problem involving two vehicles, a sensing agent and a relay agent, is considered. The sensing agent must visit a given set of targets to collect data and transmit it to a ground station. Hardware limitations restrict the transmission range, requiring the second agent to act as a relay between the sensing agent and the ground station. The collected data is transmitted through a two-hop network: from the sensing agent to the relay agent, and then to the ground station. Given a set of target locations, the routing problem aims to determine the optimal sequence of targets to be visited by the sensing agent, along with the corresponding positions of the relay agent, to minimize the combined travel cost of both vehicles. This problem is formulated as a mixed-integer convex program utilizing the framework of graphs of convex sets. The formulation avoids ``big-M" constraints, resulting in a tighter convex relaxation. Furthermore, this approach is shown to be applicable to the neighborhood traveling salesman problem and can be extended to a three-agent scenario with two relay agents. Computational experiments are conducted to evaluate the proposed formulations and validate the tightness of the lower bounds obtained via convex relaxation.
|
| |
| 14:45-15:00, Paper WeB08.6 | Add to My Program |
| Tactical Asset Allocation Game (TAAG) with Sensor Performance, Deception, and Differential Subgames |
|
| Judy, Rachael L. | University of Cincinnati |
| Fuchs, Zachariah E. | University of Cincinnati |
Keywords: Game theory, Optimal control, Information theory and control
Abstract: We examine a Tactical Asset Allocation Game (TAAG) in which two opposing players attempt to place offensive and defensive assets to optimize their respective expected value upon initiation of an engagement. The Defender player selects an initial Target position to maximize the expected distance between the Target and a Penetrator agent. The Defender also positions an Interceptor to intercept the Penetrator agent and, incurring a corresponding cost, selects an obfuscation tactic for the sensor network. Based on the sensor data, the Attacker then selects an incursion point for the Penetrator with the objective of minimizing that same expected distance between Target and Penetrator. The values of the possible engagements emerging from the placement strategies are determined using a variation of the Target-Penetrator-Interceptor (TPI) game. The TAAG is then resolved by utilizing a linear program based on the matrix game established by combinations of different allocation and deception opportunities and further demonstrates the potential of differential games to be utilized in solving more complex problems in control and game theory.
|
| |
| WeB09 Invited Session, Grand Salon 12 |
Add to My Program |
| Control and Design for Energy Storage Systems |
|
| |
| Chair: Ouedraogo, Asmaou Sougri-Nooma | Texas Tech University |
| Co-Chair: Zhang, Dong | University of Oklahoma |
| Organizer: Ouedraogo, Asmaou Sougri-Nooma | Texas Tech University |
| Organizer: Docimo, Donald | Texas Tech University |
| Organizer: Soudbakhsh, Damoon | Temple University |
| Organizer: Zhang, Dong | University of Oklahoma |
| Organizer: Song, Ziyou | University of Michigan, Ann Arbor |
| Organizer: Araujo Xavier, Marcelo | Amazon Leo |
| Organizer: Moura, Scott | University of California, Berkeley |
| Organizer: Lin, Xinfan | University of California, Davis |
| Organizer: Cui, Xiaofan | University of California, Los Angeles |
| Organizer: Filgueira da Silva, Samuel | The Ohio State University |
| Organizer: Tang, Shuxia | Texas Tech University |
| Organizer: Dey, Satadru | The Pennsylvania State University |
| Organizer: Jahan, Tania Rifat | Texas Tech University |
| Organizer: Hasankhani, Arezoo | University of New Hampshire |
| |
| 13:30-13:45, Paper WeB09.1 | Add to My Program |
| Optimal and Heuristic Dispatch of PV-BESS for Multi-Load Peak Shaving (I) |
|
| McDougal, Caleb | Texas A&M University |
| Rasmussen, Bryan | Texas A&M University |
Keywords: Optimization, Energy systems, Control applications
Abstract: Simultaneous peak shaving for multi-load microgrids is an important problem with significant financial impact on industrial and commercial energy users. Cost savings can be realized by strategically reducing peak demand across multiple loads through the use of solar photovoltaic (PV) and battery energy storage systems (BESS). A mixed integer linear programming (MILP) optimization problem formulation for PV-BESS multi-load dispatch is outlined. It serves as a standard against which live controller performance can be compared. A heuristic dispatch control strategy is developed leveraging the nature of the 15 minute average demand billing structure. The practical controller has no need of complex live weather and demand forecasting and is developed for a multi-load application. Industrial electric utility data from a manufacturing facility in Houston, Texas is used for testing of the heuristic control strategy. Results show net savings within as near as 3.3% when comparing the heuristic controller and MILP on the industrial data.
|
| |
| 13:45-14:00, Paper WeB09.2 | Add to My Program |
| A Control Co-Design Framework to Achieve Solution Feasibility in Energy System Optimization Problems (I) |
|
| Jahan, Tania Rifat | Texas Tech University |
| Docimo, Donald | Texas Tech University |
Keywords: Control applications, Energy systems
Abstract: This work explores methods to identify energy system designs for infeasible control co-design optimization problems. Control co-design, or CCD, has been recognized as a powerful tool to maximize energy system capabilities through simultaneous determination of plant and controller parameters. However, due to the inherent nonlinearities, complexity, and conflicting criteria of energy systems, CCD optimization problems are susceptible to infeasibility and can lack potential solutions. While transforming the optimization problem by relaxing constraints has been developed for optimal control infeasibility challenges, solution feasibility for CCD is relatively unexplored. This paper proposes a framework to convert infeasible optimization problems into solvable forms for a class of CCD problems. The framework introduces a procedure to rank metric bounds from least likely to most likely to cause infeasibility. This provides guidance to algorithmically relax a limited number of constraints, leaving others intact. The proposed framework is applied to a CCD problem for designing a battery within a microgrid. Comparison against a baseline approach for relaxing optimization problems shows the framework requires only a reduced number of iterations to determine a solution.
|
| |
| 14:00-14:15, Paper WeB09.3 | Add to My Program |
| Safety-Aware Fast Charging Control for Lithium-Ion Batteries Via Physics-Enhanced Gaussian Process (I) |
|
| Choo, Wonoo | University of Oklahoma |
| Kajiura, Yuichi | University of Oklahoma |
| Espin, Jorge | University of Oklahoma |
| Xu, Zhicheng | University of Oklahoma |
| Zhang, Dong | University of Oklahoma |
Keywords: Energy systems, Machine learning, Optimal control
Abstract: Fast charging accelerates lithium-ion battery degradation and can trigger lithium plating, posing serious safety risks. While existing charging strategies such as Constant-Current Constant-Voltage (CCCV) or equivalent circuit model (ECM)-based charging methods are computationally efficient, they lack access to internal electrochemical states and cannot enforce degradation-aware safety limits. This work proposes a hybrid Model Predictive Control (MPC) framework that augments an ECM with a Gaussian Process (GP) surrogate trained on high-fidelity Doyle-Fuller-Newman (DFN) simulations. The key novelty lies in a GP surrogate that estimates lithium plating overpotential at the anode-separator interface, enabling real-time enforcement of physically meaningful safety constraints without the computational burden of electrochemical models. Simulation results on an LG-M50 cell show that the proposed GP-ECM MPC reduces plating violations by 85-95% compared to CCCV, ECM-MPC, and SPMe-MPC methods, with less than a 10% increase in charging time and only 7 ms of additional computation per optimization step. These results demonstrate a scalable, safety-conscious fast-charging strategy suitable for practical deployment in future battery management systems.
|
| |
| 14:15-14:30, Paper WeB09.4 | Add to My Program |
| Optimization of Phase Change Material Integration for Active Cooling Control (I) |
|
| Ouedraogo, Asmaou Sougri-Nooma | Texas Tech University |
| Docimo, Donald | Texas Tech University |
Keywords: Energy systems, Control applications
Abstract: This paper presents a unified optimization framework for phase change material (PCM) based cooling systems. Thermal management is critical in applications such as photovoltaic (PV) modules, battery packs, and power electronics, where excessive heat reduces performance and lifespan. Designing such systems is challenging because energy dynamics, capacity, heat rejection, and structural constraints must all be considered. Although prior studies have investigated PCM applications and heat transfer enhancement, there are limited efforts that unify such diverse performance objectives through formalized design methods. This paper develops a framework that formulates the PCM design problem using critical energy-based terms, with static and dynamic objectives capturing the PCM physical design and control aspects. Two case studies are used to validate the approach: the first explores passive cooling, and the second implements an active cooling configuration. The results compare the design and control of these systems, showing improvement in individual performance metrics between the two options.
|
| |
| 14:30-14:45, Paper WeB09.5 | Add to My Program |
| Threshold Policy-Enhanced Reinforcement Learning for Multi-Path Wireless Charging Scheduling in Coupled Power and Transportation Systems |
|
| Hao, Liangliang | Shandong University |
| Cai, Yuanyu | Shandong University |
| Jin, Jiangliang | Donghua University |
| Chen, Cong | Cornell University |
Keywords: Power systems, Reinforcement learning, Stochastic optimal control
Abstract: We study the optimal dynamic wireless charging (DWC) for electric vehicles in coupled power and transportation systems with stochastic renewable energy supply. We formulate the problem as a Markov decision process (MDP) that captures the interdependencies between traffic flows, charging demand, and grid operations. Our key contribution is a path-dependent threshold mechanism that reduces the high-dimensional charging decision space by exploiting the structural properties of optimal policies along vehicles’ paths. To approximate optimal thresholds, we develop a customized deep reinforcement learning framework that achieves efficient learning through intelligent action space reduction. The proposed method dynamically adjusts charging decisions by jointly considering traffic conditions, renewable availability, and grid capacity constraints. Experiments on realistic traffic-grid scenarios demonstrate up to 44% operational cost reduction compared to state-of-the-art benchmarks while reducing training time by 22%.
|
| |
| 14:45-15:00, Paper WeB09.6 | Add to My Program |
| Strategic Congestion Manipulation by Monopolistic Storage Aggregator in Power Networks |
|
| Davoudi, Mehdi | Purdue University |
| Qin, Junjie | Purdue University |
Keywords: Power systems, Smart grid, Optimization
Abstract: This paper studies the strategic behavior of a monopolistic storage aggregator in electricity markets, focusing on how coordinated operation of geographically dispersed grid-scale storage units can deliberately alter power flows to create artificial transmission congestion, i.e., congestion driven by strategic actions rather than fundamental supply-demand patterns. We model the interaction between the aggregator, who earns from both energy arbitrage and financial transmission rights (FTRs), and the system operator as a Stackelberg game. In a two-bus setting, we analytically characterize equilibrium outcomes and identify necessary and sufficient conditions for artificial congestion, expressed as a threshold on the total energy that the aggregator can strategically mobilize. We further show that FTR ownership strongly shapes incentives: Congestion is more likely when the aggregator owns/longs FTRs, but can be avoided if the aggregator is required to short them. For general networks, we present methods to solve for the equilibrium of the Stackelberg game and screen for artificial congestion.
|
| |
| WeB10 Regular Session, Grand Salon 15 |
Add to My Program |
| Linear Systems II |
|
| |
| Chair: Cichella, Venanzio | University of Iowa |
| Co-Chair: Miksits, Adam | Ericsson Research |
| |
| 13:30-13:45, Paper WeB10.1 | Add to My Program |
| Enabling Steep Slope Walking in Husky Using Reduced Order Modeling and Quadratic Programming |
|
| Venkatesh Krishnamurthy, Kaushik | Northeastern University |
| Sihite, Eric | California Institute of Technology |
| Wang, Chenghao | Northeastern University |
| Pitroda, Shreyansh | Northeastern University |
| Salagame, Adarsh | Northeastern University |
| Ramezani, Alireza | Northeastern University |
| Gharib, Morteza | Caltech |
Keywords: Linear systems, Predictive control for linear systems, Biologically-inspired methods
Abstract: Wing-assisted inclined running (WAIR) observed in some young birds, is an attractive maneuver that can be extended to legged aerial systems. This study proposes a control method using a modified Variable Length Inverted Pendulum (VLIP) by assuming a fixed zero moment point and thruster forces collocated at the center of mass of the pendulum. A QP MPC is used to find the optimal ground reaction forces and thruster forces to track a reference position and velocity trajectory. Simulation result of this VLIP model on a slope of 40 degrees is maintained and shows thruster forces that can be obtained through posture manipulation. The simulation also provides insight to how the combined efforts of the thrusters and the tractive forces from the legs make WAIR possible in thruster-assisted legged systems.
|
| |
| 13:45-14:00, Paper WeB10.2 | Add to My Program |
| Peak Bounds for the Estimation Error under Sensor Attacks (I) |
|
| Stafström, Axel | Uppsala University |
| Arnström, Daniel | Uppsala University |
| Miksits, Adam | Ericsson Research |
| Umsonst, David | Ericsson Research |
Keywords: Linear systems, Robust control, Observers for Linear systems
Abstract: This paper investigates bounds on the estimation error of a linear system affected by norm-bounded disturbances and full sensor attacks. The system is equipped with a detector that evaluates the norm of the innovation signal to detect faults, and the attacker wants to avoid detection. We utilize induced L∞ system norms, also called peak-to-peak norms, to compare the estimation error bounds under nominal operations and under attack. This leads to a sufficient condition for when the bound on the estimation error is smaller during an attack than during nominal operation. This condition is independent of the attack strategy and depends only on the attacker's desire to remain undetected and (indirectly) the observer gain. Therefore, we investigate both an observer design method, that seeks to reduce the error bound under attack while keeping the nominal error bound low, and detector threshold tuning. As a numerical illustration, we show how a sensor attack can deactivate a robust safety filter based on control barrier functions if the attacked error bound is larger than the nominal one. We also statistically evaluate our observer design method and the effect of the detector threshold.
|
| |
| 14:00-14:15, Paper WeB10.3 | Add to My Program |
| Direct Data-Driven Control to Output Tracking of Switched Linear Systems |
|
| Zhang, Zezhou | Rutgers University |
| Zou, Qingze | Rutgers, the State University of New Jersey |
Keywords: Linear systems, Switched systems, Behavioural systems
Abstract: In this paper, we consider the problem of directly synthesizing a control input from the past input-output data to achieve precision tracking for switched linear systems. To this end, we proposed a data-driven direct input synthesis controller that generates the control input from the past input-output data directly. An initialization process is designed to collect the information of transient response, and it's shown that when the past data is rich enough, i.e., satisfying the persistently excitation condition, the transient response can be precisely calculated. The control input is generated by the data-driven direct input synthesis control method, the tracking performance is guaranteed under the finite preview condition, and the tracking error is bounded by the summation of a monotonic increasing function, {alpha}_1(cdot), and a monotonic decreasing function, {alpha}_2(cdot). The proposed method is illustrated through a numerical simulation on a switched linear system.
|
| |
| 14:15-14:30, Paper WeB10.4 | Add to My Program |
| Generic Numbers of Finite Poles and Zeros: A Bond Graph-Based Determination |
|
| Laaribi, Amine | INSA Lyon |
| Eberard, Damien | Univ. LYON, INSA |
| Marquis-Favre, Wilfrid | Ampère Laboratory - INSA Of |
Keywords: Linear systems
Abstract: This paper presents a graph-based procedure for computing the generic number of finite poles and zeros. The two key ingredients are the path-cycle families and the separator-based decomposition. The former generically encodes degree and valuation of minors. The latter gives an equivalent system matrix that eases system zeros characterization within graphs. Using graph connectivity properties, the generic number of zeros located at s=0 are computed. The remaining finite structure is then derived.
|
| |
| 14:30-14:45, Paper WeB10.5 | Add to My Program |
| Design of Input-Output Observers for a Population of Systems with Bounded Frequency-Domain Variation Using DK-Iteration |
|
| Adams, Timothy Everett | McGill University |
| Forbes, James Richard | McGill University |
Keywords: Observers for Linear systems, Robust control, Uncertain systems
Abstract: This paper proposes a linear input-output observer design methodology for a population of systems in which each observer uses knowledge of the linear time-invariant dynamics of the particular device. Observers are typically composed of a known model of the system and a correction mechanism to produce an estimate of the state. The proposed design procedure characterizes the population variation in the frequency domain and synthesizes a single robust correction filter. The correction filter guarantees a level of estimation performance for all systems compatible with the uncertainty characterization. This is accomplished by posing a robust performance problem using the observer error dynamics and solving it using DK-iteration. The design procedure is experimentally demonstrated on a flexible joint robotic manipulator with varied joint stiffnesses. It is shown that the proposed robust correction filter achieves comparable estimation performance to a method using a correction gain tailored toward each joint stiffness configuration.
|
| |
| 14:45-15:00, Paper WeB10.6 | Add to My Program |
| Robust Linear Output Regulation under Piecewise Bernstein/Bézier Exogenous Signals |
|
| Cichella, Venanzio | University of Iowa |
| Mimmo, Nicola | University of Bologna |
| Marconi, Lorenzo | Univ. Di Bologna |
Keywords: Output regulation, Linear systems, Constrained control
Abstract: This paper addresses robust output regulation for linear systems driven by exogenous signals represented as composite Bernstein/Bézier polynomials. Building on prior work for single-segment signals, we develop a continuous internal-model-based controller that guarantees exponential convergence of the regulation error without hybrid switching laws. We rigorously characterize steady-state trajectories induced by composite polynomials and analyze transient errors at segment transitions. Based on this analysis, we propose a replanning strategy for future segments that mitigates switching effects while satisfying state and input constraints. Numerical simulations on navigation tasks demonstrate the effectiveness of the approach.
|
| |
| WeB11 Regular Session, Grand Salon 16 |
Add to My Program |
| Modeling |
|
| |
| Chair: Wang, Jing | Illinois State University |
| Co-Chair: Xu, Jiawei | University of Michigan |
| |
| 13:30-13:45, Paper WeB11.1 | Add to My Program |
| Data-Driven Balancing Formulation for Linear Systems with Quadratic Outputs |
|
| Padhi, Reetish | Virginia Tech |
| Gosea, Ion Victor | Max Planck Institute for Dynamics of Complex Technical Systems |
| Pontes Duff Pereira, Igor | Max Planck Institute for Dynamics of Complex Technical Systems |
| Gugercin, Serkan | Virginia Tech |
Keywords: Model/Controller reduction, Large-scale systems, Computational methods
Abstract: We develop the theoretical formulation for a non-intrusive, quadrature-based method for approximate balanced truncation (QuadBT) of linear systems with quadratic outputs, thus extending the applicability of QuadBT, which was originally designed for data-driven balanced truncation of standard linear systems with linear outputs only. The new approach makes use of the time-domain and frequency-domain quadrature-based representation of the system's infinite Gramians, only implicitly. We show that by sampling solely the extended impulse responses (kernels) of the original system and their derivatives (or the corresponding transfer functions), we construct a reduced-order model that mimics the approximation quality of the intrusive (projection-based) balanced truncation. Although the sampling of the required kernels via input/output simulations or physical experiments is still an open question, we demonstrate a proof of concept for the proposed framework on an example using numerically evaluated data.
|
| |
| 13:45-14:00, Paper WeB11.2 | Add to My Program |
| Modeling and Experimental Validations of Human Driver Controls under Advisory Speed Guidance in Connected Mixed Traffic |
|
| Wang, Rongyao | Clemson University |
| Han, Jihun | Argonne National Laboratory |
| Ard, Tyler | Argonne National Lab |
| Karbowski, Dominik | Argonne National Laboratory |
| Vahidi, Ardalan | Clemson University |
| Jia, Yunyi | Clemson Universtiy |
Keywords: Modeling, Automotive control, Supervisory control
Abstract: Human driver modeling is fundamental for analyzing connected mixed traffic, where human-driven and autonomous vehicles will coexist, and real-time advisory guidance is increasingly available. Traditional car-following models capture physical interactions with the lead vehicle but often fail to represent how drivers integrate advisory speed inputs into their decision-making. This paper proposes an optimization-based human driver model that imitates car-following behavior while accounting for advisory speed guidance. The model combines front-vehicle states with advisory information through a confidence mechanism that reflects driver trust, allowing it to reproduce both compliance with and rejection of advisories. Model parameters are optimized offline using naturalistic data collected in a mixed-reality digital twin driving simulator, enabling precise calibration to individual driving styles. Experimental results in congestive traffic scenarios show that the proposed model achieves sub–0.5 m/s trajectory error and outperforms the existing naive models.
|
| |
| 14:00-14:15, Paper WeB11.3 | Add to My Program |
| Regularized Online Adaptation of Parametric Interactive Multiple Model against Overfitting for Vehicle Motion Tracking |
|
| Wang, Zini | Central South University |
| Huang, Zhiwu | Central South University |
| Liu, Yongjie | Changsha University of Science & Technology |
| Li, Heng | Central South University |
| Liu, Weirong | Central South University |
| Wang, Jing | Illinois State University |
Keywords: Modeling, Control applications, Estimation
Abstract: Interactive Multiple Model (IMM) has been widely applied in motion tracking tasks due to its capability in handling multiple macroscopic behavior hypotheses. In the context of human-driven vehicle motion tracking, drivers often exhibit diverse and personalized driving preferences, which challenge conventional IMM algorithms that rely on predefined models. Nevertheless, the adaptation of multiple models to match observations may induce an overfitting issue, which compromises the ability to identify the real behavior. To address these challenges, this paper presents a regularized online adaptation approach against overfitting for IMM-based vehicle motion tracking. We introduce an advanced motion modeling framework in the Frenet coordinate frame, which naturally decouples longitudinal and lateral motions. Then, a differentiable parametric motion policy is incorporated for enhancing the model adaptability based on simultaneous state and parameter estimation using extended Kalman filtering (EKF). Furthermore, the overfitting problem is addressed by imposing regularization constraints on the model parameters in the filtering correction step based on the Alternating direction method of multipliers (ADMM) iteration. In simulation, the proposed method shows advantages in accurate motion tracking while remaining computationally efficient.
|
| |
| 14:15-14:30, Paper WeB11.4 | Add to My Program |
| From Ellipsoids to Midair Control of Dynamic Hitches |
|
| Xu, Jiawei | University of Michigan |
| Bhattacharya, Subhrajit | Lehigh University |
| Saldana, David | Lehigh University |
Keywords: Modeling, Lyapunov methods, Optimal control
Abstract: The ability to manipulate and interlace cables using aerial vehicles can greatly improve aerial transportation tasks. Such interlacing cables create hitches by winding two or more cables around each other, which can enclose payloads or can further develop into knots. Dynamic modeling and control of such hitches are key to mastering inter-cable interactions in the context of cable-suspended aerial manipulation. This paper introduces an ellipsoid-based kinematic model to connect the geometric nature of a hitch created by two cables and the dynamics of the hitch driven by four aerial vehicles, which reveals the control-affine form of the system. As the constraint for maintaining tension of a cable is also control-affine, we design a quadratic programming-based controller that combines Control Lyapunov and High-Order Control Barrier Functions (CLF-HOCBF-QP) to precisely track a desired hitch position and system shape while enforcing safety constraints like cable tautness. We convert desired geometric reference configurations into target robot positions and introduce a composite error into the Lyapunov function to ensure a relative degree of one to the input. Numerical simulations validate our approach, demonstrating stable, high-speed tracking of dynamic references.
|
| |
| 14:30-14:45, Paper WeB11.5 | Add to My Program |
| On Improved Statistical Accuracy of Low-Order Polynomial Chaos Approximations |
|
| Deshpande, Vedang M. | Mitsubishi Electric Research Laboratories |
| Bhattacharya, Raktim | Texas A&M |
Keywords: Reduced order modeling, Stochastic systems, Uncertain systems
Abstract: Polynomial chaos expansions provide surrogate models for stochastic systems, with coefficients typically derived using Galerkin projection, stochastic collocation, or least squares approximation. These traditional approaches often fail to accurately capture statistical moments without resorting to high-order approximations. We propose a constrained optimization framework that modifies standard techniques to determine polynomial chaos coefficients that precisely recover the first two statistical moments. The effectiveness of our approach is demonstrated on several candidate algebraic functions of random variables, showing significant improvements in statistical accuracy even with low-order approximations.
|
| |
| WeB12 Invited Session, Grand Salon 18 |
Add to My Program |
| Design, Control, and Optimization of Marine Energy Systems |
|
| |
| Chair: Vermillion, Christopher | University of Michigan |
| Co-Chair: Shabara, Mohamed | National Renewable Energy Labs (US) |
| Organizer: Hasankhani, Arezoo | University of New Hampshire |
| Organizer: Vermillion, Christopher | University of Michigan |
| Organizer: Zuo, Lei | University of Michigan |
| Organizer: Tang, Yufei | Florida Atlantic University |
| Organizer: Shabara, Mohamed | National Renewable Energy Labs (US) |
| |
| 13:30-13:45, Paper WeB12.1 | Add to My Program |
| Model Predictive Control of a Motion Rectifier Power Take-Off with Unidirectional Power Flow (I) |
|
| Yang, Lisheng | University of Michigan |
| Huang, Jianuo | University of Michigan |
| Li, Xiaofan | University of Michigan |
| Mi, Jia | Stevens Institute of Technology |
| Lambert, Scott | National Renewable Energy Laboratory |
| Sindler, Petr | National Renewable Energy Laboratory |
| Hajj, Muhammad | Stevens Institute of Technology |
| Zuo, Lei | University of Michigan |
Keywords: Mechatronics, Mechanical systems/robotics, Switched systems
Abstract: This paper describes a control algorithm and its implementation within a model predictive framework for a special motion rectifier power take-off (PTO). Commonly used for wave energy converters, the motion rectifier PTO has advantages in preventing unnecessary generator velocity reversals. However, control of this type of PTO is hard to design, especially when direct control optimization is needed in model predictive control (MPC). A modified dynamic programming (DP) algorithm is then developed to perform open-loop optimization of both the generator torque and the PTO switching. This DP optimizer is embedded inside a rolling horizon MPC style controller to enable real-time control of the PTO. Due to the sequential nature of the DP algorithm, unidirectional power constraint can be easily enforced in the optimization, allowing power evaluation of a passive power conversion system, which can use simpler components but is traditionally challenging for optimal control. Simulation results demonstrate the effectiveness of the proposed control algorithm. It is found that the rectifier PTO increases power by 120% for the selected large generator inertia, and unidirectional power flow constraint only decreases power by 8%. Finally, a hardware-in-loop experiment is conducted to demonstrate the control algorithm in real-time. Experiment results show a 50% power increase by enabling active switching control while keeping the generator damping fixed.
|
| |
| 13:45-14:00, Paper WeB12.2 | Add to My Program |
| Biconjugate Impedance Matching Control of the HERO WEC (I) |
|
| Riley, Allison | Colorado School of Mines |
| Shabara, Mohamed | National Laboratory of the Rockies |
| Aslangil, Denis | Colorado School of Mines |
Keywords: Energy systems
Abstract: Effective control strategies are essential for maximizing the power output of wave energy converters. This study applies port-based modeling for biconjugate impedance matching to the HERO WEC (hydraulic and electric reverse osmosis wave energy converter), a small-scale, modular point absorber for water desalination, exploring the benefits and limitations for this type of reactive control with a hydraulic system. Despite its foundational role in electrical engineering, biconjugate impedance matching has had limited use in wave energy and has not yet been demonstrated in wave-powered desalination systems. Representing this complex multidisciplinary system as an equivalent circuit enables analysis of frequency-based relations. This paper discusses important considerations of port-based modeling for hydraulic systems and identifies optimal parameters to reduce impedance mismatch by 99.93%, providing a clear framework for biconjugate impedance matching in wave-powered water desalination systems.
|
| |
| 14:00-14:15, Paper WeB12.3 | Add to My Program |
| From Cut-In to Rated: Multi-Region Floating Offshore Wind Farm Control for Secondary Frequency Regulation |
|
| Ampleman, Stephen | Johns Hopkins University |
| Gayme, Dennice | Johns Hopkins University |
Keywords: Energy systems, Predictive control for nonlinear systems, Reduced order modeling
Abstract: This paper describes a multi-region control framework for floating offshore wind farms. Specifically, we propose a novel generator torque controller that regulates rotor speed in Region 2, corresponding to wind speeds between the cut-in and rated values. In Region 3 (wind speeds at or above rated but below cut-out speed) we employ a PI-LQR for collective blade pitch. Control blending across the transitional wind speeds (Region 2.5) employs a sigmoid weighting function applied to the control variables. Two modeling paradigms are proposed for farm-level power tracking with rotor speed regularization: a nonlinear model predictive controller (NL-MPC) with a dynamic wake model, and a reduced order model predictive controller based on linear parameter varying turbine models with a time delay representation of wake advection (LPVTD-MPC). These approaches are evaluated over three wind inlet conditions using the PJM ancillary service certification criteria for participation in a secondary frequency regulation market. Results show that both approaches achieve scores of at least 89.9% for the three different testing scenarios, which are well above the qualification threshold of 75%. However, the LPVTD-MPC approach solves the problem in under half the time versus NL-MPC but with slightly larger fluctuations in farm-level power output, highlighting the trade-off between performance and computational tractability. The control framework is among the first to address multi-region wind turbine dynamics together with market driven power tracking objectives for floating offshore wind farms. Such multi-region control becomes increasingly necessary in the floating turbine setting where large (region spanning) wind speed variations are common due to wave induced platform pitching.
|
| |
| 14:15-14:30, Paper WeB12.4 | Add to My Program |
| A Hierarchical Optimization Method for Mission Planning of a Mobile, Anchorless Wave-Powered Desalination Device (I) |
|
| Alam, Minhazul | University of Michigan, Ann Arbor |
| McGuire, Carson | North Carolina State University |
| Liu, Limeng | University of MIchigan |
| Bryant, Matthew | North Carolina State University |
| Vermillion, Christopher | University of Michigan |
Keywords: Energy systems, Maritime control, Emerging control applications
Abstract: This paper presents a hierarchical approach for optimal mission planning of a mobile, anchorless wave-powered mobile desalination device in a spatiotemporally varying environment. The goal is to fill an on-board freshwater tank in minimum time by strategically targeting areas of maximum wave energy resource. In such an environment, the complexity of mission planning lies in the varying nature of the wave resource. In this paper, we first present the use of indirect methods to derive a continuous heading trajectory for the system. Owing to the fact that this optimization problem is ill-posed under many circumstances, we separate the problem into the optimization of a discrete set of waypoints and an effective speed at which to proceed between waypoints (while the system has very limited control over its absolute speed, it can "zig-zag" in order to achieve reduced speeds when advantageous). Here, an upper-level optimizer iteratively uses a genetic algorithm (GA) to generate populations of waypoints, and a lower-level optimizer fully optimizes the effective speed trajectories for each member of the population. We present a series of simulation results for a specified device design, based on wave energy data from the Hawaiian islands.
|
| |
| 14:30-14:45, Paper WeB12.5 | Add to My Program |
| Q-Learning Based Adaptive Linear Extended State Observer for Core Power Control of Pressurized Water Reactor |
|
| Haider, Faraz | Indian Institute of Technology Patna |
| Kumar, Sumit Ranjan | Indian Institute of Technology Patna |
| Ali, Ahmad | Indian Institute of Technology Patna |
Keywords: Control applications, Reinforcement learning, Observers for nonlinear systems
Abstract: Core power control of pressurized water reactor (PWR) is challenging as it involves highly nonlinear dynamics characterized by time-varying system parameters, modeling uncertainties, internal and external disturbances. Linear extended state observer (LESO) in the framework of active disturbance rejection control (ADRC) offers a simple yet robust paradigm for control of uncertain systems under diverse disturbances. This work proposes a maximum sensitivity constrained region based graphical approach, termed graphical LESO (GLESO), to tune the controller parameters of linear ADRC. Furthermore, a Q-learning based strategy, called Q-GLESO, is developed to adaptively tune the controller parameters between scaled multiples of those obtained from GLESO. Effectiveness of proposed schemes is demonstrated through comprehensive simulations on the nonlinear mathematical model of PWR which illustrate the superiority of Q-GLESO over GLESO and LESO for various tracking and disturbance rejection test scenarios.
|
| |
| WeB13 Regular Session, Grand Salon 19 |
Add to My Program |
| Autonomous Systems II |
|
| |
| Chair: Smith, Stephen L. | University of Waterloo |
| Co-Chair: Beaver, Logan E. | Old Dominion University |
| |
| 13:30-13:45, Paper WeB13.1 | Add to My Program |
| Trajectory Tracking with Reachability-Guided Quadratic Programming and Freeze-Resume |
|
| Gholampour, Hossein | Old Dominion University |
| Beaver, Logan E. | Old Dominion University |
Keywords: Autonomous systems, Optimization, Control applications
Abstract: Many robotic systems must follow planned paths yet pause safely and resume when people or objects intervene. We present an output–space method for systems whose tracked output can be feedback-linearized to a double integrator (e.g., manipulators). The approach has two parts. Offline, we perform a pre-run reachability check to verify that the motion plan respects speed and acceleration magnitude limits. Online, we apply a quadratic program to track the motion plan under the same limits. We use a one-step reachability test to bound the maximum disturbance the system is capable of rejecting. When the state coincides with the reference path we recover perfect tracking in the deterministic case, and we correct errors using a KKT-inspired weight. We demonstrate that safety stops and unplanned deviations are handled efficiently, and the system returns to the motion plan without replanning. We demonstrate our system's improved performance over pure pursuit in simulation.
|
| |
| 13:45-14:00, Paper WeB13.2 | Add to My Program |
| Barometer-Aided Attitude Estimation |
|
| Nyoba Tchonkeu, Melone | University of Quebec in Outaouais (UQO) |
| Berkane, Soulaimane | University of Quebec in Outaouais |
| Hamel, Tarek | I3S-CNRS-UCA |
Keywords: Autonomous vehicles, Observers for nonlinear systems, Sensor fusion
Abstract: Accurate and robust attitude estimation is a central challenge for autonomous vehicles operating in GNSS-denied or highly dynamic environments. In such cases, Inertial Measurement Units (IMUs) alone are insufficient for reliable tilt estimation due to the ambiguity between gravitational and inertial accelerations. While auxiliary velocity sensors, such as GNSS, Pitot tubes, Doppler radar, or visual odometry, are often used, they can be unavailable, intermittent, or costly. This work introduces a barometer-aided attitude estimation architecture that leverages barometric altitude measurements to infer vertical velocity and attitude within a nonlinear observer on SO(3). The design cascades a deterministic Riccati observer with a complementary filter, ensuring Almost Global Asymptotic Stability (AGAS) under a uniform observability condition while maintaining geometric consistency. The analysis highlights barometer-aided estimation as a lightweight and effective complementary modality.
|
| |
| 14:00-14:15, Paper WeB13.3 | Add to My Program |
| A Multi-Layer Resilient Architecture for Constrained Autonomous Vehicles Subject to False Data Injections During Long Range Missions |
|
| Franze, Giuseppe | Universita' Della Calabria |
| Lucia, Walter | Concordia University |
| Tedesco, Francesco | Università Della Calabria |
| Venturino, Antonello | Università Della Calabria |
| Youssef, Amr | Concordia University |
Keywords: Autonomous systems, Predictive control for linear systems, Networked control systems
Abstract: In this paper, the problem of safely driving an autonomous vehicle for long range missions despite malicious actions on the data shared with the remote side is considered. This operating scenario poses two key questions: the capability of the underlying control algorithm to be feasible until the mission is accomplished; the effectiveness of adequate countermeasures to contrast the anomalous dynamical system behavior resulting from unpredictable attack phenomena. These requirements are here addressed by designing a layered control architecture that takes advantage of three methodologies: model predictive control, data encryption and perturbation analysis.
|
| |
| 14:15-14:30, Paper WeB13.4 | Add to My Program |
| Application of Actively Coupled Sensor Configuration and Planning to an Unknown Forest Fire Environment |
|
| DesRoches, Jeffrey | Worcester Polytechnic Institute |
| Poudel, Prakash | Worcester Polytechnic Institute |
| Cowlagi, Raghvendra V. | Worcester Polytechnic Institute |
Keywords: Autonomous systems, Information theory and control, Sensor fusion
Abstract: We study autonomous path planning in an unknown, time-varying forest fire threat field, where an ego vehicle must minimize temperature exposure using partial observations from small unmanned aerial vehicle (SUAV) sensors. Threat estimates are obtained indirectly from overhead imagery processed by a U-Net segmentation model, producing noisy, biased measurements of the evolving field. To address this problem, we apply an Actively Coupled Sensor Configuration and Path-planning (A-CSCP) framework that jointly optimizes sensor placement and ego trajectory. Four sensor reconfiguration strategies are compared: random placement, lawnmower sweep, standard mutual information (SMI), and context-relevant mutual information (CRMI). To remain tractable for the large state dimension, SMI and CRMI are implemented in approximate diagonal form. Simulation results show that CRMI provides the best performance, eliminating exposure and reducing mission duration by aligning sensing actions with the ego vehicle’s future path. These results demonstrate the practical value of approximate information-driven sensing and planning in dynamic hazardous environments.
|
| |
| 14:30-14:45, Paper WeB13.5 | Add to My Program |
| Output-Feedback for UAS Using Bearing and Angular Size in an Anti-Pronav Collision Avoidance Scheme |
|
| Liu, Jen-Jui | Brigham Young University |
| Evans, Curtis P. | Brigham Young University |
| Beard, Randal W. | Brigham Young Univ |
Keywords: Autonomous systems, Vision-based control, Nonlinear output feedback
Abstract: Small unmanned aircraft must avoid mid-air collisions under strict size, weight, and power limits that constrain sensing and estimation. We propose a sensor-minimal collision-avoidance controller that uses only monocular bearing, apparent angular diameter, and the sign of bearing rate, without estimating range, time-to-collision, or relative velocity. A constructive geometric proof establishes forward invariance of the safety set and finite-time recovery under realistic kinematic and control bounds for planar encounters. Monte Carlo trials (10,000 per scenario) across intruder sizes from small UAS to Boeing 737, including non-spherical silhouettes, achieved 100% success for constant-velocity encounters and remained robust to pixel-level noise and modest intruder maneuvers. The results indicate that camera-level cues alone can support provably safe and computationally lightweight avoidance for resource-constrained aircraft.
|
| |
| 14:45-15:00, Paper WeB13.6 | Add to My Program |
| Hierarchical Informative Path Planning Via Graph Guidance and Trajectory Optimization |
|
| Iskandar, Avraiem | University of Waterloo |
| Dutta, Shamak | University of Waterloo |
| Murrant, Kevin | Memorial University |
| Pant, Yash Vardhan | University of Waterloo |
| Smith, Stephen L. | University of Waterloo |
Keywords: Autonomous systems, Optimization, Estimation
Abstract: We study informative path planning (IPP) with travel budgets in cluttered environments, where an agent collects measurements of a latent field modeled as a Gaussian process (GP) to reduce uncertainty at target locations. Graph-based solvers provide global guarantees but assume pre-selected measurement locations, while continuous trajectory optimization supports path-based sensing but is computationally intensive and sensitive to initialization in obstacle-dense settings. We propose a hierarchical framework with three stages: (i) graph-based global planning, (ii) segment-wise budget allocation using geometric and kernel bounds, and (iii) spline-based refinement of each segment with hard constraints and obstacle pruning. By combining global guidance with local refinement, our method achieves lower posterior uncertainty than graph-only and continuous baselines, while running faster than continuous-space solvers (up to 9x faster than gradient-based methods and 20x faster than black-box optimizers) across synthetic cluttered environments and Arctic datasets.
|
| |
| WeB14 Regular Session, Grand Salon 21 |
Add to My Program |
| Koopman |
|
| |
| Chair: Sabelhaus, Andrew | Boston University |
| Co-Chair: Liu, Hugh Hong-Tao | Univ. of Toronto |
| |
| 13:30-13:45, Paper WeB14.1 | Add to My Program |
| Efficient Optimal Path Planning in Dynamic Environments Using Koopman MPC |
|
| Abtahi, Mohammad | University of California Davis |
| Mojahed Baghbadorani, Navid | University of California Davis |
| Nazari, Shima | University of California Davis |
Keywords: Robotics, Nonlinear systems identification, Optimal control
Abstract: This paper presents a data-driven model predictive control framework for mobile robots navigating in dynamic environments, leveraging Koopman operator theory. Unlike the conventional Koopman-based approaches that focus on the linearization of system dynamics only, our work focuses on finding a global linear representation for the optimal path planning problem that includes both the nonlinear robot dynamics and collision-avoidance constraints. We deploy extended dynamic mode decomposition to identify linear and bilinear Koopman realizations from input–state data. Our open-loop analysis demonstrates that only the bilinear Koopman model can accurately capture nonlinear state–input couplings and quadratic terms essential for collision avoidance, whereas linear realizations fail to do so. We formulate a quadratic program (QP) for the robot path planning in the presence of moving obstacles in the lifted space and determine the optimal robot action in an MPC framework. Our approach is capable of finding the safe optimal action 320 times faster than a nonlinear MPC counterpart that solves the path planning problem in the original state space. Our work highlights the potential of bilinear Koopman realizations for linearization of highly nonlinear optimal control problems subject to nonlinear state and input constraints to achieve computational efficiency similar to linear problems.
|
| |
| 13:45-14:00, Paper WeB14.2 | Add to My Program |
| Koopman Spectral Analysis and System Identification for Stochastic Dynamical Systems Via Yosida Approximation of Generators |
|
| Zhou, Jun | Huazhong University of Science and Technology |
| Meng, Yiming | University of Illinois Urbana-Champaign |
| Liu, Jun | University of Waterloo |
Keywords: Identification, Stochastic systems, Estimation
Abstract: System identification and Koopman spectral analysis are crucial for uncovering physical laws and understanding the long-term behaviour of stochastic dynamical systems governed by stochastic differential equations (SDEs). In this work, we propose a novel method for estimating the Koopman generator of systems of SDEs, based on the theory of resolvent operators and the Yosida approximation. This enables both spectral analysis and accurate estimation and reconstruction of system parameters. The proposed approach relies on only mild assumptions about the system and effectively avoids the error amplification typically associated with direct numerical differentiation. It remains robust even under low sampling rates or with only a single observed trajectory, reliably extracting dominant spectral modes and dynamic features. We validate our method on two systems and compare it with existing techniques as benchmarks. The experimental results demonstrate the effectiveness and improved performance of our approach in system parameter estimation, spectral mode extraction, and overall robustness.
|
| |
| 14:00-14:15, Paper WeB14.3 | Add to My Program |
| Koopman Control Parametrization: Data-Driven Convex Controller Design for a Class of Nonlinear Systems |
|
| Ondogan, Taha | Boston University |
| Jing, Ran | Boston University |
| Sabelhaus, Andrew | Boston University |
| Tron, Roberto | Boston University |
Keywords: Identification for control, Lyapunov methods, Optimization
Abstract: Although Koopman operators provide a way to represent autonomous nonlinear systems as linear operators in a high-dimensional space, they do not immediately apply to non-autonomous systems with inputs. State (or output) feedback controller design therefore remains nonconvex in typical formulations, even after the use of existing approximations via bilinear control-affine terms. We address this gap by introducing the Koopman Control Parametrization, a novel parameterization of control-affine dynamical systems that directly includes a feedback controller defined as a linear combination of nonlinear measurements. With this choice, the Koopman operator of the closed-loop system is a bilinear combination of two matrices: one representing the system, and the other the controller. We propose a set of sufficient conditions such that the parametrization holds. Then, we present an algorithm that calculates the feedback matrix via semi-definite programming, producing a Lyapunov-stable closed-loop system with convex optimization. We evaluate the proposed controllers on two canonical examples of control-affine nonlinear systems (inverted pendulums), showing that our approach successfully stabilize both with a proper choice of basis functions. In summary, this manuscript introduces a control synthesis method for stabilization of nonlinear systems that is broadly applicable, can be computed efficiently, is verifiably stable and data-driven.
|
| |
| 14:15-14:30, Paper WeB14.4 | Add to My Program |
| Data-Driven Observer Synthesis for Autonomous Limit Cycle Systems through Estimation of Koopman Eigenfunctions |
|
| Ni, Angela | North Carolina State University |
| Tang, Wentao | NC State University |
Keywords: Observers for nonlinear systems, Machine learning, Nonlinear systems identification
Abstract: The signal of system states needed for feedback controllers is estimated by state observers. A generic observer design is the Kazantzis--Kravaris/Luenberger (KKL) observer, an extension of the Luenberger observer for linear to nonlinear systems. The main challenge in the KKL design is the construction of an injective mapping of states, which requires solving PDEs based on a first-principles model. This paper proposes a data-driven, Koopman operator-based method for the construction of KKL observers for planar limit cycle systems. Specifically, for such systems, the KKL injection is guaranteed to be a linear combination of Koopman eigenfunctions. Hence, the determination of such an injection is reduced to a least-squares regression problem, and the inverse of the injective mapping is then approximated via kernel ridge regression. The entire synthesis procedure uses solely convex optimization. We apply the proposed approach to the Brusselator system, demonstrating accurate estimations of the system states.
|
| |
| 14:30-14:45, Paper WeB14.5 | Add to My Program |
| Trajectory Tracking of Quadrotors Using Deep Koopman Operator-Based Data-Driven Model Predictive Control |
|
| Zhang, Kunwu | University of Toronto |
| Liu, Hugh Hong-Tao | Univ. of Toronto |
Keywords: Predictive control for nonlinear systems, Flight control, Learning
Abstract: This work investigates the trajectory tracking of quadrotors using data-driven model predictive control (DDPC). We first propose a novel deep learning-enabled Koopman operator to establish the linear representation of the quadrotor systems. Rather than employing dynamic mode decomposition, this approach leverages the physics-informed neural network (PINN) to learn both observable functions and model parameters, thereby integrating the strengths of both model-based and model-free techniques. As a result, the bluetext{proposed PINN-based deep Koopman operator} can accurately capture both the dynamic behaviours and physical constraints of actual quadrotor systems. In addition, we develop a DDPC strategy based on the learned Koopman model to stabilize the quadrotor without relying on prior knowledge of the system dynamics. The proposed deep Koopman-based MPC strategy (DK-MPC) yields a quadratic programming formulation, enabling efficient computational solutions. Finally, two numerical examples and the comparison study are provided to verify the efficacy of the proposed method.
|
| |
| 14:45-15:00, Paper WeB14.6 | Add to My Program |
| On the Parametric Interpolation of the Koopman Operator for Dynamical Systems Prediction |
|
| Pavel, Radu | Ohio State University |
| Stockar, Stephanie | The Ohio State University |
| Goswami, Debdipta | The Ohio State University |
Keywords: Nonlinear systems identification, Modeling, Identification
Abstract: This paper proposes the parameter-interpolated extended dynamic mode decomposition (piEDMD) method that explicitly incorporates parameter dependence into the Koopman operator formulation by exploiting parameter-affine system structure. The approach enables interpolation across parameter values without retraining, providing a pathway toward generalizable surrogate models. The proposed method is evaluated on two nonlinear systems, namely a damped pendulum and a Van der Pol oscillator, demonstrating accuracy comparable to standard extended dynamic mode decomposition (EDMD) while eliminating the need for repeated retraining. In testing, the standard EDMD achieved an average RMSE of 8.9% for the pendulum and 11.11% for the Van der Pol oscillator, while the piEDMD achieved 7.8% and 12.69%, respectively.
|
| |
| WeB15 Regular Session, Grand Salon 22 |
Add to My Program |
| Nonholonomic Systems |
|
| |
| Chair: Leung, Jordan | Mitsubishi Electric Research Laboratories |
| Co-Chair: Krstic, Miroslav | University of California, San Diego |
| |
| 13:30-13:45, Paper WeB15.1 | Add to My Program |
| Invariant-Set Motion-Planner for Unicycle Dynamics under Closed-Loop Feedback Linearization |
|
| Leung, Jordan | Mitsubishi Electric Research Laboratories |
| Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Nonholonomic systems, Autonomous robots, Feedback linearization
Abstract: This paper develops an invariant-set motion-planner (ISMP) for vehicles with unicycle-like dynamics. ISMPs operate by determining a sequence of obstacle-free positively invariant (PI) sets for the closed-loop system that connect the initial and target state. In this paper, we derive PI sets under a dynamic feedback linearization law. We demonstrate that the PI sets are simple in geometry and we provide an efficient algorithm for obstacle-free scaling. The PI sets are then used to develop a graph-based search for finding an obstacle-free path between two states. The approach is demonstrated in an obstacle-rich simulated environment.
|
| |
| 13:45-14:00, Paper WeB15.2 | Add to My Program |
| Inverse Optimal Feedback and Gain Margins for Unicycle Stabilization |
|
| Kim, Kwang Hak | University of California San Diego |
| Todorovski, Velimir | University of California San Diego |
| Krstic, Miroslav | University of California, San Diego |
Keywords: Nonholonomic systems, Lyapunov methods, Adaptive control
Abstract: The recent development of globally strict control Lyapunov functions (CLFs) for the challenging unicycle parking problem provides a foundation for pursuing optimality. We address this in the inverse optimal framework, thereby avoiding the need to solve the Hamilton–Jacobi–Bellman (HJB) equations, and establish a general result that is optimal with respect to a meaningful cost. We present several design examples that impose varying levels of penalty on the control effort, including arbitrarily bounded control. Furthermore, we show that the inverse optimal controller possesses an infinite gain margin thanks to the system being driftless, and by leveraging this property, we extend the design to an adaptive controller that handles model uncertainty. Finally, we compare the performance of the non-adaptive inverse optimal controller with its adaptive counterpart.
|
| |
| 14:00-14:15, Paper WeB15.3 | Add to My Program |
| Half-Global Deadbeat Parking for Dubins Vehicle |
|
| Krstic, Miroslav | University of California, San Diego |
| Kim, Kwang Hak | University of California San Diego |
| Todorovski, Velimir | University of California San Diego |
Keywords: Nonholonomic systems, Lyapunov methods, Constrained control
Abstract: This paper presents a framework for stabilizing the Dubins vehicle model to zero in finite time (deadbeat parking) by interpreting distance as a time-like variable. We develop control laws that bring the system to a desired position and orientation even when the forward velocity cannot be directly actuated. While the controllers employ inverse-distance gains, we show that the control input remains bounded for all time. In addition to basic deadbeat parking, we incorporate safety considerations by proposing algorithms that prevent the vehicle from crossing in front of the target, enforce deceleration as it approaches the target, and guarantee parking without curb violations. The resulting methods are well-suited for missile guidance and fixed-wing pursuit, however they are broadly applicable to physical systems that are represented by the Dubins vehicle model.
|
| |
| 14:15-14:30, Paper WeB15.4 | Add to My Program |
| Modular Design of Strict Control Lyapunov Functions for Global Stabilization of the Unicycle in Polar Coordinates |
|
| Todorovski, Velimir | University of California San Diego |
| Kim, Kwang Hak | University of California San Diego |
| Krstic, Miroslav | University of California, San Diego |
Keywords: Nonholonomic systems, Lyapunov methods
Abstract: Since the mid-1990s, it has been known that, unlike in Cartesian form where Brockett’s condition rules out static feedback stabilization, the unicycle is globally asymptotically stabilizable by smooth feedback in polar coordinates. In this note, we introduce a modular framework for designing smooth feedback laws that achieve global asymptotic stabilization in polar coordinates. These laws are bidirectional, enabling efficient parking maneuvers, and are paired with families of strict control Lyapunov functions (CLFs) constructed in a modular fashion. The resulting CLFs guarantee global asymptotic stability with explicit convergence rates and include barrier variants that yield “almost global” stabilization, excluding only zero-measure subsets of the rotation manifolds. The strictness of the CLFs is further leveraged in our companion paper, where we develop inverse-optimal redesigns with meaningful cost functions and infinite gain margins.
|
| |
| 14:30-14:45, Paper WeB15.5 | Add to My Program |
| Integrator Forwarding Design for Unicycles with Constant and Actuated Velocity in Polar Coordinates |
|
| Krstic, Miroslav | University of California, San Diego |
| Todorovski, Velimir | University of California San Diego |
| Kim, Kwang Hak | University of California San Diego |
| Astolfi, Alessandro | KAUST |
Keywords: Nonholonomic systems, Lyapunov methods
Abstract: In a companion paper, we present a modular framework for unicycle stabilization in polar coordinates that provides smooth steering laws through backstepping. Surprisingly, the same problem also allows application of integrator forwarding. In this work, we leverage this feature and construct new smooth steering laws together with control Lyapunov functions (CLFs), expanding the set of CLFs available for inverse optimal control design. In the case of constant forward velocity (Dubins car), backstepping produces finite-time (deadbeat) parking, and we show that integrator forwarding yields the very same class of solutions. This reveals a fundamental connection between backstepping and forwarding in addressing both the unicycle and, the Dubins car parking problems.
|
| |
| 14:45-15:00, Paper WeB15.6 | Add to My Program |
| Extremum Seeking in 3-D Environments with Obstacle Avoidance |
|
| Jiang, Lai | Zhejiang University |
| Pan, Gaofeng | Institute of Cyber-Systems and Control, Zhejiang University |
| Zhu, Yang | Zhejiang University |
Keywords: Optimal control, Adaptive control, Nonholonomic systems
Abstract: This paper presents a three-dimensional extremum seeking (ES) control framework for autonomous source localization in environments containing obstacles. Unlike conventional path planning methods that rely on prior maps or GPS, our approach enables a nonholonomic vehicle to seek an unknown source using only local measurements, without explicit position information. We adopt a Rimon–Koditschek navigation function to simultaneously encode the objective of source attraction and obstacle repulsion, allowing the ES controller to operate safely in complex 3-D environments. Theoretical analysis based on averaging theory establishes exponential convergence to an ellipsoidal region around the extremum point. Two simulation scenarios are conducted to validate the effectiveness of the proposed method: one with a single ellipsoidal obstacle and another involving multiple convex obstacles. The results demonstrate that the vehicle can successfully reach the source while avoiding collisions. This framework offers a feasible solution for GPS-denied applications such as rescue robotics and environmental monitoring.
|
| |
| WeB16 Regular Session, Grand Salon 24 |
Add to My Program |
| Reinforcement Learning II |
|
| |
| Chair: Fierro, Rafael | University of New Mexico |
| Co-Chair: Lian, Bosen | Auburn University |
| |
| 13:30-13:45, Paper WeB16.1 | Add to My Program |
| Convex Optimization-Based Inverse Reinforcement Learning for Linear Quadratic Control |
|
| Lian, Bosen | Auburn University |
| Jia-cheng, Wu | Nanyang Technological University |
| Joy, Md Hasanuzzaman Chowdhury | Auburn University |
| Xue, Wenqian | University of Florida |
| Nguyen, Nhan | NASA Ames Research Center |
Keywords: Reinforcement learning, Optimal control, Identification for control
Abstract: This paper develops a convex-optimization-based inverse reinforcement learning (CO-IRL) framework that infers all cost-functional weights and the optimal policy for linear-quadratic regulator (LQR) control systems in both discrete and continuous time. The framework incorporates adaptive dynamic programming (ADP) to eliminate the need for explicit knowledge of the system dynamics. It solves the inference problem as an iterative CO problem under appropriate convexity constraints, enabling low-cost weight recovery subject to constraints and jointly learning cost weights (for both states and controls) and value functions for policy improvement, resulting in faster convergence. We first present a model-based variant, then extend it to a model-free, data-driven version by constructing a Q-function. Convergence is proven, and simulations show substantially faster convergence than non-optimization-based IRL.
|
| |
| 13:45-14:00, Paper WeB16.2 | Add to My Program |
| Defending against Poisoning Attacks in LQR Controllers Using Reinforcement Unlearning |
|
| John, Varkey Medayil | Georgia Institute of Technology |
| Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Reinforcement learning, Optimal control, Optimization
Abstract: In this paper, we introduce ``unlearning''-based, data-driven defenses against poisoning attacks on the learning of linear quadratic controllers. We first establish conditions for detecting whether a learned controller has been compromised. We then develop defense strategies that either reconstruct the system matrix to recover the unattacked Riccati solution or solve optimization problems that adjust output and cost matrices to neutralize the attack’s influence. Simulation results confirm the effectiveness of the proposed methods in restoring optimal control performance.
|
| |
| 14:00-14:15, Paper WeB16.3 | Add to My Program |
| Necessary and Sufficient Conditions for the Optimization-Based Concurrent Execution of Learned Robotic Tasks |
|
| Tahmid, Sheikh | University of Waterloo |
| Notomista, Gennaro | University of Waterloo |
Keywords: Reinforcement learning, Robotics, Lyapunov methods
Abstract: In this work, we consider the problem of executing multiple tasks encoded by value functions, each learned through Reinforcement Learning, using an optimization-based framework. Prior works develop this framework but did not address when learned value functions can be concurrently executed. This work’s main contributions consist of theorems which provide necessary and sufficient conditions to concurrently execute sets of learned tasks within subsets of the state space using the previously proposed min-norm controller. These theorems provide insight into when learned control tasks can be made concurrently executable, when they may already be so, and when concurrent execution is not possible under the proposed framework. We also extend the proposed framework to account for value functions trained with a discount factor, making it more compatible with standard RL practices.
|
| |
| 14:15-14:30, Paper WeB16.4 | Add to My Program |
| Adversarial Reinforcement Learning for Robust Control of Fixed-Wing Aircraft under Model Uncertainty |
|
| Marquis, Dennis | Virginia Tech |
| Wilhelm, Blake | Virginia Tech |
| Muniraj, Devaprakash | Indian Institute of Technology Madras |
| Farhood, Mazen | Virginia Tech |
Keywords: Reinforcement learning, Robust control, Autonomous systems
Abstract: This paper presents a reinforcement learning-based path following controller for a fixed-wing small uncrewed aircraft system (sUAS) that is robust to uncertainties in the aerodynamic model of the sUAS. The controller is trained using the Robust Adversarial Reinforcement Learning framework, where an adversary perturbs the environment (aerodynamic model) to expose the agent (sUAS) to demanding scenarios. In our formulation, the adversary introduces rate-bounded perturbations to the aerodynamic model coefficients. We demonstrate that adversarial training improves robustness compared to controllers trained using stochastic model uncertainty. The learned controller is also benchmarked against a switched uncertain initial condition controller. The effectiveness of the approach is validated through high-fidelity simulations using a realistic six-degree-of-freedom fixed-wing aircraft model, showing accurate and robust path following performance under a variety of uncertain aerodynamic conditions.
|
| |
| 14:30-14:45, Paper WeB16.5 | Add to My Program |
| Improved Robustness of Deep Reinforcement Learning for Control of Time-Varying Systems by Bounded Extremum Seeking |
|
| Saxena, Shaifalee | University of New Mexico |
| Williams, Alan | Los Alamos National Laboratory |
| Fierro, Rafael | University of New Mexico |
| Scheinker, Alexander | Los Alamos National Lab |
Keywords: Reinforcement learning, Time-varying systems, Control applications
Abstract: In this paper, we study the use of robust model-independent bounded extremum seeking (ES) feedback control to improve the robustness of deep reinforcement learning (DRL) controllers for a class of nonlinear time-varying systems. DRL has the potential to learn from large datasets to quickly control or optimize the outputs of many-parameter systems, but its performance degrades catastrophically when the system model changes rapidly over time. Bounded ES can handle time-varying systems with unknown control directions, but its convergence speed slows down as the number of tuned parameters increases and, like all local adaptive methods, it can get stuck in local minima. We demonstrate that together, DRL and bounded ES result in a hybrid controller whose performance exceeds the sum of its parts with DRL taking advantage of historical data to learn how to quickly control a many-parameter system to a desired setpoint while bounded ES ensures its robustness to time variations. We demonstrate the generality of our combined ES-DRL controller approach with numerical studies of three very different dynamic systems: 1) a general time-varying system, 2) automatic tuning of the Low Energy Beam Transport section at the Los Alamos Neutron Science Center linear particle accelerator, and 3) an intermittent-contact robotic block pushing task with a time-varying goal.
|
| |
| 14:45-15:00, Paper WeB16.6 | Add to My Program |
| Reinforcement Learning-Based Safe Optimal Control of Nonlinear Discrete-Time Affine Systems under Adversarial Inputs |
|
| Farzanegan, Behzad | Missouri University of Science and Technology |
| Jagannathan, Sarangapani | Missouri Univ of Science & Tech |
Keywords: Neural networks, Reinforcement learning, Optimal control
Abstract: This paper presents a resilient reinforcement learning (RL)-based safety-aware optimal adaptive tracking framework for nonlinear discrete-time affine systems under state constraints and adversarial sensor, actuator, and reward attacks. Sensor and actuator attacks are modeled through networked communication, while reward attacks affect the reward function or temporal difference error (TDE). A multilayer neural network (MNN)-based actor-critic architecture estimates the cost and optimal policy, using a clipped TDE and an adaptive Gaussian-based forgetting factor to balance resilience and optimality. Sensor and actuator attack resilience is achieved via a fault-aware cost-to-go function and a control barrier function (CBF) constraint embedded in a quadratic program (QP). The effectiveness is validated on a rear-wheel-drive vehicle.
|
| |
| WeB17 Invited Session, Churchill A1 |
Add to My Program |
| Healthcare and Medical Systems (II) |
|
| |
| Chair: Hahn, Jin-Oh | University of Maryland |
| Co-Chair: Pereira, Emily | Texas Tech University |
| Organizer: Menezes, Amor A. | University of Florida |
| Organizer: Hahn, Jin-Oh | University of Maryland |
| Organizer: Medvedev, Alexander V. | Uppsala University |
| Organizer: Mesbah, Ali | University of California, Berkeley |
| Organizer: Pereira, Emily | Texas Tech University |
| Organizer: Zhang, Wenlong | Arizona State University |
| |
| 13:30-13:45, Paper WeB17.1 | Add to My Program |
| Optimal Control Strategies for Cutaneous Inflammatory Disease Treatment Using a Predictive Multi-Scale Model (I) |
|
| Subedar, Gowri Yathishchandran | Stanford University |
| Lau, Saimai | Stanford University |
| Mayalu, Michaelle | Stanford |
Keywords: Systems biology, Cellular dynamics, Biomedical
Abstract: Mathematical models of chronic inflammatory diseases such as psoriasis often capture biological detail but overlook the regulatory feedback structures that sustain pathological persistence and shape therapeutic response. We develop a predictive multi-scale framework that links systemic immune–skin interactions with gut microbiome feedback, extending beyond conventional psoriasis models that focus narrowly on local cytokine pathways. This formulation reveals that disease trajectories may be governed by a small set of nonlinear motifs that generate multistability-consistent with observed remission, relapse, and residual inflammation. Sensitivity and bifurcation analyses identify keratinocyte decay and probiotic-associated microbial growth as critical parameters for steering the system away from "diseased" equilibria. Leveraging these insights, we design a hybrid intervention strategy that combines impulsive phototherapy with continuous probiotic supplementation. Numerical solutions show that this approach may reduce treatment burden, accelerate remission, and be adaptable to heterogeneous initial conditions, highlighting opportunities for tailoring therapy to patient-specific conditions. Beyond psoriasis, the framework establishes a scalable foundation for analyzing coupled microbial–immune systems and for designing therapeutic strategies that exploit underlying feedback structures.
|
| |
| 13:45-14:00, Paper WeB17.2 | Add to My Program |
| Angular Momentum Linear Inverted Pendulum Model for Bipedal Walkers under Foot Slip: Analysis and Experiments (I) |
|
| Huang, Xinyan | Rutgers University |
| Chen, Kuo | Motional AD, LLC |
| Chen, Xunjie | Rutgers, the State University of New Jersey |
| Yi, Jingang | Rutgers University |
Keywords: Biomedical, Modeling, Robotics
Abstract: Maintaining balance during unexpected foot slip is a challenging task for bipedal locomotion. Simplified motion models from linear inverted pendulum (LIP) have demonstrated as an effective tool for recovery gait planning. However, the existing used LIP models often neglect whole-body angular momentum for slip-and-fall recovery design. We propose to use an angular-momentum LIP with flywheel (ALIP-F) model to incorporate two control inputs, center of pressure (CoP) and whole-body angular momentum, into balance recovery design. It is shown that the angular momentum provides a comparable balance recoverable actuation as the CoP control. The model analysis and comparison explain and quantify the use of angular momentum in bipedal balance recovery under foot slip. We further conduct human slip-and-fall experiments to validate the model analysis and properties. This work provides modeling and analysis methods for understanding slip recovery and designing robust biped balance recovery control.
|
| |
| 14:00-14:15, Paper WeB17.3 | Add to My Program |
| A Model of Cell Size Control in Proliferating Human Cells (I) |
|
| Rezaee, Sayeh | University of Delaware |
| Nieto, Cesar | University of Delaware |
| Singh, Abhyudai | University of Delaware |
Keywords: Biological systems, Stochastic systems, Modeling
Abstract: Regulation of cell size requires coordination between growth and division to maintain stable size distributions despite intrinsic stochastic fluctuations. Experiments have established the adder principle as a robust mechanism of size homeostasis, where cells add a nearly constant amount of size each cycle, independent of birth size. Theoretical studies suggest that this behavior emerges when cell cycle progression is coupled with cell growth. However, human cells may exhibit growth dynamics that deviate from the classical exponential assumption. Motivated by this, a stochastic framework for size regulation in human cells has been developed by incorporating a Hill-type growth law into the adder-based model to capture growth saturation. To represent the sequential nature of division, a stochastic multi-step model is implemented where cells progress through internal regulatory stages before dividing. This framework yields exact analytical expressions for statistical moments of the steady-state cell size distribution. Results show that stronger growth saturation increases the mean cell size while slightly reducing fluctuations compared to exponential growth. Importantly, the adder property is preserved, indicating that the reduced variability is a consequence of the growth law rather than simple scaling with mean size. Finally, we show how growth saturation affects variability at the population level during stochastic clonal proliferation.
|
| |
| 14:15-14:30, Paper WeB17.4 | Add to My Program |
| Static Friction Modeling and Compensation for Improved Force Tracking in Pneumatic Cylinders Using Sliding Mode Control |
|
| Yacoub, Ammar | Southern Methodist University |
| Richer, Edmond | SMU |
| Hurmuzlu, Yildirim | Southern Methodist Univ |
Keywords: Biomedical, Fluid power control, Variable-structure/sliding-mode control
Abstract: Static friction, also known as stiction, is a primary source of tracking error in pneumatic cylinders, which significantly limits their use in high-precision force control tasks. This paper presents a detailed experimental study of the static friction behavior of a small pneumatic cylinder, examining how the friction force depends on piston position and supply pressure. Based on these findings, a new static friction model that considers pressure and position is developed to accurately represent the stiction phenomenon. Additionally, a Sliding Mode Controller (SMC) is designed and implemented to produce a desired output force. The SMC's performance is evaluated experimentally both with and without the proposed friction compensation model. Results show that the SMC with the integrated compensation reduces steady-state force tracking errors compared to the uncompensated controller.
|
| |
| 14:30-14:45, Paper WeB17.5 | Add to My Program |
| Adaptive Chemotherapy Control under Tumor Heterogeneity Via Reinforcement Learning |
|
| Kidane, Bereket Sitotaw | The University of Texas at Arlington |
| Motayed, Md Samiul Haque | The University of Texas at Arlington |
| Wang, Shuo | University of Texas at Arlington |
Keywords: Reinforcement learning, Optimal control, Biological systems
Abstract: Designing effective chemotherapy regimens is hindered by tumor heterogeneity and drug resistance, which complicate the deployment of patient-specific model-based optimal control across diverse populations. We develop and compare closed-loop deep reinforcement learning (DRL) dosing policies with continuous (TD3) and discrete (DQN) action spaces trained on a high-dimensional heterogeneous tumor model. The DRL policies are benchmarked against a Pontryagin's Maximum Principle (PMP)-derived open-loop benchmark. We assess generalization under parametric heterogeneity using a 100-patient virtual cohort with 10% uniform perturbations in growth and drug-sensitivity parameters. Across this cohort, TD3 achieves higher average tumor reduction, while DQN yields tighter inter-patient dosing consistency, revealing a clear efficacy-consistency trade-off in this study. Our simulations assume full observation of all tumor subpopulations; translation to sparse and noisy clinical measurements will require partial-observability formulations and/or state estimation. Overall, the results show that simulation-trained DRL can learn state-dependent feedback dosing policies that complement open-loop optimal control benchmarks.
|
| |
| 14:45-15:00, Paper WeB17.6 | Add to My Program |
| Fractional-Order Neural Networks for Data-Driven Model Predictive Control of an Artificial Pancreas |
|
| Baldisseri, Federico | Sapienza University of Rome |
| Wrona, Andrea | La Sapienza |
| Menegatti, Danilo | DIAG, Università Degli Studi Di Roma La Sapienz |
| Delli Priscoli, Francesco | Università Di Roma |
| Koledin, Nebojsa | University of Messina |
| Caponetto, Riccardo | University of Messina |
| Patanè, Luca | University of Messina |
Keywords: Biomedical, Identification for control, Neural networks
Abstract: Accurate prediction of blood glucose trajectories is essential for safe automated insulin delivery and clinical decision support in type 1 diabetes. Yet existing data-driven methods based on historical patient data perform poorly in predicting long-term outcomes. This work presents a fractional-order neural network (FONN) framework for the identification and control of blood glucose dynamics in patients affected by type~1 diabetes. The proposed approach integrates Grünwald–Letnikov operators into the neural network architecture, enabling the inclusion of long-term memory effects into the learning process. In open-loop system identification, the FONN is benchmarked against its integer-order ablation (IONN), and a Long Short-Term Memory (LSTM) model. On a widely used in-silico simulator, LSTM attains the best short-horizon forecasts, whereas FONN yields higher accuracy as the prediction horizon increases. Closed-loop evaluations in the context of data-driven model predictive control confirm the advantages of the FONN compared to a linear model-based formulation and the LSTM approach itself, with improved performance in terms of time in normoglycemia and euglycemia.
|
| |
| WeB18 Regular Session, Churchill A2 |
Add to My Program |
| Constrained Control II |
|
| |
| Chair: Li, Huayi | University of Kentucky |
| Co-Chair: Hosseinzadeh, Mehdi | Washington State University |
| |
| 13:30-13:45, Paper WeB18.1 | Add to My Program |
| Safety-Critical Control with Bounded Inputs: A Closed-Form Solution for Backup Control Barrier Functions |
|
| van Wijk, David E. J. | California Institute of Technology |
| Das, Ersin | Illinois Institute of Technology |
| Molnar, Tamas G. | Wichita State University |
| Ames, Aaron D. | California Institute of Technology |
| Burdick, Joel W. | California Inst. of Tech |
Keywords: Constrained control, Lyapunov methods, Autonomous systems
Abstract: Verifying the safety of controllers is critical for many applications, but is especially challenging for systems with bounded inputs. Backup control barrier functions (bCBFs) offer a structured approach to synthesizing safe controllers that are guaranteed to satisfy input bounds by leveraging the knowledge of a backup controller. While powerful, bCBFs require solving a high-dimensional quadratic program at run time, which may be too costly for computationally-constrained systems such as aerospace vehicles. We propose an approach that optimally interpolates between a nominal controller and the backup controller, and we derive the solution to this optimization problem in closed form. We prove that this closed-form controller is guaranteed to be safe while obeying input bounds. We demonstrate the effectiveness of the approach on a double integrator and a nonlinear fixed-wing aircraft example.
|
| |
| 13:45-14:00, Paper WeB18.2 | Add to My Program |
| From Vertices to Convex Hulls: Certifying Set-Wise Compatibility for CBF Constraints |
|
| Mousavi, Shima Sadat | California Institute of Technology (Caltech) |
| Tan, Xiao | Beihang University |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Optimization
Abstract: This paper develops certificates that propagate compatibility of multiple control barrier function (CBF) constraints from sampled vertices to their convex hull. Under mild concavity and affinity assumptions, we present three sufficient feasibility conditions under which feasible inputs over the convex hull can be obtained per coordinate, with a common input, or via convex blending. We also describe the associated computational methods, based on interval intersections or an offline linear program (LP). Beyond certifying compatibility, we give conditions under which the quadratic-program (QP) safety filter is affine in the state. This enables explicit implementations via convex combinations of vertex-feasible inputs. Case studies illustrate the results.
|
| |
| 14:00-14:15, Paper WeB18.3 | Add to My Program |
| Neural Explicit Reference Governor for Safe Control of Resource-Constrained Autonomous Systems in Dynamic Environments |
|
| Momani, Mu'taz | Washington State University |
| Hosseinzadeh, Mehdi | Washington State University |
Keywords: Constrained control, Autonomous systems, Machine learning
Abstract: The explicit reference governor is a systematic approach for ensuring safety constraint satisfaction in pre-stabilized systems. Its core principle is to adjust the applied reference by solving an optimization problem in real time, a task that becomes computationally demanding when the safe set is non-convex and time-varying, particularly for autonomous systems with limited computational resources. To address this challenge, this paper proposes leveraging neural networks to approximate the solution of such complex optimization problems. A complementary mechanism is introduced to guarantee constraint satisfaction despite approximation errors in the network outputs. Under reasonable assumptions on the rate of change of the constraints and system dynamics, both constraint satisfaction and convergence properties are rigorously established. The effectiveness of the proposed scheme is demonstrated experimentally on a drone navigation task, where the admissible flight region is non-convex, time-varying, and contains a moving obstacle represented by a human.
|
| |
| 14:15-14:30, Paper WeB18.4 | Add to My Program |
| CPU and GPU-Based Parallelization of the Robust Reference Governor |
|
| Ossareh, Hamid | University of Vermont |
| Shayne, William | University of Vermont |
| Chevalier, Samuel | MIT |
Keywords: Constrained control, Predictive control for nonlinear systems, Robust control
Abstract: Constraint management is a central challenge in modern control systems. A solution is the Reference Governor (RG), which is an add-on strategy to pre-stabilized feedback control systems to enforce state and input constraints by shaping the reference command. While robust formulations of RG exist for linear systems, their extension to nonlinear systems is often computationally intractable. This paper develops a scenario-based robust RG formulation for nonlinear systems and investigates its parallel implementation on multi-core CPUs and CUDA-enabled GPUs. We analyze the computational structure of the algorithm, identify parallelization opportunities, and implement the resulting schemes on modern parallel hardware. Benchmarking on a nonlinear hydrogen fuel cell model demonstrates up to three orders of magnitude speed-up as compared to sequential implementations.
|
| |
| 14:30-14:45, Paper WeB18.5 | Add to My Program |
| A Less-Conservative Reference Governor Strategy for Safe Command Tracking of Nonlinear Systems |
|
| Li, Huayi | University of Kentucky |
Keywords: Constrained control, Predictive control for nonlinear systems
Abstract: Disturbance rejection reference governors based on linear models can enforce constraints for command tracking of nonlinear systems by considering the mismatch between the linear prediction model and the plant dynamics as a fictitious disturbance. Strategies to implement a linear model-based reference governor on a nonlinear system often use a constant error set for the mismatch, which may lead to a large safety margin and thus conservative performance. Solutions proposed in the literature often trade off the advantages and proven properties of the conventional design, such as recursive feasibility and computational effectiveness. To provide a balanced treatment, this paper proposes exploiting error sets that converge to the singleton of the origin, achieved by enforcing limits to the change rate of the modified reference. We formulate the proposed reference governor as a quadratic programming command governor problem with linear inequality constraints to contain the computational footprint and prove that the problem is recursively feasible. Numerical simulation results are reported.
|
| |
| 14:45-15:00, Paper WeB18.6 | Add to My Program |
| Connectivity-Preserving Multi-Agent Area Coverage Via Density-Driven Optimal Control (D2OC) |
|
| Lee, Kooktae | New Mexico Tech |
| Brook, Ethan | New Mexico Tech |
Keywords: Constrained control, Cooperative control, Decentralized control
Abstract: Multi-agent systems are widely used for area coverage tasks in applications such as search-and-rescue, environmental monitoring, and precision agriculture. Achieving non-uniform coverage, where certain regions are prioritized, requires coordinating agents while accounting for dynamic and communication constraints. Existing density-driven methods effectively distribute agents according to a reference density but typically do not guarantee connectivity, which can lead to disconnected agents and degraded coverage in practical deployments. This letter presents a connectivity-preserving approach within the Density-Driven Optimal Control (D2OC) framework. The coverage problem, expressed via the Wasserstein distance between agent distributions and a reference density, is formulated as a quadratic program. Communication constraints are incorporated through a smooth penalty function, ensuring strict convexity and global optimality while naturally maintaining inter-agent connectivity without rigid formations. Simulation results demonstrate that the proposed method effectively keeps agents within communication range, improving coverage quality and convergence speed compared to methods without explicit connectivity enforcement.
|
| |
| WeB19 Regular Session, Churchill B1 |
Add to My Program |
| Optimal Control I |
|
| |
| Chair: Stein, Adrian | Louisiana State University |
| Co-Chair: Ghorbanpour, Amin | Duquesne University |
| |
| 13:30-13:45, Paper WeB19.1 | Add to My Program |
| A Geometric Optimal Control Algorithm for Navigation through Heterogeneous Regions |
|
| Oradiambalam Sachidanandam, Sarjana | University of Maryland |
| Diaz-Mercado, Yancy | University of Maryland |
Keywords: Optimal control, Autonomous robots, Robotics
Abstract: Autonomous robots often require the ability to autonomously navigate in heterogeneous regions. The most efficient and the safest path, i.e., the least cost path in these scenarios, may no longer be the same as the shortest path. Optimal control approaches can allow incorporating these considerations into the controller design. However, they can be computationally costly to solve. In this paper, we propose a navigation algorithm that integrates geometrical approaches with optimal control to enable efficient robot navigation across heterogeneous terrains. The work introduces a system representation that exploits the underlying geometrical properties of the environment and the robot dynamics to formulate the optimal control problem. Through simulations, we demonstrate that this representation effectively captures system behavior and yields control outcomes consistent with theoretical expectations. Numerical results validate the effectiveness of the proposed approach: the model captures key aspects of system behavior, and drives the agent, consistently, toward low-cost regions. When combined with a shooting method approach, the model facilitates solving the optimal control problem, effectively driving the agent to any desired goal location in the domain.
|
| |
| 13:45-14:00, Paper WeB19.2 | Add to My Program |
| Online Intention Prediction Via Control-Informed Learning |
|
| Zhou, Tianyu | Purdue University |
| Liang, Zihao | Purdue University |
| Lu, Zehui | Independent Researcher |
| Mou, Shaoshuai | Purdue University |
Keywords: Optimal control, Autonomous systems, Estimation
Abstract: This paper presents an online intention prediction framework for estimating the goal state of autonomous systems in real time, even when intention is time-varying, and system dynamics or objectives include unknown parameters. The problem is formulated as an inverse optimal control / inverse reinforcement learning task, with the intention treated as a parameter in the objective. A shifting horizon strategy discounts outdated information, while online control-informed learning enables efficient gradient computation and online parameter updates. Simulations under varying noise levels and hardware experiments on a quadrotor drone demonstrate that the proposed approach achieves accurate, adaptive intention prediction in complex environments
|
| |
| 14:00-14:15, Paper WeB19.3 | Add to My Program |
| Differential Flatness Based Path Planning Around Obstacles of a 2D Gantry Crane |
|
| Mountain, Eric | University at Buffalo |
| Stein, Adrian | Louisiana State University |
| Maxime, Jesse | University at Buffalo |
| Singh, Tarunraj | State Univ. of New York at Buffalo |
Keywords: Optimal control, Control applications, Flexible structures
Abstract: This paper presents a method of obstacle avoidance for a nonlinear gantry crane model using differential flatness. The crane motion is two dimensional, allowing for lateral motion of the gantry, and hoisting motion of the payload. A unique method of synthesizing the obstacle avoiding trajectory is presented by defining a safe ``tube" for the point mass to pass through. Bernstein polynomials are used to parameterize the flat outputs. The point mass trajectory is discretized, and linear programming is utilized to solve for the polynomial coefficients that ensure the trajectory is within the tube. Simulations are generated to show the point mass trajectory and states over time, and experimental data is captured to validate these simulations. Furthermore, experimental data of a gantry crane following a rectangular motion profile is captured. This is used as a reference to illustrate the profound improvement in the vibration minimization of the flat trajectory.
|
| |
| 14:15-14:30, Paper WeB19.4 | Add to My Program |
| Output-Feedback Optimal Control of Heterogeneous Quadrotor Swarms with Communication Delays |
|
| Soleimani, Ehsan | Missouri University of Science and Technology |
| Jagannathan, Sarangapani | Missouri Univ of Science & Tech |
Keywords: Optimal control, Neural networks, Aerospace
Abstract: This paper presents a unified framework for optimal output-feedback control of heterogeneous quadrotor unmanned aerial vehicles (QUAVs) operating in coordinated formations under communication delays, without requiring exact system dynamics. To address partial state observability, a multi-layer neural network (MNN) observer is developed to precisely estimate unmeasured states. Reinforcement learning (RL) is employed to achieve optimal control using an MNN critic that ensures adaptability. Together, the MNN observer and RL relax the need for explicit system dynamics, enabling the framework to be applied to heterogeneous QUAVs with differing masses and parameters. A Lyapunov–Krasovskii functional is incorporated into the learning process to compensate for communication delays, which arise either in transmitting commands from the ground station to the leader or in relaying leader information to the followers. Such delays, as a non-stationary source, can otherwise cause instability and degrade formation performance. A three-dimensional leader–follower formation strategy expressed in spherical coordinates enables efficient maneuvering. Theoretical analysis establishes closed-loop stability, and simulation studies validate the effectiveness of the proposed framework in ensuring optimal, stable, and adaptive formation control of heterogeneous QUAVs under communication delays.
|
| |
| 14:30-14:45, Paper WeB19.5 | Add to My Program |
| Model Predictive Control for Energy-Optimal Regenerative Braking in Autonomous Electric Vehicle |
|
| Ghorbanpour, Amin | Duquesne University |
| Baldwin, Stephen | Duquesne University |
Keywords: Optimal control, Control applications, Differential-algebraic systems
Abstract: This paper models and optimizes the braking phase of an autonomous electric vehicle (AEV) with the goal of maximizing regenerative energy while maintaining tire-road safety and passenger comfort. The longitudinal drivetrain dynamics are formulated as an index-1 differential-algebraic equation (DAE), and its ODE reduction is used to pose a constrained optimal control problem. Wheel Slip ratios are regularized for low speed and penalized with a hinge-squared (ReLU) term to discourage excessive slip. A nonlinear model predictive control (NMPC) scheme is then developed to compute optimal braking actions while respecting key physical and safety constraints. These include limits on the maximum practical braking force, bounds on passenger comfort expressed through limits on longitudinal acceleration, and a no-late-acceleration condition that prevents acceleration during the braking maneuver. The resulting optimal control problem is discretized using a direct multiple-shooting method that is solved using the CasADi optimization framework. We establish the existence of optimal solutions for the continuous DAE-based optimal control problem under standard index-1 regularity and compactness assumptions, and we note the per-step existence of minimizers for the discretized NMPC problems. Simulation studies illustrate that the controller harvests energy while keeping slip within prescribed bounds.
|
| |
| 14:45-15:00, Paper WeB19.6 | Add to My Program |
| Optimal Path Planning for Wheel Loader Automation Via Hybrid Physics-Informed Data-Driven Soil-Tool Interaction Modeling |
|
| Abdolmohammadi, Armin | University of California, Davis |
| Mojahed Baghbadorani, Navid | University of California, Davis |
| Ravani, Bahram | University of California, Davis |
| Nazari, Shima | University of California, Davis |
Keywords: Optimal control, Predictive control for nonlinear systems, Nonlinear systems identification
Abstract: Earthmoving operations with wheel loaders require substantial power and incur high operational costs. This letter presents an efficient automation framework based on a physics-informed, data-driven soil-tool interaction model. A reduced-order multi-step parameter estimation method, guided by the Fundamental Earthmoving Equation (FEE), is deployed for excavation force estimation. An optimal control problem is then formulated to compute energy-efficient bucket trajectories using soil parameters identified in the previous digging cycle. The results are validated using Algoryx Dynamics physics-based digital model of a wheel loader. It is shown that up to 40% energy saving is possible during the excavation phase compared to typical operator executed paths. Furthermore, the total computation time is comparable to a single digging cycle, highlighting the framework’s potential for real-time, energy-optimized wheel loader automation.
|
| |
| WeB20 Invited Session, Churchill B2 |
Add to My Program |
| Set-Based Methods in Dynamic Systems and Control II |
|
| |
| Chair: Jain, Neera | Purdue University |
| Co-Chair: Ruths, Justin | University of Texas at Dallas |
| Organizer: Koeln, Justin | University of Texas at Dallas |
| Organizer: Pangborn, Herschel | The Pennsylvania State University |
| Organizer: Jain, Neera | Purdue University |
| Organizer: Ruths, Justin | University of Texas at Dallas |
| |
| 13:30-13:45, Paper WeB20.1 | Add to My Program |
| Efficient Norm-Based Reachable Sets Via Iterative Dynamic Programming (I) |
|
| Harapanahalli, Akash | Georgia Institute of Technology |
| Coogan, Samuel | Georgia Institute of Technology |
Keywords: Formal verification/synthesis, Optimal control, Optimization
Abstract: In this work, we present a numerical optimal control framework for reachable set computation using normotopes, a new set representation as a norm ball with a shaping matrix. In reachable set computations, we expect to continuously vary the shape matrix along trajectories. Incorporating the shape dynamics as an input, we build a controlled embedding system using a linear differential inclusion bounding the dynamics of the system, where a single forward simulation of this embedding system always provides an overapproximating reachable set of the system, no matter the choice of hypercontrol. By iteratively solving a linear quadratic approximation of the nonlinear optimal hypercontrol problem, we synthesize less conservative final reachable sets, providing a natural tradeoff between runtime and accuracy. Terminating our algorithm at any point always returns a valid overapproximating reachable set.
|
| |
| 13:45-14:00, Paper WeB20.2 | Add to My Program |
| Exact Representation Complexity Reduction for Constrained Zonotopes with Applications to Dynamic Systems and Control (I) |
|
| Dulal, Aayush | University of Texas Dallas |
| Koeln, Justin | University of Texas at Dallas |
Keywords: Computational methods, Numerical algorithms
Abstract: Zonotopes and constrained zonotopes have become one of the most widely used set representations for the analysis and design of dynamic systems and their controllers. While a linear increase in set representation complexity of zonotopic sets can represent an exponential increase in set complexity in terms of vertices and facets, practical application of set-based methods is still limited by the effects of set representation complexity growth with repeated set operations. This paper presents the characterization of several sources of redundancy in zonotope and constrained zonotope set representations and algorithms for representation complexity reduction while preserving the set. First, formal characterizations of minimal, or irredundant, zonotopic representations and criteria for redundancy and representation complexity reduction are provided. Then, a proposed algorithm for redundancy detection and removal is presented and demonstrated to provide a practical balance between computational costs and the achieved reduction for several numerical examples, including robust controllable sets and sets produced by the domain partitioning of a ReLU neural network.
|
| |
| 14:00-14:15, Paper WeB20.3 | Add to My Program |
| Computationally Efficient State and Model Estimation Via Interval Observers for Partially Unknown Systems (I) |
|
| Khajenejad, Mohammad | The University of Tulsa |
| Jin, Zeyuan | Arizona State University |
Keywords: Estimation, Learning, Observers for nonlinear systems
Abstract: This paper addresses the synthesis of interval observers for partially unknown nonlinear systems subject to bounded noise, aiming to simultaneously estimate system states and learn a model of the unknown dynamics. Our approach leverages Jacobian sign-stable (JSS) decompositions, tight decomposition functions for nonlinear systems, and a data-driven over-approximation framework to construct interval estimates that provably enclose the true augmented states. By recursively computing tight and tractable bounds for the unknown dynamics based on current and past interval framers, we systematically integrate these bounds into the observer design. Additionally, we formulate semi-definite programs (SDP) for observer gain synthesis, ensuring input-to-state stability and optimality of the proposed framework. Finally, simulation results demonstrate the computational efficiency of our approach compared to a method previously proposed by the authors.
|
| |
| 14:15-14:30, Paper WeB20.4 | Add to My Program |
| Enhanced Set-Based State Estimation Via Dependency Preservation (I) |
|
| Wagner, Jonas | University of Texas at Dallas |
| Ruths, Justin | University of Texas at Dallas |
Keywords: Uncertain systems, Observers for Linear systems, Estimation
Abstract: This paper introduces a dependency-preserving set-based observer which can improve state estimation by accounting for correlation which is regularly ignored by traditional set-based observers. A proposed state estimation method is motivated using a DC motor example where the measurement noise depends on the system noise. Further applications where correlation arises are discussed, e.g., using autoregressive models or closed-loop feedback within reachability forecasting. Throughout the paper zonotopic sets are used to illustrate the dependency preservation within the parameterized factor space, as well as to produce the numerical results.
|
| |
| 14:30-14:45, Paper WeB20.5 | Add to My Program |
| Improved Systematic Interval Observers for Bounded-Error LTI Systems (I) |
|
| Gedefaw, Elisabeth | University of Tulsa |
| Mareddu, Siva Rohit | University of Tulsa |
| Khajenejad, Mohammad | The University of Tulsa |
Keywords: Estimation, Observers for Linear systems, Uncertain systems
Abstract: This paper presents an improved interval observer design for linear time-invariant (LTI) continuous-time (CT) and discrete-time (DT) systems with bounded uncertainties by integrating a systematic coordinate transformation approach with a multiple-gain structure. The proposed method relaxes potentially restrictive Metzler/positivity assumptions typically required in coordinate transformation-based designs. To further reduce conservatism, particularly in approaches that rely solely on solving the Sylvester equation for the transformation, the observer gains are optimized by minimizing the L1-norm of the error dynamics. The effectiveness of the proposed interval observer is demonstrated through several DT and CT examples, with comparisons against two benchmark designs.
|
| |
| 14:45-15:00, Paper WeB20.6 | Add to My Program |
| A Reachability-Oriented Approach to Linear System Identification under Unknown Bounded Noise (I) |
|
| Jeevanandam, Sibibalan | Purdue University |
| Jain, Neera | Purdue University |
Keywords: Identification for control, Linear systems, Uncertain systems
Abstract: This paper presents a reachability-oriented approach to system identification for linear time-invariant (LTI) systems subject to unknown‑but‑bounded noise. Unlike traditional methods that assume prior knowledge of the noise bounds or distribution, the proposed method jointly estimates the system matrices and the smallest convex noise set that guarantees containment of the observed trajectories by the predicted reachable sets. Our main result shows that, for any set-size measure that is monotone with respect to inclusion and is invariant for a tight convex over-approximating set, the noise set need not be a decision variable. Instead, an optimal noise set is analytically recoverable from the optimal system matrices alone, reducing identification to an optimization solely over the system matrices. We also establish asymptotic convergence of the parameter estimates under i.i.d., bounded noise with full support and under persistent excitation and show that all system matrices converge to their true values when the noise set-size is measured by the p-norm of its axis-aligned widths, for any finite p. Monte Carlo experiments under uniform, truncated Gaussian, and truncated exponential noise demonstrate tighter noise sets than those inferred from ordinary least-squares residuals, with comparable parameter bias but slightly lower coverage of test trajectories by the predicted reachable sets, thereby highlighting the tradeoff between set tightness and conservatism.
|
| |
| WeB21 Regular Session, Churchill C1 |
Add to My Program |
| Optimization II |
|
| |
| Chair: Paternain, Santiago | Rensselaer Polytechnic Institute |
| Co-Chair: Nash, Austin | Rose-Hulman Institute of Technology |
| |
| 13:30-13:45, Paper WeB21.1 | Add to My Program |
| Policy Optimization in Robust Control: Weak Convexity and Subgradient Methods (I) |
|
| Watanabe, Yuto | University of California, San Diego |
| Liao, Feng-Yi | University of California San Diego |
| Zheng, Yang | University of California San Diego |
Keywords: H-infinity control, Optimization algorithms, Reinforcement learning
Abstract: Robust control seeks stabilizing policies that perform reliably under adversarial disturbances, with H-infinity control as a classical formulation. It is known that policy optimization of robust H-infinity control naturally leads to nonsmooth and nonconvex problems. This paper builds on recent advances in nonsmooth optimization to analyze the discrete-time static output-feedback H-infinity control. We show that the H-infinity cost is weakly convex over any convex subset of a sublevel set. This structural property allows us to establish the first non-asymptotic deterministic convergence rate for the subgradient method under suitable assumptions. In addition, we prove a weak Polyak-Lojasiewicz (PL) inequality in the state-feedback case, implying that all stationary points are globally optimal. We finally present a few numerical examples to validate the theoretical results.
|
| |
| 13:45-14:00, Paper WeB21.2 | Add to My Program |
| Topology-Control-Based Moving Target Defense against False Data Injection in Distribution Networks |
|
| Ding, Lei | Shanghai Jiao Tong University |
| Long, Chengnian | Shanghai Jiao Tong University |
| Wu, Jing | Shanghai Jiao Tong University |
Keywords: Optimization, Power systems
Abstract: False Data Injection (FDI) attack is one of critical threats to the reliability of power systems, where moving target defense (MTD) is an effective proactive detection defense strategy. Existing MTD studies have primarily concentrated on transmission networks through Distributed Flexible AC Transmission System (D-FACTS) devices. However, those methods cannot be extended to distribution networks due to significantly limited D-FACTS deployment. To detect FDI attacks in distribution networks, a topology-control-based moving target defense (TC-MTD) approach is proposed. Unlike D-FACTSbased MTD that alters line impedances, TC-MTD perturbs the system topology through switch operations and the rank of the admittance matrix difference is employed as a performance metric which indicates the extent of perturbation. Furthermore, a heuristic algorithm is developed to reconfigure the system topology, aiming to maximize detection performance and minimize power loss with the fewest switch operations. Case studies conducted on both modified and standard distribution networks validate the effectiveness of TC-MTD.
|
| |
| 14:00-14:15, Paper WeB21.3 | Add to My Program |
| Generalized Multi-Constraint Extremum Seeking |
|
| Williams, Alan | Los Alamos National Laboratory |
| Cortes, Jorge | UC San Diego |
| Scheinker, Alexander | Los Alamos National Lab |
Keywords: Optimization algorithms, Adaptive control
Abstract: We generalize the Safe Extremum Seeking algorithm to address the minimization of an unknown objective function subject to multiple unknown inequality and equality constraints, relying on recent results of gradient flow systems. These constraints may represent safety or other critical conditions. The proposed ES algorithm functions as a general nonlinear programming tool, offering practical maintenance of all constraints and semiglobal practical asymptotic stability, utilizing a Lyapunov argument on the penalty function and the set-valued Lie derivative. The efficacy of the algorithm is demonstrated on a 2D problem.
|
| |
| 14:15-14:30, Paper WeB21.4 | Add to My Program |
| Robust Control Co-Design with Experimental Validation for an Electromechanical System |
|
| Nash, Austin | Rose-Hulman Institute of Technology |
| Morehouse, James | Rose-Hulman Institute of Technology |
Keywords: Optimization, Robust control, Predictive control for linear systems
Abstract: Control co-design (CCD) broadens the performance envelope for engineering systems by considering the notion of feedback control at the system (plant) design stage. Because CCD is a model-based design and control framework, quantifying and enabling robustness to uncertainty is critically important. While recent research has focused on developing CCD frameworks that incorporate notions of uncertainty, validation has largely been limited to model-based simulation. This work experimentally validates a robust CCD framework with a mass-spring cart system actuated by a controllable drive motor. Separate systems are co-designed to generate physical systems with maximum mass and minimum required control effort, with each system capable of meeting position tracking constraints in the presence of disturbance uncertainty introduced through a separate disturbance motor. The resulting systems are built and tested experimentally with results showing that (1) the experimental system responses to a baseline (planned) disturbance closely match the predictions from the robust CCD optimization process, and (2) the experimental system responses are robust to bounded disturbance uncertainties. This works represents an important step toward bringing robust CCD into engineering practice.
|
| |
| 14:30-14:45, Paper WeB21.5 | Add to My Program |
| PAD-TRO: Projection-Augmented Diffusion for Direct Trajectory Optimization |
|
| Chen, Jushan | Rensselaer Polytechnic Institute |
| Paternain, Santiago | Rensselaer Polytechnic Institute |
Keywords: Optimization, Robotics, Stochastic systems
Abstract: Recently, diffusion models have gained popularity and attention in trajectory optimization due to their capability of modeling multi-modal probability distributions. However, addressing nonlinear equality constraints, i.e, dynamic feasibility, remains a great challenge in diffusion-based trajectory optimization. Recent diffusion-based trajectory optimization frameworks rely on a single-shooting style approach where the denoised control sequence is applied to forward propagate the dynamical system, which cannot explicitly enforce constraints on the states and frequently leads to sub-optimal solutions. In this work, we propose a novel direct trajectory optimization approach via model-based diffusion, which directly generates a sequence of states. To ensure dynamic feasibility, we propose a gradient-free projection mechanism that is incorporated into the reverse diffusion process. Our results show that, compared to a recent state-of-the-art baseline, our approach leads to zero dynamic feasibility error and approximately 4x higher success rate in a quadrotor waypoint navigation scenario involving dense static obstacles.
|
| |
| 14:45-15:00, Paper WeB21.6 | Add to My Program |
| A Multi-Rate Feedback Control System to Model and Analyze Distributed Algorithms with Error Feedback |
|
| Liang, Liyuan | University of California, Berkeley |
| Zhang, Xinwei | Amazon.com |
| Elia, Nicola | University of Minnesota |
| Hong, Mingyi | University of Minnesota |
Keywords: Optimization algorithms, Large-scale systems, Nonlinear output feedback
Abstract: Error Feedback (EF)—also known as error compensation—has emerged as a fundamental technique for correcting biases that arise from quantization and compression operations, thereby enabling efficient communication in distributed systems. While numerous algorithms have recently been proposed that combine various implementations of EF with different base distributed algorithms, a unified theoretical framework for understanding these approaches has remained elusive. This paper addresses this gap by leveraging multi-rate feedback control theory to establish a comprehensive foundation for EF mechanisms in distributed optimization. We demonstrate that when both the EF procedure and the distributed optimization process satisfy a set of easily verifiable conditions, their integration yields a convergent algorithm with provable rates. Our theoretical framework not only simplifies the analysis of existing EF-based distributed algorithms but, more significantly, provides a systematic approach for designing new algorithms. This unified perspective bridges the gap between theoretical understanding and practical implementation, advancing the field of distributed optimization.
|
| |
| WeB22 Invited Session, Churchill C2 |
Add to My Program |
| Estimation and Control of Distributed Parameter Systems II |
|
| |
| Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
| Co-Chair: Hu, Weiwei | University of Georgia |
| Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
| Organizer: Hu, Weiwei | University of Georgia |
| |
| 13:30-13:45, Paper WeB22.1 | Add to My Program |
| Design of Restricted Kalman Filters for Infinite Dimensional Systems with Moving Sensors (I) |
|
| Demetriou, Michael A. | Worcester Polytechnic Institute |
| Hu, Weiwei | University of Georgia |
Keywords: Distributed parameter systems, Kalman filtering
Abstract: This paper presents conditions under which a Kalman filter designed for a parabolic PDE over a larger spatial domain will have a filter kernel with spatial support over a prescribed subdomain contained in the spatial domain of the PDE. This comes from the implementation of a domain decomposition filter for PDEs and aims at minimizing filter design complexity and reducing computational costs. When an optimal filter designed over the larger spatial domain has a kernel that vanishes outside a specific subdomain, then the resulting domain decomposition filter becomes optimal since the solution to the Kalman filter over the entire spatial domain and that over the smaller subdomain will be identical.
|
| |
| 13:45-14:00, Paper WeB22.2 | Add to My Program |
| Boundary Control of Infinite Horizon Semilinear Parabolic Equations (I) |
|
| Casas Rentería, Eduardo | Universidad De Cantabria |
| Kunisch, Karl | University of Graz |
Keywords: Distributed parameter systems
Abstract: This work concentrates on a class of Neumann optimal control problems for semilinear parabolic equations on an infinite horizon domain Q = Omega times (0,infty) subject to a control constraint of the form alpha le u(x,t) le beta for (x,t) in Gamma times (0,infty), where Gamma is the boundary of Omega. Existence of a solution, first- and second-order optimality conditions are established. Finally, the approximation of the solution by finite horizon control problems is addressed and some error estimates are provided.
|
| |
| 14:00-14:15, Paper WeB22.3 | Add to My Program |
| Optimal Control in Poroelastic Systems Via E-Radiality Theory (I) |
|
| Bociu, Lorena | North Carolina State University |
| Alalabi, Ala' | University of Waterloo |
Keywords: Optimal control, Distributed parameter systems, Differential-algebraic systems
Abstract: Fluid flows through deformable, porous media arise in numerous applications, ranging from geomechanics to biomechanics. These processes are governed by coupled systems of partial differential equations (PDEs) that describe the interaction between fluid transport and mechanical deformation. A widely adopted framework for such systems is Biot's theory of poroelasticity, which leads, under quasi-static assumptions, to a system of coupled parabolic-elliptic PDEs, falling under the umbrella of implicit, degenerate evolution equations. While the existence and regularity theory for such systems is well established, optimal control problems constrained by poroelastic dynamics, especially relevant in biomedical contexts where internal pressure and displacement must be regulated, have only recently been considered. These control problems are challenging due to the degeneracy of the equations, the implicit nature of the coupling, and the structure of the control-to-state map. In this work, we reinterpret the coupled flow-deformation problem as an implicit dynamical system on Hilbert spaces and analyze it using E-radiality theory - a generalization of the classical Hille-Yosida framework for non-explicit evolution equations. This abstract formulation yields new insights into well-posedness theory and enables a unified treatment of linear control problems constrained by degenerate evolution dynamics.
|
| |
| 14:15-14:30, Paper WeB22.4 | Add to My Program |
| Discrete-Time Luenberger Observer and Kalman Filter Design for Non-Isothermal Pipeline Systems (I) |
|
| Khari, Safa | University of Alberta |
| Koch, Charles Robert | University of Alberta |
| Dubljevic, Stevan | University of Alberta |
Keywords: Estimation, Kalman filtering, Distributed parameter systems
Abstract: This work develops a non-isothermal pipeline flow model, with an emphasis on state estimation through the Kalman filter and Luenberger observer design. A nonlinear infinite-dimensional model is derived for flow velocity, pressure, density, and temperature based on nonlinear coupled hyperbolic partial differential equations (PDEs). The model is linearized around an operational steady state, and the Cayley–Tustin transformation is used for temporal discretization. This approach preserves essential system properties such as controllability, observability, and stabilizability without requiring spatial discretization or model reduction. The resolvent operator is then determined, enabling the formulation of a Kalman filter for the infinite-dimensional pipeline model in the presence of process and measurement noise, and a Luenberger observer for state reconstruction in the noise-free process. The performance of both estimation strategies is validated through numerical simulations, demonstrating their effectiveness for pipeline flow monitoring.
|
| |
| 14:30-14:45, Paper WeB22.5 | Add to My Program |
| Distributed Attraction-Repulsion Potential for Multi-Agents Formation Control (I) |
|
| Ban, Hemanta | University of Tennessee |
| Djouadi, Seddik, M. | University of Tennessee |
| Tomsovic, Kevin | Clemson University |
Keywords: Distributed control, Distributed parameter systems, Networked control systems
Abstract: In this paper, a distributed multi-agent formation control driven the gradient of the Lennard-Jones potential is analyzed. For a collision-free initial data we prove global well-posedness together with a uniform lower bound on all inter-agent distances, thereby excluding hard collisions. Taking the total energy as a Lyapunov function, LaSalle’s invariance principle shows that every positive limit point is an equilibrium. Since trajectories remain uniformly away from collisions, the energy is analytic along the flow and an argument yields convergence to a single equilibrium modulo translations. Local exponential rate of convergence is determined explicitly. Illustrative numerical examples are presented.
|
| |
| 14:45-15:00, Paper WeB22.6 | Add to My Program |
| Reachability Analysis for Design Optimization (I) |
|
| Nguyen, Steven | University of California, San Diego |
| Cortes, Jorge | UC San Diego |
| Kramer, Boris | University of California San Diego |
Keywords: Aerospace, Optimization, Emerging control applications
Abstract: We present an approach to approximate reachable sets for linear systems with bounded L-infinity controls in finite time. Our first approach investigates the boundaries of these sets and reveals an exact characterization for single-input, planar systems with real, distinct eigenvalues. The second approach leverages convergence of the Lp-norms to L-infinity and uses Lp-norm reachable sets as an approximation of the L-infinity-norm reachable sets. Our optimal control results yield insights that make computational approximations of the Lp-norm reachable sets more tractable, and yield exact characterizations for L-infinity with the previous assumptions on the system. As an example, we incorporate our reachability analysis into the design optimization of a highly-maneuverable aircraft. Introducing constraints based on reachability allow us to factor physical limitations to desired flight maneuvers into the design process.
|
| |
| WeC03 Tutorial Session, Grand Salon 3 |
Add to My Program |
Energy-Based Dynamical Models for Neurocomputation, Learning, and
Optimization |
|
| |
| Chair: Bullo, Francesco | Univ of California at Santa Barbara |
| Co-Chair: Montanari, Arthur | Northwestern University |
| Organizer: Bullo, Francesco | Univ of California at Santa Barbara |
| Organizer: Krotov, Dmitry | MIT-IBM Watson AI Lab |
| Organizer: Montanari, Arthur | Northwestern University |
| Organizer: Motter, Adilson E. | Northwestern University |
| |
| 15:30-17:00, Paper WeC03.1 | Add to My Program |
| Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization (I) |
|
| Montanari, Arthur N. | Northwestern University |
| Bullo, Francesco | Univ of California at Santa Barbara |
| Krotov, Dmitry | Independent Researcher |
| Motter, Adilson E. | Northwestern University |
Keywords: Learning, Biologically-inspired methods, Neural networks
Abstract: Recent advances at the intersection of control theory, neuroscience, and machine learning have revealed novel mechanisms by which dynamical systems perform computation. These advances encompass a wide range of conceptual, mathematical, and computational ideas, with applications for model learning and training, memory retrieval, data-driven control, and optimization. This tutorial focuses on neuro-inspired approaches to computation that aim to improve scalability, robustness, and energy efficiency across such tasks, bridging the gap between artificial and biological systems. Particular emphasis is placed on energy-based dynamical models that encode information through gradient flows and energy landscapes. We begin by reviewing classical formulations, such as continuous-time Hopfield networks and Boltzmann machines, and then extend the framework to modern developments. These include dense associative memory models for high-capacity storage, oscillator-based networks for large-scale optimization, and proximal-descent dynamics for composite and constrained reconstruction. The tutorial demonstrates how control-theoretic principles can guide the design of next-generation neurocomputing systems, steering the discussion beyond conventional feedforward and backpropagation-based approaches to artificial intelligence.
|
| |
| WeC04 Regular Session, Grand Salon 4 |
Add to My Program |
| Spacecraft Control |
|
| |
| Chair: Taheri, Ehsan | Auburn University |
| Co-Chair: Soderlund, Alexander | The Ohio State University |
| |
| 15:30-15:45, Paper WeC04.1 | Add to My Program |
| Comparison of Gain Matrix Parameterizations for Nonlinear Spacecraft Attitude Control |
|
| Nurre, Nicholas | Auburn University |
| Taheri, Ehsan | Auburn University |
Keywords: Spacecraft control, Control applications, Optimization
Abstract: An important step in designing closed-loop controllers is the tuning of one or more square matrices constrained by definiteness, e.g., penalty or gain matrices. This work considers four different parameterizations of such a matrix: one is a (standard) diagonal matrix and, using spectral decomposition, the other three consider the larger set of "full" ("dense" or "non-diagonal") matrices. Definiteness can be enforced via simple box constraints on the decision variables of each parameterization. We compare the benefits of each parameterization for optimizing the gain matrices of a closed-loop nonlinear spacecraft attitude control law to minimize control effort. Several gradient-free global optimization algorithms are used for tuning and are also compared. Results show that improved performance with respect to control effort can be found by considering the full gain matrix parameterizations. Furthermore, the choice of parameterization for the orthogonal eigenvector matrix, in the full matrix parameterization, can significantly impact the tuning outcome.
|
| |
| 15:45-16:00, Paper WeC04.2 | Add to My Program |
| Efficient Stabilization of Hybrid Coulomb Spacecraft Formations Using Control Lyapunov Functions |
|
| Tahir, Adam | University of Washington |
Keywords: Spacecraft control, Lyapunov methods
Abstract: A control allocation algorithm using control Lyapunov functions to determine stabilizing charges and thrusts of hybrid Coulomb spacecraft formations (HCSFs) is presented. The goal is to stabilize a desired configuration while minimizing the thruster actuation and maximizing Coulomb actuation to minimize propellant usage. A proportion of the decrease of the control Lyapunov function is designated for Coulomb actuation and the rest is performed by thrusters. Simulations show that an 85% reduction of propellant compared to using solely thrusters is attainable using the proposed algorithm. It is shown that the best role for thrusters in a HCSF is to provide small corrections that cannot be provided by Coulomb actuation.
|
| |
| 16:00-16:15, Paper WeC04.3 | Add to My Program |
| Real-Time Retargeting Using Controllability Boundary for Chandrayaan-3 Lunar Landing |
|
| Kumar, Suraj | U R Rao Satellite Center, Indian Space Research Organization |
| Chakrabarti, Debajyoti | University of California, Los Angeles |
| Rallapalli, Aditya | UR Rao Satellite Centre |
| Bharat Kumar, Gvp | U R Rao Satellite Centre , Isro |
| Ashok Kumar, Kakula | U R Rao Satellite Centre(ursc) , Isro |
Keywords: Spacecraft control, Optimization, Machine learning
Abstract: This paper presents the real-time retargeting guidance policy developed for the Chandrayaan-3 lunar landing mission. The baseline guidance generates approximate fuel-optimal descent trajectories, while a high-level policy enables safe retargeting to alternate sites when the nominal site becomes infeasible. The retargeting strategy leverages a convex representation of the controllability boundary, allowing rapid feasibility checks and real-time target updates. To the best of the authors’ knowledge, this represents the first application of a data-driven retargeting framework in an operational lunar landing mission. Pre-flight simulations and Chandrayaan-3 flight results validate the effectiveness of the proposed approach.
|
| |
| 16:15-16:30, Paper WeC04.4 | Add to My Program |
| Nonlinear MPC for Cislunar Trajectory Correction under Impulsive Maneuver Uncertainties: A Computational Performance Analysis |
|
| Yamamoto, Koya | Texas A&M Univeristy |
| Taheri, Ehsan | Auburn University |
| Junkins, John L. | Texas A&M University |
Keywords: Predictive control for nonlinear systems, Optimization, Spacecraft control
Abstract: This work studies nonlinear model predictive control (NMPC) for correcting cislunar transfer trajectories under impulsive maneuver uncertainties. By combining pre-planned impulsive changes with low-thrust NMPC corrections in the CR3BP, the method robustly compensates for execution errors. A maneuver between southern L2 Halo orbits in the Earth-Moon system is considered. A computational comparison of IPOPT and MATLAB's fmincon solvers shows IPOPT is up to 2 times faster. NMPC achieves sub-kilometer tracking errors and a total Delta v tracking cost around 315 m/s, demonstrating an effective and practical guidance for future cislunar missions.
|
| |
| 16:30-16:45, Paper WeC04.5 | Add to My Program |
| Model Predictive Control of Collinear Coulomb Spacecraft Formations |
|
| Tahir, Adam | University of Washington |
Keywords: Spacecraft control, Predictive control for nonlinear systems, Optimal control
Abstract: A model predictive control scheme to stabilize desired configurations of collinear Coulomb spacecraft formations is derived in this paper. The nonlinearities of the dynamics with respect to the input make this problem difficult to solve, computationally. It is shown that the nonlinearities in the input lead to a finite horizon optimization problem which is a nonconvex quadratically-constrained quadratic program (QCQP). A convex relaxation of the nonconvex QCQP is therefore derived which can be solved quickly using a convex optimization solver. A simulation of a four spacecraft formation is provided which demonstrates why optimizing over a prediction horizon is a prudent approach to Coulomb spacecraft formation control.
|
| |
| 16:45-17:00, Paper WeC04.6 | Add to My Program |
| Stochastic Optimal Control with Weighted Terminal Accuracy under Control-Linear Noise |
|
| Amarell, Kristen I. | The University of Colorado Boulder |
| Scheeres, Daniel J. | The University of Colorado Boulder |
| McMahon, Jay W. | The University of Colorado Boulder |
Keywords: Spacecraft control, Stochastic optimal control, Linear systems
Abstract: Space missions involving launch, rendezvous, or orbit insertion often require precise targeting of specific state components, such as position or velocity along preferred directions, while tolerating dispersion in others. This motivates control strategies that differentially penalize deviations across the state. We address this need by minimizing the weighted mean squared deviation from a target state for a stochastic linear system with control-linear noise and noiseless, full-state measurements at specified times. Unlike prior unweighted formulations, this approach enables mission-specific prioritization of state components via reachability theory, promoting feasibility in nonlinear launch-to-rendezvous scenarios. We use an approximation of the controllable set as the terminal weighting matrix, improving rendezvous feasibility and guiding neighboring trajectories about a nonlinear launch profile. A complete solution to this optimization problem is presented, in which the optimal feedback control law is linear in the initial state, the target state, and the sampled measurements. The feedback gain matrices can be computed offline, reducing the onboard computational cost. This paper presents the following innovations to existing methods: the addition of a terminal state weighting matrix to the cost function and the application as a reachability-informed neighboring guidance algorithm about a launch trajectory under nonlinear control.
|
| |
| WeC05 Tutorial Session, Grand Salon 6 |
Add to My Program |
| Decision and Control in Sports Analytics |
|
| |
| Chair: Archibald, Christopher | Brigham Young University |
| Co-Chair: Grimsman, David | Brigham Young University |
| Organizer: Grimsman, David | Brigham Young University |
| Organizer: Melville, William | Texas Rangers |
| Organizer: Archibald, Christopher | Brigham Young University |
| Organizer: Nestler, Scott | University of Florida |
| |
| 15:30-17:00, Paper WeC05.1 | Add to My Program |
| Decision and Control in Sports Analytics: A Tutorial (I) |
|
| Grimsman, David | Brigham Young University |
| Nestler, Scott | University of Florida |
| Archibald, Christopher | Brigham Young University |
| Melville, William | Texas Rangers |
Keywords: Game theory, Markov processes, Emerging control applications
Abstract: This tutorial discusses the intersection between control theory, decision-making, and sports analytics. Its purpose is to motivate researchers to engage with open problems in sports, demonstrate how foundational tools from controls and optimization can be applied to these domains, and provide practical guidance for entering the field. The paper covers: (1) the formulation of sports decision problems as sequential decision-making problems; (2) the evolving landscape of sports analytics data and its implications for research and education; (3) modeling athlete and team decision processes through execution error and uncertainty quantification; and (4) applications of game theory to baseball strategy and managerial decision-making. Together, these sections highlight how well-known methods can be adapted to model, evaluate, and improve strategic behavior in sports. The paper concludes with several open research questions in the area.
|
| |
| WeC06 Regular Session, Grand Salon 7 |
Add to My Program |
| Game Theory III |
|
| |
| Chair: Anderson, Brendon G. | California Polytechnic State University |
| Co-Chair: Dayanikli, Gokce | University of Illinois Urbana-Champaign |
| |
| 15:30-15:45, Paper WeC06.1 | Add to My Program |
| Linear-Quadratic Mean Field Games with Multiple Heterogeneous Major Agents |
|
| Sharifi, Arman | University of Nevada, Reno |
| Xu, Hao | University of Nevada, Reno |
Keywords: Mean field games, Game theory, Optimal control
Abstract: This paper presents a framework for linear-quadratic (LQ) mean-field games with multiple heterogeneous major agents, deriving the coupled Riccati equations that define the Nash equilibrium. A key finding is that system complexity scales polynomially for heterogeneous agents but remains bounded for the special case of homogeneous agents, isolating heterogeneity as the primary driver of computational cost. We also introduce a Complexity-Adjusted Cost (CAC) framework to create a quantitative decision boundary between the tractable MFG equilibrium and the computationally intensive social welfare solution. Simulations validate our complexity analysis and demonstrate the CAC framework's utility for practical system design.
|
| |
| 15:45-16:00, Paper WeC06.2 | Add to My Program |
| Choice Paralysis in Evolutionary Games |
|
| Anderson, Brendon G. | California Polytechnic State University |
Keywords: Game theory, Mean field games
Abstract: In this paper, we consider finite-strategy approximations of infinite-strategy evolutionary games. We prove that such approximations converge to the true dynamics over finite-time intervals, under mild regularity conditions which are satisfied by classical examples, e.g., the replicator dynamics. We identify and formalize novel characteristics in evolutionary games: choice mobility, and its complement choice paralysis. Choice mobility is shown to be a key sufficient condition for the long-time limiting behavior of finite-strategy approximations to coincide with that of the true infinite-strategy game. An illustrative example is constructed to showcase how choice paralysis may lead to the infinite-strategy game getting "stuck," even though every finite approximation converges to equilibrium.
|
| |
| 16:00-16:15, Paper WeC06.3 | Add to My Program |
| Approximately Solving Continuous-Time Mean Field Games with Finite State Spaces |
|
| Eich, Yannick | Technische Universität Darmstadt |
| Fabian, Christian | Technische Universität Darmstadt |
| Cui, Kai | Technische Universität Darmstadt |
| Koeppl, Heinz | Technische Universitat Darmstadt |
Keywords: Mean field games, Game theory
Abstract: Mean field games (MFGs) offer a powerful framework for modeling large-scale multi-agent systems. This paper addresses MFGs formulated in continuous time with discrete state spaces, where agents' dynamics are governed by continuous-time Markov chains - relevant to applications like population dynamics and queueing networks. While prior research has largely focused on theoretical aspects of continuous-time discrete-state MFGs, efficient computational methods for determining equilibria remain underdeveloped. Inspired by discrete-time approaches, we approximate the classical Nash equilibria by regularization methods, enabling more computationally tractable solution algorithms. Specifically, we define regularized equilibria for continuous-time MFGs and extend the classical fixed-point iteration and fictitious play algorithm to these equilibria. We validate the effectiveness and practicality of our approach via illustrative numerical examples.
|
| |
| 16:15-16:30, Paper WeC06.4 | Add to My Program |
| Modeling Global Financial Stability through Policy Design: A Hierarchical Stackelberg Mean Field Game Perspective |
|
| Rathod, Prathmesh | University of Illinois at Urbana-Champaign |
| Dayanikli, Gokce | University of Illinois Urbana-Champaign |
Keywords: Mean field games, Optimal control, Optimization
Abstract: In this paper, we propose a hierarchical game theoretical model of systemic risk in interconnected banking networks. The model consists of three layers: local banks, central banks, and global regulators such as the International Monetary Fund (IMF). Local banks' model is represented through multi-population MFGs, whose Nash equilibria are characterized by coupled forward–backward stochastic differential equations and reduced to Riccati-type ODE systems. At the intermediate layer, we model the central banks that are interacting with local banks, possibly with each other, and also with the IMF through their policy guidelines. We characterize and analyze the multi-population Nash equilibrium between central banks and local banks. At the top level, we model the problem of IMF that acts as a Stackelberg leader interacting with both central banks and the local banks and optimizing global policies. After we present the theoretical results that include equilibrium characterization and existence and uniqueness results, we give a numerical algorithm and analyze the effects of different IMF objectives and also model parameters such as interaction levels between populations and central banks.
|
| |
| 16:30-16:45, Paper WeC06.5 | Add to My Program |
| Convexifying Mean-Field Control: An Occupation-Measure and Frank–Wolfe Approach |
|
| Yu, Di | Purdue University |
| You, Sixiong | Purdue University |
| Pei, Chaoying | Missouri University of Science and Technology |
Keywords: Mean field games, Optimization algorithms, Robotics
Abstract: Abstract— Large-scale robotic swarms motivate the use of mean-field control (MFC). Classical partial differential equation (PDE)-based formulations provide a principled framework but can become computationally challenging in higher dimensions, whereas machine learning achieves scalability at the cost of approximation and guarantees. In this work, we establish an optimization-based framework that lifts the MFC problem into the space of occupation measures, resulting in a convex relaxation formulated as an optimization over measures. The resulting problem is solved using a Frank–Wolfe (FW) algo- rithm in the measure space, with each iteration reduced to a tractable optimal control problem. This approach retains the O(1/k) convergence rate of FW, avoids discretization of the state space, and naturally incorporates interaction and safety constraints. Numerical experiments demonstrate agreement with analytic and PDE-based baselines in two dimensions and show that the method scales to three-dimensional environments with multiple obstacles, where standard grid-based PDE solvers become impractical. A full 3D instance with ten obstacles is solved in minutes on a standard workstation, underscoring the practicality and scalability of the proposed framework.
|
| |
| WeC07 Regular Session, Grand Salon 9 |
Add to My Program |
| Control Applications II |
|
| |
| Chair: Ghassani, Rashad | Schneider Electric |
| Co-Chair: Allen, Brendon C. | Auburn University |
| |
| 15:30-15:45, Paper WeC07.1 | Add to My Program |
| DC-Link Stability and Rectification Current Observer for AC Drives |
|
| Ghousein, Mohammad | ArianeGroup |
| Ghassani, Rashad | Schneider Electric |
Keywords: Electrical machine control
Abstract: The article addresses two problems: 1) modeling and stability of the DC-Link in motor drives and 2) observer design of the DC-Link rectification current. In the first part, we propose an original model of the DC-Link dynamics, including a virtual diode modeling the direction of the rectification current. We also study the impact of the equivalent series resistance (ESR) of the DC-Link capacitor on the stability of the DC-Link voltage. In the second part, we propose a time-varying observer to estimate the rectifier current. The observer is of Luenberger type with time-varying gains. We obtain sufficient conditions for the exponential decay of the estimation error towards a bounded interval. We then validate the theoretical results through numerical simulations.
|
| |
| 15:45-16:00, Paper WeC07.2 | Add to My Program |
| A Comparison of Triangulation and Multilateration Localization Techniques for Crane Cable Detection |
|
| Harkonen, Eemil | Georgia Institute of Technology |
| Zonnenberg, Alexander | Georgia Institute of Technology |
| Graham, Monica | Georgia Institute of Technology |
| Adams, Christopher | Georgia Institute of Technology |
| Singhose, William | Georgia Institute of Technology |
Keywords: Embedded systems, Mechatronics, Mechanical systems/robotics
Abstract: Crane payload liftoff operations require precise alignment of a crane's overhead trolley with the payload being lifted to ensure the hoisting cable is vertical. In cases where alignment is not achieved, an off-centered lift occurs, resulting in uncontrolled swinging of the payload. This has the potential to damage the crane's cable and/or payload, the crane itself, or cause serious injury or even the death of a nearby worker. This paper introduces a LiDAR-based small-scale localization solution for cable angle sensing. Cable angle sensing solutions exist, but they are expensive and are designed to work in well controlled environments. This paper discusses the design of an inexpensive cable angle sensing solution that can be deployed in varying environments. The efficacy of the employed multilateration and triangulation localization techniques are shown to achieve a maximum position error of 12mm.
|
| |
| 16:00-16:15, Paper WeC07.3 | Add to My Program |
| Feedback-Linearization Control for Psychrometric Testing Chambers |
|
| Kakani, Raghav | Purdue University |
| Shaikh, Adil Manzoor | Purdue University |
| Rhee, Seungho | Purdue University |
| Kim, Dohyeon | Purdue University |
| Kircher, Kevin | Purdue University |
Keywords: Feedback linearization, Building and facility automation, PID control
Abstract: Psychrometric test facilities – insulated chambers with air-handling units that provide heating, cooling, humidification, and dehumidification – are used to evaluate the performance of heating and cooling equipment. Accurate emulation of indoor and outdoor environments during equipment testing relies on effective psychrometric control. However, the psychrometric system is nonlinear, time-varying, and exhibits strong input–output coupling, limiting the effectiveness of conventional controllers across wide operating ranges. To address these challenges, this study proposes a MIMO nonlinear control strategy using feedback linearization. A simplified physics-based state-space model is employed to derive the feedback-linearization control law, which decouples interactions and compensates for nonlinearities. The control strategy is evaluated using a high-fidelity model calibrated to experimental data from a physical psychrometric chamber. In temperature and relative humidity control simulations, the feedback-linearization controller maintains overshoot within the specified constraint and achieves faster settling times than a well-tuned baseline controller, reducing the duration of a standard dynamic test by 80.2 minutes (22.3%).
|
| |
| 16:15-16:30, Paper WeC07.4 | Add to My Program |
| Saturated ARISE Control for Uncertain Lower Limb Hybrid-Exoskeleton Dynamics |
|
| Crapet, Joseph | Auburn University |
| Basyal, Sujata | Auburn University |
| Mishra, Kislaya | Auburn University |
| Ting, Jonathan | Auburn University |
| Allen, Brendon C. | Auburn University |
Keywords: Nonlinear output feedback, Robust adaptive control, Biotechnology
Abstract: Abstract--- Millions of Americans are affected by neurological conditions (NCs), which can be caused by strokes, Parkinson's disease, and more. NCs can negatively impact individuals by decreasing their muscle mass, motor control, and bone density, often as a result of physical inactivity. This deterioration leads to secondary health conditions such as obesity, cardiovascular disease, among others. There is an ongoing need to develop methods that help those afflicted by NCs to effectively recover. One rehabilitation method is to use a hybrid-exoskeleton, which combines the benefits of robot-assisted therapy with functional electrical stimulation (FES). Due to this combination, hybrid-exoskeletons use motors to assist or resist the motion of the user, ensuring proper performance of the rehabilitative exercises, while also implementing FES to ensure active participation from the user during the exercise, enhancing the overall rehabilitative effectiveness of the device. Although this rehabilitation approach has many benefits, the dynamics of a hybrid-exoskeleton are nonlinear and uncertain in nature. As a result, an ineffective controller design will generate a shaky, uncomfortable, or unstable motion, yielding ineffective rehabilitation. To effectively control a hybrid-exoskeleton for rehabilitation, this paper presents the development of a robust control method that is tailored for this unified and complex rehabilitation system. Specifically, a saturated auxiliary robust integral of the sliding mode (ARISE) control law is developed for the first time for a hybrid-exoskeleton. To validate the performance of the designed control law, a Lyapunov-like stability analysis and a preliminary simulation are performed.
|
| |
| 16:30-16:45, Paper WeC07.5 | Add to My Program |
| Integrating Target-Distance Feedback with Bayesian Optimization for Cost-Effective Process Optimization |
|
| Singhal, Bharat | Tokyo Electron America, INC |
| Matsumoto, Masashi | Tokyo Electron America, INC |
| Shinagawa, Jun | Tokyo Electron America, INC |
Keywords: Manufacturing systems, Materials processing, Optimization
Abstract: Abstract—In semiconductor manufacturing, as device sizes continue to shrink, each process step must meet increasingly stringent requirements that are challenging to achieve through traditional trial-and-error approaches. This has resulted in a shift toward machine learning methods, such as Bayesian optimization, which has shown promising results in reducing the number of experiments needed to meet process specifications in critical manufacturing steps such as etching, deposition, and lithography. Nevertheless, the performance of Bayesian optimization is sensitive to design parameters—such as acquisition functions and their settings—and suboptimal choices can significantly degrade its effectiveness. To address this, in this paper, we incorporate the distance between current performance and target specifications as feedback to dynamically adjust acquisition function parameters, enabling a strategy that emphasizes exploration initially and transitions to exploitation automatically as current performance approaches the target. Our approach, dynamic Bayesian optimization, reduces the number of experiments required to meet the target specification by balancing exploration and exploitation. We first demonstrate the effectiveness of our methodology on standard benchmark functions commonly used in Bayesian optimization. Then, we develop virtual metrology models to accurately predict the etched amounts of SiO2 and Si3N4 layers and employ these models to simultaneously optimize SiO2 etched amount and its selectivity to Si3N4 for the gaseous-phase etching process using our approach.
|
| |
| 16:45-17:00, Paper WeC07.6 | Add to My Program |
| Adaptive DHFD-DSMC Based Control for Five-Phase PMSM |
|
| Igbokwe, Daniel | Ampère/Renault Group, Centrale Nantes |
| Ghanes, Malek | Centrale Nantes |
| Bodson, Marc | Univ. of Utah |
| Hamida, Mohamed Assaad | LS2N, Ecole Centrale De Nantes |
| Messali, Amir | Ampère/Renault Group |
Keywords: Control applications, Electrical machine control
Abstract: Multiphase machines, which employ more than three phases for electrical energy conversion, have attracted renewed interest due to their enhanced efficiency, fault tolerance, and power density compared to conventional three-phase systems—qualities. These advantages make them well-suited for electric vehicles, renewable energy systems, aerospace, and high-power industrial drives. This paper presents practical refinements to existing control strategies for multiphase motor drives. Specifically, we propose a simplified adaptive discrete hybrid filtering differentiator based on a cube-root approximation of the established hybrid bi-homogeneous differentiator, building on prior work by Levant, Livne, and Jbara. Additionally, we introduce a computationally efficient reaching law that approximates the behavior of the exponential reaching law while significantly reducing online calculation burden. The modified differentiator is integrated with the discrete sliding-mode controller to replace conventional filtered differentiators typically paired with linear controllers in Electric Vehicle (EV) applications, thereby improving robustness without increasing computational load. The combined approach is designed for realtime implementation on resource-constrained embedded processors commonly used in EV drive systems.
|
| |
| WeC08 Invited Session, Grand Salon 10-13 |
Add to My Program |
| Multi-Agent Control and Coordination II |
|
| |
| Chair: Weintraub, Isaac | Air Force Research Laboratory |
| Co-Chair: Sinha, Abhinav | The University of Cincinnati |
| |
| 15:30-15:45, Paper WeC08.1 | Add to My Program |
| Strategic Concealment of Environment Representations in Competitive Games (I) |
|
| Guan, Yue | Georgia Institute of Technology |
| Maity, Dipankar | University of North Carolina at Charlotte |
| Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Game theory, Agents-based systems, Autonomous systems
Abstract: This letter studies the strategic concealment of environment representations used by players in competitive games. We consider a defense scenario in which one player (the Defender) seeks to infer and exploit the representation used by the other player (the Attacker). The interaction between the two players is modeled as a Bayesian game: the Defender infers the Attacker’s representation from its trajectory and places barriers to counteract, while the Attacker obfuscates its representation type to mislead the Defender. We solve for the Perfect Bayesian Nash Equilibrium via a bilinear program that integrates Bayesian inference, strategic planning, and belief manipulation. Simulations show that purposeful concealment naturally emerges: the Attacker randomizes its trajectory to manipulate the Defender’s belief, inducing suboptimal barrier selections and thereby gaining a strategic advantage.
|
| |
| 15:45-16:00, Paper WeC08.2 | Add to My Program |
| Reach-Avoid Games in a Grid World (I) |
|
| Berneburg, James | George Mason University |
| Dorothy, Michael | US Army Research Laboratory |
| Shishika, Daigo | George Mason University |
Keywords: Game theory
Abstract: Reach-avoid games model scenarios where an attacker seeks to reach a target region that a defender aims to protect. The challenge compared to pursuit-evasion games is that a defender cannot simply capture an attacker, but is concerned with what points the attacker can reach before it is captured. We consider for the first time, to the best of our knowledge, a reach-avoid game played in a graph environment with simultaneous actions. We solve the game of kind played on an obstacle-free 2-dimensional grid world between a single Attacker and a single Defender. We find Attacker has a winning strategy when it can reach any point in the target before Defender, and Defender has a winning strategy in the complementary case, provided Defender's tagging radius is positive, despite players having equal speeds so Defender cannot guarantee capture. We additionally provide minor results and discussion to connect this to more general problems, such as with multiple defenders and attackers.
|
| |
| 16:00-16:15, Paper WeC08.3 | Add to My Program |
| Safe Navigation in the Presence of Range-Limited Pursuers (I) |
|
| Chapman, Thomas | Air Force Research Laboratory |
| Von Moll, Alexander | Air Force Research Laboratory |
| Weintraub, Isaac | Air Force Research Laboratory |
Keywords: Autonomous systems, Aerospace, Intelligent systems
Abstract: This paper examines the degree to which an evader seeking a safe and efficient path to a target location can benefit from increasing levels of knowledge regarding one or more range-limited pursuers seeking to intercept it. Unlike previous work, this research considers the time of flight of the pursuers actively attempting interception. It is shown that additional knowledge allows the evader to safely steer closer to the threats, shortening paths without accepting additional risk of capture. A control heuristic is presented, suitable for real-time implementation, which capitalizes on all knowledge available to the evader.
|
| |
| 16:15-16:30, Paper WeC08.4 | Add to My Program |
| Target Defense Using a Turret and Mobile Defender Team (I) |
|
| Von Moll, Alexander | Air Force Research Laboratory |
| Maity, Dipankar | University of North Carolina at Charlotte |
| Pachter, Meir | AFIT/ENG |
| Shishika, Daigo | George Mason University |
| Dorothy, Michael | US Army Research Laboratory |
Keywords: Game theory, Cooperative control, Autonomous vehicles
Abstract: A scenario is considered wherein a stationary, turn constrained agent (Turret) and a mobile agent (Defender) cooperate to protect the former from an adversarial mobile agent (Attacker). The Attacker wishes to reach the Turret prior to getting captured by either the Defender or Turret, if possible. Meanwhile, the Defender and Turret seek to capture the Attacker as far from the Turret as possible. This scenario is formulated as a differential game and solved using a geometric approach. Necessary and sufficient conditions for the Turret-Defender team winning and the Attacker winning are given. In the case of the Turret-Defender team winning equilibrium strategies for the min max terminal distance of the Attacker to the Turret are given. Three cases arise corresponding to solo capture by the Defender, solo capture by the Turret, and capture simultaneously by both Turret and Defender.
|
| |
| 16:30-16:45, Paper WeC08.5 | Add to My Program |
| Heterogeneous Pursuit of an Active Target under Sensing Constraints (I) |
|
| Surve, Prajakta | Michigan State University |
| Bopardikar, Shaunak D. | Michigan State University |
| Von Moll, Alexander | Air Force Research Laboratory |
| Weintraub, Isaac | Air Force Research Laboratory |
| Casbeer, David W. | Air Force Research Laboratory |
Keywords: Agents-based systems, Autonomous systems, Game theory
Abstract: This paper studies a heterogeneous three-agent pursuit-evasion scenario in which a sensor–attacker team attempts to capture an active target capable of changing its heading at fixed time intervals. The sensor has a limited sensing range, and the attacker must intercept the target before it escapes sensing. We formulate this problem as a emph{game of kind} and extend the optimal sensor and attacker strategies from prior work on passive targets to the active target setting. The sensor updates its heading in each interval by assuming that the target will keep its heading fixed for the rest of the engagement, while the attacker uses an Apollonius circle-based approach for minimum-time interception and updating its heading corresponding to the target heading in every interval. We show that the conditions for capture or escape of a passive target also extend to the case of an active target. In particular, if the speed of the active target is less than a critical value identified for passive targets in our prior work, then capture is guaranteed.
|
| |
| 16:45-17:00, Paper WeC08.6 | Add to My Program |
| Decentralized CBF-Based Safety Filters for Collision Avoidance of Cooperative Missile Systems with Input Constraints |
|
| Autenrieb, Johannes | German Aerospace Center (DLR) |
| Spiller, Mark | German Aerospace Center (DLR) |
Keywords: Aerospace, Agents-based systems, Flight control
Abstract: This paper presents a decentralized safety filter for collision avoidance in multi-agent aerospace interception scenarios. The approach leverages robust control barrier functions (RCBFs) to guarantee forward invariance of safe sets under bounded inputs and high-relative-degree dynamics. Each effector executes its nominal cooperative guidance command, while a local quadratic program (QP) modifies the input only when necessary. Event-triggered activation based on range and zero-effort miss (ZEM) criteria ensures scalability by restricting active constraints to relevant neighbors. To ensure feasibility under multiple simultaneously active constraints, a slack-variable relaxation scheme is introduced that prioritizes critical agents in a Pareto-optimal manner. Simulation results in many-on-many interception scenarios demonstrate that the proposed framework maintains collision-free operation with minimal deviation from nominal guidance, providing a computationally efficient and scalable solution for safety-critical multi-agent aerospace systems.
|
| |
| WeC09 Invited Session, Grand Salon 12 |
Add to My Program |
| Dynamics and Behavior of Energy Storage Systems |
|
| |
| Chair: Filgueira da Silva, Samuel | The Ohio State University |
| Co-Chair: Moura, Scott | University of California, Berkeley |
| Organizer: Docimo, Donald | Texas Tech University |
| Organizer: Soudbakhsh, Damoon | Temple University |
| Organizer: Zhang, Dong | University of Oklahoma |
| Organizer: Song, Ziyou | University of Michigan, Ann Arbor |
| Organizer: Araujo Xavier, Marcelo | Amazon Leo |
| Organizer: Moura, Scott | University of California, Berkeley |
| Organizer: Lin, Xinfan | University of California, Davis |
| Organizer: Cui, Xiaofan | University of California, Los Angeles |
| Organizer: Filgueira da Silva, Samuel | The Ohio State University |
| Organizer: Tang, Shuxia | Texas Tech University |
| Organizer: Dey, Satadru | The Pennsylvania State University |
| |
| 15:30-15:45, Paper WeC09.1 | Add to My Program |
| Design and Modeling of a Heat-Exchange Sleeve for Enhanced Thermal Safety of Lithium-Ion Batteries (I) |
|
| Ferreira, Patryck | Texas Tech University |
| Tang, Shuxia | Texas Tech University |
Keywords: Modeling, Energy systems, Simulation
Abstract: This paper presents the design and modeling of a phase change material (PCM) sleeve enclosed in a Polyethylene Terephthalate Glycol (PETG) case for the thermal management of lithium-ion batteries. The model introduces five states: PETG inner wall, PCM solid, PCM liquid, phase fraction, and PETG outer surface, ensuring accurate representation of conduction, latent heat absorption, and convective heat loss. The formulation is validated experimentally using a 26650 Li-ion cell cycled at 11 A under controlled laboratory conditions. Results show satisfactory agreement between model predictions and experimental measurements, with root-mean-square errors of 0.56 ◦ C at the PETG inner wall and 1.44 ◦ C in the PCM domain. The PETG+PCM sleeve reduces case temperature rise by approximately 5 ◦ C and attenuates oscillation amplitude, indicating the buffering role of PCM.
|
| |
| 15:45-16:00, Paper WeC09.2 | Add to My Program |
| Unbalancing Second-Life Battery Pack with Bidirectional DC/DC Converter for Extending Life (I) |
|
| Xu, Zhicheng | University of Oklahoma |
| Kajiura, Yuichi | University of Oklahoma |
| Choo, Wonoo | University of Oklahoma |
| Espin, Jorge | University of Oklahoma |
| Zhang, Dong | University of Oklahoma |
Keywords: Energy systems, Power electronics, Modeling
Abstract: This paper proposes a novel battery pack state of charge (SoC) unbalancing strategy to extend the lifespan of series connected heterogeneous second-life batteries with parallel connected, isolated bidirectional boost converters. By utilizing a balancing bus via converters, the system is capable of regulating the current of individual battery. This feature deliberately unbalances the SoC of each battery module based on its estimated state of health (SoH) and an ampere-hour throughput degradation model. This strategy ensures that all battery modules reach the end of life (EoL) simultaneously, thereby maximizing the overall lifetime of the pack system. Simulation results demonstrate that the proposed approach can significantly increase the cycle life by approximately 50%, especially in scenarios with substantial SoH heterogeneity, a common condition for second-life battery applications.
|
| |
| 16:00-16:15, Paper WeC09.3 | Add to My Program |
| Conformalized Transfer Learning for Li-Ion Battery State of Health Forecasting under Manufacturing and Usage Variability (I) |
|
| Filgueira da Silva, Samuel | The Ohio State University |
| Ozkan, Mehmet | The Ohio State University |
| El Idrissi, Faissal | The Ohio State University |
| Canova, Marcello | The Ohio State University |
Keywords: Energy systems, Estimation, Machine learning
Abstract: Accurate forecasting of state-of-health (SOH) is essential for ensuring safe and reliable operation of lithium-ion cells. However, existing models calibrated on laboratory tests at specific conditions often fail to generalize to new cells that differ due to small manufacturing variations or operate under different conditions. To address this challenge, an uncertainty-aware transfer learning framework is proposed, combining a Long Short-Term Memory (LSTM) model with domain adaptation via Maximum Mean Discrepancy (MMD) and uncertainty quantification through Conformal Prediction (CP). The LSTM model is trained on a virtual battery dataset designed to capture real-world variability in electrode manufacturing and operating conditions. MMD aligns latent feature distributions between simulated and target domains to mitigate domain shift, while CP provides calibrated, distribution-free prediction intervals. This framework improves both the generalization and trustworthiness of SOH forecasts across heterogeneous cells.
|
| |
| 16:15-16:30, Paper WeC09.4 | Add to My Program |
| Analysis of the Single Particle Model with Electrolyte for Lithium Iron Phosphate Batteries (I) |
|
| Lee, Jaewoong | University of California, Berkeley |
| Jiang, Shida | Univeristy of California, Berkeley |
| Tao, Shengyu | Chalmers University of Technology |
| Zeng, Wente | TotalEnergies S.E |
| Lamare, Pierre-Olivier | SAFT |
| Bertin, Clement | SAFT |
| Monier-Reyes, Daniel | SAFT |
| Benjamin, Sebastien | Saft S.A |
| Moura, Scott | University of California, Berkeley |
Keywords: Energy systems, Modeling
Abstract: This paper examines the challenges in accurately modeling lithium iron phosphate (LFP) batteries using the single particle model with electrolyte (SPMe). The SPMe offers a practical balance between the high-fidelity Doyle-Fuller-Newman (DFN) model and computationally efficient equivalent circuit models (ECM) for battery management systems (BMS). While the SPMe has demonstrated success across many Li-ion chemistries, it encounters structural limitations when applied to LFP batteries. These limitations arise from the unique two-phase kinetics of LFP cathodes, which result in a flat cathode open-circuit potential (OCP). The two-phase transformation mechanisms in the LFP cathodes are not explicitly captured by the single-phase diffusion assumption of the SPMe, and the flat cathode OCP can degrade voltage prediction accuracy. We quantify these challenges and identify root causes through voltage component and parameter sensitivity analyses. Based on these findings, we highlight the need for additional voltage correction terms in the SPMe voltage equation and propose an empirical correction approach for LFP batteries.
|
| |
| 16:30-16:45, Paper WeC09.5 | Add to My Program |
| System Identification of Lithium-Ion Battery Equivalent Circuit Models Using Ensemble Kalman Inversion (I) |
|
| Barat, Farzaneh | University of Kansas |
| Wilson, Sara | University of Kansas |
| Kim, Huijeong | University of Kansas |
| Fang, Huazhen | Michigan State University |
Keywords: Identification, Energy systems, Optimization
Abstract: System identification remains an intriguing challenge for lithium-ion batteries, as many models are nonlinear, exhibit multi-physics coupling, and involve a large number of parameters. In this paper, we address this challenge using the ensemble Kalman inversion (EnKI) method for battery system identification. EnKI performs maximum a posteriori parameter estimation through successive local Gaussian approximations, enabling an iterative and incremental search for unknown parameters. The search combines Monte Carlo sampling with Kalman-type updates to evolve an ensemble of samples, thereby offering empirical stability and the ability to handle strongly nonlinear models. We validate the proposed approach on two equivalent circuit models with coupled electro-thermal dynamics, through both simulation and experiments. The results demonstrate that the proposed approach achieves accurate parameter estimation with rapid iterative convergence, and it shows strong potential for application to other battery models.
|
| |
| 16:45-17:00, Paper WeC09.6 | Add to My Program |
| Concurrent Optimization of Lithium-Ion Battery Charging and Balancing Via Safe and Data-Efficient Deep Reinforcement Learning |
|
| Hailemichael, Habtamu | Clemson University |
| Ayalew, Beshah | Clemson University |
| Figueroa-Santos, Miriam | Ground Vehicle System Center |
| Barron, Morgan | Ground Vehicle System Center |
Keywords: Modeling, Reinforcement learning, Energy systems
Abstract: This paper presents a novel deep reinforcement learning (DRL)-based approach for the joint optimization of charging and balancing of lithium-ion battery packs. Traditional methods often rely on conservative, rule-based strategies that limit efficiency and adaptability, or on model-based approaches that are computationally intensive for real-time implementation and degrade in optimality as the battery ages. In contrast, our framework employs DRL to develop adaptive, optimal control policies that dynamically respond to the battery’s changing state. DRL training is conducted through interaction with continually learned, high-fidelity hybrid virtual environment that combines physics-based battery models with Transformer-based neural residuals. This continually updated environment enables safe, efficient, and adaptive learning that reflects the battery’s evolving behavior over time. Compared to conventional constant current–constant voltage (CCCV) charging with rule-based balancing, our method achieves up to 2× faster charging and maintains SOC imbalance below 0.25%, demonstrating substantial improvements in performance and safety.
|
| |
| WeC10 Regular Session, Grand Salon 15 |
Add to My Program |
| Linear Matrix Inequalities |
|
| |
| Chair: Seiler, Peter | University of Michigan, Ann Arbor |
| Co-Chair: Peet, Yulia | Arizona State University |
| |
| 15:30-15:45, Paper WeC10.1 | Add to My Program |
| Replay Attack Detection with Chaotic Masking in Cyber-Physical Systems |
|
| Lu, Hongnian | Zhejiang University |
| Liu, Zhitao | Zhejiang University |
| Chen, Tao | Zhejiang University |
| Su, Hongye | Zhejiang Univ |
Keywords: Chaotic systems, Neural networks, LMIs
Abstract: Replay attacks are a common threat in cyber-physical systems, potentially causing damage to the physical plant while avoiding detection by replaying historical sensor measurements. In this paper, inspired by the fact that systems with periodic steady states, e.g., constant or sinusoid, are susceptible to the replay attacks, we propose a chaotic masking scheme to break this periodicity, thereby facilitating replay attack detection. Specifically, chaotic signals are superimposed on the sensor measurements, intentionally rendering the transmitted data non-periodic. To de-mask the transmitted data, an extended observer is designed, which estimates the states of both the physical plant and the chaotic system. Finally, the effectiveness of the proposed scheme for replay attack detection is verified by simulation.
|
| |
| 15:45-16:00, Paper WeC10.2 | Add to My Program |
| Hinf Analysis and Synthesis for Vector-Valued Edge Consensus Networks |
|
| Mihaly, Vlad Mihai | Technical University of Cluj-Napoca |
| Susca, Mircea | Technical University of Cluj-Napoca |
| Abou Jaoude, Dany | American University of Beirut |
Keywords: Network analysis and control, LMIs, Robust control
Abstract: This paper focuses on networks of agents described using vector-valued and time-scaled multi-rate integrators interconnected through matrix-weighted edges. Both the agents/nodes and their edges/interconnections are affected by zero-mean Gaussian noise. The finite-energy noise impact on network consensus is measured using the Hinf-norm of the resulting edge representation of the dynamical system. For this setup, we provide three technical innovations. First, we propose a method to convexify the computation of the aforementioned norm in terms of the weights and time scales of the graph. Second, we provide an iterative convex synthesis method to minimize the said norm based on the tunable weights and time scales. Third, we cover a general output performance index based on arbitrary linear configurations of the system's states, whereas previous literature only considers the spanning tree and co-tree edge states for the network output. A numerical example illustrates the proposed novel algorithm.
|
| |
| 16:00-16:15, Paper WeC10.3 | Add to My Program |
| A PIE Representation of Cylindrical PDEs and Stability Analysis Using LPIs |
|
| Purra, Varshitha | Arizona State University |
| Peet, Yulia | Arizona State University |
Keywords: Linear systems, Distributed parameter systems, LMIs
Abstract: This paper extends the Partial Integral Equation (PIE) framework to cylindrical PDEs. Singular radially-dependent terms, such as 1/r and 1/r2, preclude a direct application of a PDE-to-PIE reformulation, developed for Cartesian coordinates, and the subsequent PIE-based stability analysis, which requires polynomial coefficients. We develop two exact reformulation strategies, (i) a variable substitution s = r^2 and (ii) a weighted formulation multiplying the PDE by an appropriate power of r, and prove the equivalence between the developed reformulations and the original PDE. Using these strategies, we construct PIE representations of a canonical axisymmetric reaction-diffusion equation. We then use Linear Partial Inequalities (LPIs) in PIETOOLS 2024 to test asymptotic stability and compare the results with the analytically-derived stability bounds. Finally, we apply two reformulation strategies to a tokamak flux-gradient PDE model and show that the weighted formulation eliminates all fractional powers and results in an admissible PIE representation, whereas the substitution method fails to do so for this more complicated PDE. This work establishes a foundation for Lyapunov-based stability analysis of cylindrical infinite-dimensional systems.
|
| |
| 16:15-16:30, Paper WeC10.4 | Add to My Program |
| Echo State Network Controller Design for a Class of Leaky-Integrator Nonlinear Systems |
|
| Deng, Hao | University Paris-Saclay |
| Stoica, Cristina | CentraleSupélec/L2S, Univ. Paris-Saclay |
| Ossmann, Daniel | Munich University of Applied Sciences HM |
| Chadli, M. | University of Paris-Saclay - UEVE |
Keywords: LMIs
Abstract: This paper proposes a novel sufficient condition for the leaky-integrator echo state network controller design based on Incremental Input-to-State Stability criterion for discrete-time systems. The controller design conditions are derived via Linear Matrix Inequalities. A novel leaky-integrator echo state network-based controller structure is presented, demonstrating enhanced system performance. The simulation results highlight the performance improvements achieved by adopting the proposed controller structure.
|
| |
| 16:30-16:45, Paper WeC10.5 | Add to My Program |
| Robust Data-Driven Control for Nonlinear Systems Using Their Digital Twins and Quadratic Funnels |
|
| Shakeri, Shiva | University of Washington |
| Mesbahi, Mehran | University of Washington |
Keywords: Lyapunov methods, Iterative learning control, LMIs
Abstract: This paper examines a robust data-driven approach for the safe deployment of systems with nonlinear dynamics using their imperfect digital twins. Our contribution involves proposing a method that fuses the digital twin's nominal trajectory with online, data-driven uncertainty quantification to synthesize robust tracking controllers. Specifically, we derive data-driven bounds to capture the deviations of the actual system from its prescribed nominal trajectory informed via its digital twin. Subsequently, the dataset is used in the synthesis of quadratic funnels—robust positive invariant tubes around the nominal trajectory—via linear matrix inequalities built on the time-series data. The resulting controller guarantees constraint satisfaction while adapting to the true system behavior through a segmented learning strategy, where each segment's controller is synthesized using uncertainty information from the previous segment. This work establishes a systematic framework for obtaining safety certificates in learning-based control of nonlinear systems with imperfect models.
|
| |
| 16:45-17:00, Paper WeC10.6 | Add to My Program |
| Discrete-Time Stability Analysis of ReLU Feedback Systems Via Integral Quadratic Constraints |
|
| Vahedi Noori, Sahel | University of Michigan |
| Hu, Bin | University of Illinois at Urbana-Champaign |
| Dullerud, Geir E. | University of Minnesota |
| Seiler, Peter | University of Michigan, Ann Arbor |
Keywords: Robust control, LMIs, Neural networks
Abstract: This paper analyzes internal stability of a discrete-time feedback system with a ReLU nonlinearity. This feedback system is motivated by recurrent neural networks. We first review existing static quadratic constraints (QCs) for slope-restricted nonlinearities. Next, we derive hard integral quadratic constraints (IQCs) for scalar ReLU by using finite impulse filters and structured matrices. These IQCs are combined with a dissipation inequality leading to an LMI condition that certifies internal stability. We show that our new dynamic IQCs for ReLU are a superset of the well-known Zames-Falb IQCs specified for slope-restricted nonlinearities. Numerical results show that the proposed hard IQCs give less conservative stability margins than Zames-Falb multipliers and prior static QC methods, sometimes dramatically so.
|
| |
| WeC11 Invited Session, Grand Salon 16 |
Add to My Program |
| Mechatronics |
|
| |
| Chair: Xia, Fangzhou | The University of Texas at Austin |
| Co-Chair: Vikas, Vishesh | University of Alabama |
| Organizer: Barton, Kira | University of Michigan, Ann Arbor |
| Organizer: Su, Hao | North Carolina State University |
| Organizer: Mazumdar, Yi | Georgia Institute of Technology |
| Organizer: Vikas, Vishesh | University of Alabama |
| Organizer: Xia, Fangzhou | The University of Texas at Austin |
| Organizer: Zhang, Jun | University of Nevada Reno |
| Organizer: He, Binghan | The University of Texas at San Antonio |
| Organizer: Zhang, Qiang | The University of Alabama |
| Organizer: Han, Feng | New York Institute of Technology |
| Organizer: Zuo, Shan | University of Connecticut |
| |
| 15:30-15:45, Paper WeC11.1 | Add to My Program |
| Magnetic Position Estimation in Ferrous Actuators under Hysteresis: A Switched-Gain Non-Linear Observer Approach (I) |
|
| Daroudi, Sajjad | University of Minnesota |
| Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Mechatronics, Control applications, Nonlinear systems identification
Abstract: Estimating actuator position in construction and earthmoving vehicles is critical for enabling precise control, operator assistance, and automation of repetitive tasks. Conventional approaches based on string potentiometer or LVDTs can be costly, require maintenance, and lack robustness in harsh environments. This paper develops a magnetic-field-based position estimation method that uses a low-cost anisotropic magnetoresistive (AMR) sensor mounted on the actuator. A strong magnet attached to the piston provides magnetic field variations measured in longitudinal and vertical directions. A major challenge is that movement of the magnet magnetizes and demagnetizes the actuator’s body causing strong hysteresis in the magnetic field–position relationship, along with non-monotonic sensor characteristics. To address this, a Preisach model is identified from both major and minor loop data, and a polynomial function is used to represent the hysteresis-free baseline. The hysteresis model is then applied in real time to estimate and subtract the hysteresis contribution from the measurements, yielding corrected signals. A switched-gain nonlinear observer is subsequently designed to account for the non-monotonicity of the measurement functions. Experimental results on a hydraulic actuator show that the proposed approach provides accurate position tracking: the observer converges rapidly from unknown initial conditions and maintains estimation errors within 6 mm, with a root-mean-square error (RMSE) of about 1.9 mm.
|
| |
| 15:45-16:00, Paper WeC11.2 | Add to My Program |
| Sample Tilt Control for Atomic Force Microscopes with Multiple Parallel Active Cantilevers (I) |
|
| Lin, Tingyi | University of Texas, Austin |
| Hao, Yizhou | University of Texas at Austin |
| Li, Wenyue | University of Michigan |
| Sung, Joshua | The University of Texas at Austin |
| Xia, Fangzhou | The University of Texas at Austin |
Keywords: Mechatronics, MEMs and Nano systems, Control applications
Abstract: Parallel operation of multiple atomic force microscope (AFM) probes has emerged as a promising approach for high-throughput nanoscale metrology, particularly in semiconductor inspection where rapid and reliable surface characterization is critical. However, when two or more cantilevers operate simultaneously, unavoidable tilt misalignment between the probes and the sample surface can cause asynchronous engagement, unstable imaging, and measurement errors. To address this challenge, this paper presents the design and implementation of a dual-probe AFM (DP-AFM) integrated with an active tilt compensation control strategy. The system employs a hierarchical dual closed-loop framework, where coarse alignment is achieved using a goniometer motor and fine compensation is realized through a z-axis piezoelectric positioner in coordination with a custom tilt stage. Engagement thresholds derived from probe amplitude signals, measured via lock-in amplifiers, serve as feedback inputs to ensure stable and simultaneous probe--sample contact. A proportional-integral (PI) controller regulates the oscillation amplitude during scanning, thereby mitigating disturbances from tilt, environmental noise, and stage dynamics. Experimental results demonstrate that the proposed control scheme reduces engagement error, improves stability, and enables parallel scanning with dual probes. These results highlight the potential of control-based strategies to advance AFM toward scalable multi-probe configurations applicable to semiconductor metrology, biomedical research, and other nanoscale applications.
|
| |
| 16:00-16:15, Paper WeC11.3 | Add to My Program |
| Machine Learning-Based Stability Lobe Diagram Estimation and Chatter Suppression Control in Milling Process (I) |
|
| Huang, Yi | Rutgers University |
| Han, Feng | New York Institute of Technology |
| Liu, Wenyi | Rutgers University |
| Yi, Jingang | Rutgers University |
| Guo, Yuebin | Rutgers University |
Keywords: Mechatronics, Machine learning, Manufacturing systems
Abstract: Chatter is a self-excited vibration in milling that degrades surface quality and accelerates tool wear. This paper presents an adaptive process controller that suppresses chatter by leveraging machine learning-based online estimation of the Stability Lobe Diagram (SLD) in realtime. Stability analysis is conducted using the semi-discretization method for milling dynamics modeled by delay differential equations. An integrated machine learning framework estimates the SLD from sensor data and predicts surface roughness for chatter detection in real time. These estimates are integrated into an optimal controller that adaptively adjusts spindle speed to maintain process stability and improve smoothness of surface finish. Simulations and experiments are performed to demonstrate the superior performance compared to the existing approaches.
|
| |
| 16:15-16:30, Paper WeC11.4 | Add to My Program |
| Frequency-Domain Fault Analysis and Diagnostic-Enhanced Design of Hydraulic Systems (I) |
|
| Chen, Weichen | Oakland University |
| Yoon, Yongsoon | Oakland University |
Keywords: Fault diagnosis, Fluid power control, Mechatronics
Abstract: This paper presents a frequency-domain framework for fault analysis and its application to diagnostic-enhanced design of hydraulic systems. To this end, physics-based inverse generalized frequency response functions are first derived. On this basis, fault sensitivity functions are defined as a scaled gradient vector, enabling a quantitative measure of fault effects in the frequency domain. Fault sensitivity analysis is conducted across light-, medium-, and heavy-duty applications. The analysis reveals that diagnostic performance depends strongly on nominal parameters, and faults can be categorized into two groups according to their effects on sensitivity trajectories in the complex plane: energy-transmission faults and dissipative faults. Building upon these insights, a diagnostic-enhanced design methodology is proposed and applied to a medium-duty hydraulic system, originally exhibiting poor fault isolation. The redesigned system achieves a substantially improved separation metric, demonstrating enhanced diagnostic capability.
|
| |
| 16:30-16:45, Paper WeC11.5 | Add to My Program |
| Compliance Control of an Electro-Hydraulic Large-Scale Manipulator for Robotic On-Site Construction |
|
| Wasserloos, Philipp | University of Stuttgart |
| Gienger, Andreas | University of Stuttgart |
| Sawodny, Oliver | University of Stuttgart |
Keywords: Control applications, Mechatronics
Abstract: Robotics is an emerging field in the construction sector. While it is a promising approach to increase productivity and sustainability, it also comes with some challenges such as uncertain environments paired with the necessity to ensure safe interaction between the end-effector and its surroundings. To overcome these challenges, this paper presents a compliance control scheme for large-scale electro-hydraulic manipulators with high structural flexibility using joint velocities as control variables, and analyzes its performance. First, a trajectory in workspace coordinates is generated. Then, based on the force measured near the tool center point (TCP), a desired deviation from the trajectory is computed. The new trajectory is then transferred to joint position trajectories, which constitute the reference signal of joint position controllers. As a result, the TCP follows a trajectory in space, but yields to external forces when contact with the environment is established. To evaluate the approach, two experiments are carried out. First, a constant force is applied and the reaction of the system recorded. Second, an assembly situation at the construction site with interaction between the end-effector and the environment is imitated and a building element is inserted by the manipulator. The experiments not only validate the quantitative expectations, but also demonstrate the applicability to construction processes.
|
| |
| 16:45-17:00, Paper WeC11.6 | Add to My Program |
| Dominant Pole Placement in Linear Quadratic Regulators: Application in Automated Design of Gain-Scheduled Control |
|
| Ekanayake, Lahiru | Southern Illinois University |
| Komaee, Arash | Southern Illinois University |
Keywords: Linear parameter-varying systems, Mechatronics, Identification
Abstract: A tuning procedure for linear quadratic regulators (LQR) is presented to stabilize linear systems with a prescribed exponential decay rate. More specifically, this procedure allows an LQR to place the dominant poles of a closed-loop linear system at an arbitrary distance from the imaginary axis on the left half of the complex plane. This procedure is in particular a useful tool for automated control design within the framework of gain scheduling, in which a family of linear controllers must be designed for a set of linear systems that locally approximate a nonlinear system at a large number of operating points. By placing the dominant closed-loop poles of all linear systems at the same distance from the imaginary axis, uniform bandwidth and perhaps nearly identical frequency response can be attained at all operating points. Application of uniform pole placement in gain scheduling control is demonstrated by a practical example in feedback control of magnetic manipulators.
|
| |
| WeC12 Regular Session, Grand Salon 18 |
Add to My Program |
| Uncertain Systems |
|
| |
| Chair: Coogan, Samuel | Georgia Institute of Technology |
| Co-Chair: Givigi, Sidney | Queen's University |
| |
| 15:30-15:45, Paper WeC12.1 | Add to My Program |
| Risk Mitigation for Interval Signal Temporal Logic Monitoring and Synthesis |
|
| Baird, Luke | Georgia Institute of Technology |
| Schoer, Andrew | MIT Lincoln Laboratory |
| Cleaveland, Matthew | MIT Lincoln Laboratory |
| Coogan, Samuel | Georgia Institute of Technology |
| Leahy, Kevin | Worcester Polytechnic Institute |
Keywords: Uncertain systems, Computational methods, Fault detection
Abstract: This letter presents a method to mitigate the risk of violation of a temporal logic specification. Given a time-varying signal with interval-valued uncertainty at each time step, we propose an optimization approach to identify time instances for which a tighter uncertainty bound is required to satisfy the specification, knowledge which can be used to, e.g., focus sensing resources to strategically reduce uncertainty. We demonstrate our method on a simulated unmanned underwater vehicle where GPS calibration is informed by the identified time instances. In contrast to existing methods that solve an uncertainty-aware optimal control program with temporal logic mixed-integer constraints, our proposed interval-tightening approach is several orders of magnitude faster to compute. Additionally, we survey methods to produce interval-valued uncertainty, specifically how probabilistic bounds may translate to confidence intervals about an entire signal.
|
| |
| 15:45-16:00, Paper WeC12.2 | Add to My Program |
| Learning-Based Shrinking Disturbance-Invariant Tubes for State and Input-Dependent Uncertainty |
|
| Ramadan, Abdelrahman | Queen's University |
| Givigi, Sidney | Queen's University |
Keywords: Uncertain systems, Data driven control, Robust adaptive control
Abstract: We develop a learning-based framework for constructing shrinking disturbance-invariant tubes under state- and input-dependent uncertainty, intended as a building block for tube Model Predictive Control (MPC), and certify safety via a lifted, isotone (order-preserving) fixed-point map. Gaussian Process (GP) posteriors become (1 − α) credible ellipsoids, then polytopic outer sets for deterministic set operations. A two-time-scale scheme separates learning epochs, where these polytopes are frozen, from an inner, outside-in iteration that converges to a compact fixed point Z* ⊆ 𝒢; its state projection is RPI for the plant. As data accumulate, disturbance polytopes tighten and the associated tubes nest monotonically, resolving the circular dependence between the set to be verified and the disturbance model while preserving hard constraints. A double-integrator study illustrates shrinking tube cross-sections in data-rich regions while maintaining invariance.
|
| |
| 16:00-16:15, Paper WeC12.3 | Add to My Program |
| Learning Feedforward Planners from Frequency-Domain Data |
|
| Jiang, Yu | ClearMotion, Inc |
Keywords: Uncertain systems, Direct adaptive control, Linear systems
Abstract: This paper develops a frequency-domain framework for learning finite-horizon feedforward planners directly from data. By embedding Toeplitz dynamics into circulants, the lifted planning problem is diagonalized under the discrete Fourier transform, reducing control design to per-frequency gain estimation. Trajectory windows are tapered, transformed, and used in convex least-squares regressions with Laplacian or Tikhonov regularization, allowing selective use of informative frequency bins and mitigating noise effects. The resulting planner is obtained without explicit plant identification and yields interpretable frequency responses. On a preview steering problem with a kinematic bicycle model, combining the learned feedforward with a baseline feedback controller improves tracking accuracy relative to feedback alone.
|
| |
| 16:15-16:30, Paper WeC12.4 | Add to My Program |
| Safe Navigation in Dynamic Environments Using Data-Driven Koopman Operators and Conformal Prediction |
|
| Liang, Kaier | Lehigh University |
| Yang, Guang | Boston University |
| Cai, Mingyu | University of California Riverside |
| Vasile, Cristian Ioan | Lehigh University |
Keywords: Uncertain systems, Nonlinear systems identification, Predictive control for linear systems
Abstract: We propose a novel framework for safe navigation in dynamic environments by integrating Koopman operator theory with conformal prediction. Our approach leverages data-driven Koopman approximation to learn nonlinear dynamics and employs conformal prediction to quantify uncertainty in the Koopman approximation, providing statistical guarantees on approximation errors. This uncertainty is effectively incorporated into a Model Predictive Controller (MPC) formulation through constraint tightening, ensuring robust safety guarantees. We implement a layered control architecture with a reference generator providing waypoints for safe navigation. The effectiveness of our methods is validated in simulation.
|
| |
| 16:30-16:45, Paper WeC12.5 | Add to My Program |
| On Globally Optimal Stochastic Policy Gradient Methods for Domain Randomized LQR Synthesis |
|
| Nguyen-Le, Alexian | University of Pennsylvania |
| Matni, Nikolai | University of Pennsylvania |
Keywords: Uncertain systems, Robust control, Randomized algorithms
Abstract: Domain randomization is a simple, effective, and flexible scheme for obtaining robust feedback policies aimed at reducing the sim-to-real gap due to model mismatch. While domain randomization methods have yielded impressive demonstrations in the robotics-learning literature, general and theoretically motivated principles for designing optimization schemes that effectively leverage the randomization are largely unexplored. We address this gap by considering a stochastic policy gradient descent method for the domain randomized linear-quadratic regulator synthesis problem, a situation simple enough to provide theoretical guarantees. In particular, we demonstrate that stochastic gradients obtained by repeatedly sampling new systems at each gradient step converge to global optima with appropriate hyperparameters choices, and yield better controllers with lower variability in the final controllers when compared to approaches that do not resample. Sampling is often a quick and cheap operation, so computing policy gradients with newly sampled systems at each iteration is preferable to evaluating gradients on a fixed set of systems.
|
| |
| 16:45-17:00, Paper WeC12.6 | Add to My Program |
| Indirect Herding Control through a Chain-Of-Influence by Uncertain Intermediaries |
|
| Philor, Jhyv | University of Florida |
| Nino, Cristian | Florida Institute for Human and Machine Cognition |
| Amy, Patrick | University of Florida |
| Dixon, Warren E. | University of Florida |
Keywords: Uncertain systems, Stability of nonlinear systems, Lyapunov methods
Abstract: Herding is an established class of problems where a herding agent seeks to influence a target agent to move to a desired location. Motivated by scenarios where the herder cannot directly influence a target or the herder desires to conceal its influence, this paper examines a new generalization of the herding problem where the herding agent influences a target agent through a set of intermediate agents (i.e., multi-hop chain of influence). The interconnected influences result in challenging coupled dynamics within the group of agents. Motivated by this challenge, robust control techniques are used to compensate for the complex chain of influences among the group, and a Lyapunov-based analysis is used to prove the target is exponentially regulated to a desired location by the herding agent.
|
| |
| WeC13 Regular Session, Grand Salon 19 |
Add to My Program |
| Autonomous Systems III |
|
| |
| Chair: Rogers, Jonathan | Naval Surface Warfare Center, Philadelphia Division |
| Co-Chair: Sargolzaei, Arman | University of South Florida |
| |
| 15:30-15:45, Paper WeC13.1 | Add to My Program |
| Trusted Reinforcement Learning Control of a Shipboard Stewart Platform Via Control Lyapunov Functions and Control Barrier Functions |
|
| Rogers, Jonathan | Naval Surface Warfare Center, Philadelphia Division |
| Ripple, John T. | Naval Surface Warfare Center, Philadelphia Division |
Keywords: Control applications, Reinforcement learning, Stability of nonlinear systems
Abstract: Certain shipboard payloads–such as cranes, cameras, or landing decks–require ship stability in order to operate correctly. Unfortunately, wave induced ship motion causes these systems to perform in a suboptimal manner. A solution is to mount the payloads onto a parallel manipulator–like a Stewart platform–instead of directly to the ship. However, Stewart platforms are difficult to control because they are highly nonlinear and, in the case of shipboard applications, the bottom plate will move due to the ship motion. Motivated by the need to stabilize shipboard payloads, in this paper, we present a novel reinforcement learning (RL) based control strategy for a Stewart platform that contains controller stability and system safety guarantees through the use of a control Lyapunov function (CLF) and control barrier function (CBF) based quadratic program (QP) filter. We demonstrate robustness of our control strategy through numerical validation that includes real-world extreme wave induced ship motion.
|
| |
| 15:45-16:00, Paper WeC13.2 | Add to My Program |
| Quadrotor Aerobatics: A Geometric Algebra–Based Tracking Approach |
|
| Garcia Alcantara, Omar Alejandro | New Mexico State University |
| Sandoval, Steven | New Mexico State University |
| Rubio Scola, Ignacio | INTI - Conicet |
| Garcia Carrillo, Luis Rodolfo | Air Force Research Laboratory (AFRL) |
| Espinoza Quesada, Eduardo Steed | CINVESTAV |
Keywords: Modeling, Algebraic/geometric methods, Autonomous vehicles
Abstract: Geometric Algebra (GA) tools are employed to enable trajectory tracking of aerobatic maneuvers for a quadrotor while explicitly accounting for its fully coupled 6-DoF dynamics. The proposed method introduces a GA-based prioritization framework for reference generation that unifies translational objectives, preferred orientations, and aerobatic maneuver commands within a single control structure. Rotational references are derived analytically from translational trajectories, thereby avoiding the need for numerical differentiation. The effectiveness of the trajectory tracking controller is demonstrated through numerical simulations of a mission profile comprising three aerobatic maneuvers: barrel roll, 720° corkscrew, and a loop, executed in the presence of external disturbances (drag and wind). The results demonstrate the controller's robustness and effectiveness for handling underactuation and achieving trajectory tracking of aerobatic maneuvers beyond π radians.
|
| |
| 16:00-16:15, Paper WeC13.3 | Add to My Program |
| Resilient UAV Mission Management in Threat-Prone Environments Using Markov Decision Processes |
|
| Quamar, Md Muzakkir | King Fahd University of Petroleum and Minerals |
| Nasir, Ali | King Fahd University of Petroleum and Minerals |
Keywords: Markov processes, Autonomous systems, Stochastic optimal control
Abstract: This paper proposes a unified Markov Decision Process (MDP)-based framework for resilient mission management of a single unmanned aerial vehicle (UAV) operating in threat-prone and resource-constrained environments. UAV missions are inherently challenged by limited battery capacity, stochastic threats, dynamic goal reprioritization, and potential actuator, sensor, or payload faults. Existing approaches typically address these factors in isolation, resulting in suboptimal decision-making under complex operational conditions. The proposed framework formulates mission execution as a finite-state MDP that integrates post-fault system capability assessment, state-of-charge-based range feasibility, dynamic goal updates, and adaptive navigation mode switching for threat evasion. A structured cost function captures trade-offs among mission commitment, fault mitigation, energy preservation, and risk-aware maneuvering. The optimal policy is computed offline using value iteration and implemented online via a precomputed lookup structure to enable low-latency execution. Simulation results demonstrate adaptive goal commitment, safe recovery under major faults, and recharge-aware planning under energy constraints, thereby enhancing mission reliability and operational resilience.
|
| |
| 16:15-16:30, Paper WeC13.4 | Add to My Program |
| Nonlinear Time-Optimal Trajectory Optimization for Heterogeneous Microrobots in Constrained Environments |
|
| TalebiRostami, Hossein | Old Dominion University |
| Beaver, Logan E. | Old Dominion University |
Keywords: Optimization, Optimization algorithms, Robotics
Abstract: This paper presents a nonlinear optimization framework for offline, time-optimal trajectory planning of heterogeneous microrobots driven by a shared global magnetic field. Navigation is posed as a constrained nonlinear program with workspace limits, obstacle avoidance, and inter-robot separation. To reduce computational burden, three enhancements are introduced: micro-step constraint aggregation, vectorized trajectory evaluation, and precomputed velocity–frequency interpolation. These updates reduce computation time by nearly 90% relative to the original formulation without compromising accuracy. Simulations with up to three microrobots in cluttered environments, using experimentally calibrated speed–frequency data, demonstrate feasibility of coordinated shared-input control. Compared with Rapidly-exploring Random Trees (RRT) and path-length–based strategies, the proposed formulation converges reliably while explicitly minimizing total traversal time.
|
| |
| 16:30-16:45, Paper WeC13.5 | Add to My Program |
| Data-Driven Predictive Control for Autonomous Vehicle Tracking with Obstacle Avoidance Using Gaussian Processes |
|
| Xie, Mengxu | Northeastern University |
| Ma, Tong | Northeastern University |
Keywords: Stochastic optimal control, Identification for control, Autonomous systems
Abstract: This work tackles the challenge of vehicle tracking under obstacle avoidance by embedding path planning within a model- and data-driven predictive control framework that leverages Gaussian processes (GP-MDPC) in conjunction with backoffs. Autonomous driving is inherently affected by stochasticity and uncertainty, making vehicle tracking with obstacle avoidance a stochastic constrained control problem, where stochastic nonlinear model predictive control (SNMPC) emerges as the primary approach. The proposed approach addresses two key challenges in SNMPC. First, GPs are employed to capture unknown vehicle dynamics, with sparse GPs adopted to reduce computational complexity and enable efficient online evaluation. Second, a backoff-based approximation is introduced to reformulate chance constraints into tractable expressions by tuning backoffs offline from generated closed-loop Monte Carlo samples, which balances the risk of constraint violation and robustness as well as guarantees online recursive feasibility. The backoff method improves vehicle tracking performance by mitigating the conservatism inherent in Chebyshev's inequality method. The resulting formulation yields a finite-horizon stochastic optimal control problem (FH-SOCP) that integrates vehicle and obstacle chance constraints with state mean and covariance. Simulation results confirm that sparse GP-MDPC using backoffs achieves superior computational efficiency while maintaining satisfactory tracking performance as well as ensuring safety compared to full GP-MDPC either using backoffs or Chebyshev’s inequality.
|
| |
| 16:45-17:00, Paper WeC13.6 | Add to My Program |
| An Actor-Critic-Identifier Control Design for Increasing Energy Efficiency of Automated Electric Vehicles |
|
| Faghihian, Hamed | University of South Florida |
| Sargolzaei, Arman | University of South Florida |
Keywords: Automotive control, Reinforcement learning, Autonomous systems
Abstract: Electric vehicles (EVs) can achieve greater operational effectiveness when range limitations are mitigated. Enhancing energy efficiency through advanced control strategies is therefore of significant importance, and recent developments in vehicle automation provide a promising opportunity to address this challenge. However, many existing approaches rely on indirect methods, primarily because establishing a direct relationship between control inputs and power consumption is inherently difficult. To address this limitation, a neural network (NN)-based identifier is proposed to learn this mapping online and integrate it with an actor-critic reinforcement learning (RL) framework to synthesize an optimal control policy. The resulting actor-critic-identifier architecture eliminates the reliance on explicit system models while maintaining accurate speed tracking and improving energy efficiency. In the proposed framework, the update laws of the neural networks are derived using Lyapunov-based stability analysis to ensure closed-loop stability. The effectiveness of the method is evaluated through simulation studies. Comparative results with a baseline controller demonstrate that the proposed approach improves overall energy efficiency by 42%, indicating substantial potential for enhancing the energy efficiency of electric vehicles.
|
| |
| WeC14 Regular Session, Grand Salon 21 |
Add to My Program |
| Machine Learning I |
|
| |
| Chair: Pare, Philip E. | Purdue University |
| Co-Chair: Farhood, Mazen | Virginia Tech |
| |
| 15:30-15:45, Paper WeC14.1 | Add to My Program |
| Learning Passive Continuous-Time Dynamics with Multistep Port-Hamiltonian Gaussian Processes |
|
| Leung, Humphrey | Purdue University |
| Pare, Philip E. | Purdue University |
Keywords: Learning, Identification for control, Nonlinear systems identification
Abstract: We propose the multistep port-Hamiltonian Gaussian process (MS-PHS GP) to learn physically consistent continuous-time dynamics and a posterior over the Hamiltonian from noisy, irregularly-sampled trajectories. By placing a GP prior on the Hamiltonian surface H and encoding variable-step multistep integrator constraints as finite linear functionals, MS-PHS GP enables closed-form conditioning of both the vector field and the Hamiltonian surface without latent states, while enforcing energy balance and passivity by design. We state a finite-sample vector-field bound that separates the estimation and variable-step discretization terms. Lastly, we demonstrate improved vector-field recovery and well-calibrated Hamiltonian uncertainty on mass-spring, Van der Pol, and Duffing benchmarks.
|
| |
| 15:45-16:00, Paper WeC14.2 | Add to My Program |
| Data-Driven Discrepancy Modeling in Higher-Dimensional State Space Via Coprime Factorization |
|
| Sinha, Sourav | Virginia Tech |
| Farhood, Mazen | Virginia Tech |
Keywords: Learning, Identification for control, Robust control
Abstract: This work provides a data-driven framework that combines coprime factorization with a lifting linearization technique to model the discrepancy between a nonlinear system and its nominal linear approximation using a linear time-invariant (LTI) state-space model in a higher-dimensional state space. In the proposed framework, the nonlinear system is represented in terms of the left coprime factors of the nominal linear system, along with perturbations modeled as stable, norm-bounded LTI systems in a higher-dimensional state space using a deep learning approach. Our method builds on a recently proposed parametrization for norm-bounded systems, enabling the simultaneous minimization of the H-infinity norm of the learned perturbations. We also provide a coprime factorization-based approach as an alternative to direct methods for learning lifted LTI approximations of nonlinear systems. In this approach, the LTI approximations are obtained by learning their left coprime factors, which remain stable even when the original system is unstable. The effectiveness of the proposed discrepancy modeling approach is demonstrated through multiple examples.
|
| |
| 16:00-16:15, Paper WeC14.3 | Add to My Program |
| Entropy-Regularized Two-Stage State-Only Imitation Learning under Unknown Dynamics |
|
| Wang, Dejin | Northeastern University |
| Ghoreishi, Seyede Fatemeh | Northeastern University |
Keywords: Learning, Machine learning
Abstract: Imitation learning (IL) enables policy recovery from demonstrations without requiring reward specification; however, most approaches assume access to expert actions and known dynamics---assumptions that are rarely satisfied in realistic domains. We propose a two-stage state-only IL framework that first learns a dynamics surrogate from auxiliary action-labeled data and then recovers an expert-like policy from purely state-only demonstrations. This decoupling is a key distinction from prior state-only IL methods, providing a principled path to stable policy recovery. To address the policy identifiability challenge inherent to state-only settings, we incorporate entropy regularization, which encourages deterministic solutions and improves robustness rather than acting as an ad hoc penalty. Our framework is supported by a theoretical analysis clarifying how modeling and policy errors impact imitation accuracy. Experiments conducted in autonomous driving scenarios within the CARLA simulator show that our method achieves higher action accuracy, lower variance, and improved control rewards compared to state-of-the-art baselines.
|
| |
| 16:15-16:30, Paper WeC14.4 | Add to My Program |
| Learning Transferable Friction Models and LuGre Identification Via Physics-Informed Neural Networks |
|
| Ozmen, Asutay | Univ. of California, Santa Barbara |
| Hespanha, Joao P. | Univ. of California, Santa Barbara |
| Byl, Katie | Univ. of California at Santa Barbara |
Keywords: Learning, Modeling, Simulation
Abstract: Accurately modeling friction in robotics remains a core challenge, as robotics simulators like MuJoCo and PyBullet use simplified friction models or heuristics to balance computational efficiency with accuracy, where these simplifications and approximations can lead to substantial differences between simulated and physical performance. In this paper, we present a physics-informed friction estimation framework that enables the integration of well-established friction models with learnable components, requiring only minimal, generic measurement data. Our approach enforces physical consistency yet retains the flexibility to capture complex friction phenomena. We demonstrate, on an underactuated and nonlinear system, that the learned friction models, trained solely on small and noisy datasets, accurately reproduce dynamic friction properties with significantly higher fidelity than the simplified models commonly used in robotics simulators. Crucially, we show that our approach enables the learned models to be transferable to systems they are not trained on. This ability to generalize across multiple systems streamlines friction modeling for complex, underactuated tasks, offering a scalable and interpretable path toward improving friction model accuracy in robotics and control.
|
| |
| 16:30-16:45, Paper WeC14.5 | Add to My Program |
| Adaptive Control with Integral Concurrent Learning for Systems with Uncertain Dynamical and Control Effectiveness Terms |
|
| Gul, Zeki | Ege University |
| Tatlicioglu, Enver | Ege University |
| Deniz, Meryem | Izmir Katip Celebi University |
| Zergeroglu, Erkan | Gebze Technical University |
Keywords: Learning, Stability of linear systems, Estimation
Abstract: This work presents a concurrent learning based adaptive controller formulation for a class of uncertain linear systems having unknown effectiveness matrix that represents potential degradations in actuator performance. Specifically, making use of a memory stack containing selected data from the recorded and current information gathered from the system's runs incorporated with a model reference adaptation algorithm, asymptotically convergent error tracking is guaranteed. When the memory stack contains sufficiently rich system information, the proposed methodology ensures both exponential convergence of the tracking error signal and exact estimation of the parameters.
|
| |
| 16:45-17:00, Paper WeC14.6 | Add to My Program |
| Data-Driven Practical Stabilization of Nonlinear Systems Via Chain Policies: Sample Complexity and Incremental Learning |
|
| Siegelmann, Roy | Massachusetts Institute of Technology |
| Mallada, Enrique | Johns Hopkins University |
Keywords: Learning, Stability of nonlinear systems, Lyapunov methods
Abstract: We propose a method for data-driven practical stabilization of nonlinear systems with provable guarantees, based on the novel concept of Nonparametric Chain Policies (NCPs). The approach employs a normalized nearest-neighbor rule to assign, at each state, a finite-duration control signal derived from stored data, after which the process repeats. Unlike recent works that model the system as linear, polynomial, or polynomial fraction, we only require the system to be locally Lipschitz. Our analysis builds on the framework of Recurrent Lyapunov Functions (RLFs), which enable data-driven certification of stability using standard norm functions instead of requiring the explicit construction of a classical Lyapunov function. To extend this framework, we introduce the concept of Recurrent Control Lyapunov Functions (R-CLFs), which can certify the existence of an NCP that practically stabilizes an arbitrarily small c-neighborhood of an equilibrium point. We also provide an explicit sample complexity guarantee of O((3/rho)^d log(R/c)) number of trajectories—where R is the domain radius, d the state dimension, and rho is a system-dependent constant. The proposed Chain Policies are nonparametric, thus allowing new verified data to be readily incorporated into the policy to either improve convergence rate or enlarge the certified region. Numerical experiments illustrate and validate these properties.
|
| |
| WeC15 Regular Session, Grand Salon 22 |
Add to My Program |
| Algebraic and Geometric Methods |
|
| |
| Chair: Clark, William | Ohio University |
| Co-Chair: Verriest, Erik I. | Georgia Inst. of Tech |
| |
| 15:30-15:45, Paper WeC15.1 | Add to My Program |
| Geometric Insight in Solving Optimal Control Problems and the Emergence of Generalized Functions |
|
| Verriest, Erik I. | Georgia Inst. of Tech |
Keywords: Algebraic/geometric methods, Optimal control
Abstract: Geometric insight may lead to a quick solution for a class of non-LQ optimal control problems. We illustrate this with a simple example that looks very inconspicuous. While necessary conditions for optimality are easily obtained, their analytic solution may not be easy. But some problems are reducible to an Euclidean distance problem. This insight then leads to the additional realization that in some cases, optimality may require impulsive inputs. However, Dirac deltas cannot be compatible with nonlinear operations, at least not in Schwartz's distribution theory. Thus, it seems that we may have a solution but not a theory. Since the solution is transparent in its geometric form, it suggests that another approach to generalized functions, as proposed by Colombeau, should be used. This is very valuable as it corroborates our earlier work (MTNS-24). Generalizations are then sought for other problems reducible to Euclidean minimum distance problems, and even more general Riemannian spaces. We make some connections with the notion of persistence of behavior, where these results apply.
|
| |
| 15:45-16:00, Paper WeC15.2 | Add to My Program |
| On the Equivalence of Koopman Eigenfunctions and Commuting Symmetries |
|
| Jiang, Xinyuan | Pennsylvania State University |
| Li, Yan | The Pennsylvania State University |
Keywords: Algebraic/geometric methods, Reduced order modeling
Abstract: The Koopman operator framework offers a way to represent a nonlinear system as a linear one. The key to this simplification lies in the identification of eigenfunctions. While various data-driven algorithms have been developed for this problem, a theoretical characterization of Koopman eigenfunctions from geometric properties of the flow is still missing. This paper provides such a characterization by establishing an equivalence between a set of Koopman eigenfunctions and a set of commuting symmetries -- both assumed to span the tangent spaces at every point on a simply connected open set. Based on this equivalence, we derive an explicit formula for the principal Koopman eigenfunctions and prove its uniform convergence on the region of attraction of a locally asymptotically stable equilibrium point, thereby offering a constructive method for computing Koopman eigenfunctions.
|
| |
| 16:00-16:15, Paper WeC15.3 | Add to My Program |
| Variations on the Retarded Oscillator through Diffeology |
|
| Clark, William | Ohio University |
Keywords: Delay systems, Algebraic/geometric methods, Variational methods
Abstract: The dynamics and evolution of many real-world systems depend on past values of the states due to the finite propagation speed of information. One way to model such systems is through delay differential equations. Unlike ordinary differential equations, delay systems are inherently infinite-dimensional which makes their analysis more difficult. Standard approaches for studying these systems is through either functional analysis or perturbation analysis as the delay vanishes. This work introduces the idea of diffeology as a possible framework for the qualitative geometric study of delay differential equations. As an application of this, the variational equation is presented for retarded delay differential equations with state-dependent delays. As a case-study, this theory is applied to the retarded oscillator.
|
| |
| 16:15-16:30, Paper WeC15.4 | Add to My Program |
| Traffic Characterization of Event-Triggered Control Systems: A Geometric-Algebraic Perspective |
|
| Chen, Tao | Central South University |
| Wang, Hongju | Central South University |
| Hu, Wenfeng | Central South University |
Keywords: Hybrid systems, Modeling
Abstract: The triggering behaviors of event-triggered control systems are investigated in this article and characterized by inter-event times (IETs) and the transitions among them. Such behaviors are particularly significant in resource-constrained networked control systems, where IETs determine packet transmission instants and shape communication traffic, directly affecting bandwidth utilization and closed-loop stability. While the feasibility of inter-event time (IET) transitions is conventionally formulated as a complex constraint satisfaction problem, we reformulate it into an equivalent linear cone problem, which provides a clearer description of the feasible region for reliable analysis. Based on this formulation, necessary and sufficient conditions are established to determine the feasibility of IET transitions, together with an algorithm to compute the set of all feasible IET transitions. Numerical simulations illustrate how transition feasibility varies with the control parameter and provide insight for parameter selection and system design.
|
| |
| 16:30-16:45, Paper WeC15.5 | Add to My Program |
| Homogeneous Approximations and Fixed Time Semistability with Applications to Network Consensus |
|
| Verma, Kriti | Georgia Institute of Technology |
| Haddad, Wassim M. | Georgia Inst. of Tech |
Keywords: Stability of nonlinear systems, Algebraic/geometric methods, Lyapunov methods
Abstract: This paper studies properties of homogeneous approximations of systems with a continuum of equilibrium points in a geometric, coordinate-free setting. Specifically, we develop the concept of homogeneity in the 0-limit and ∞-limit (i.e., homogeneity in the bi-limit) with respect to general dilations. A key contribution of the paper is a result relating the regularity properties of homogeneous in the bi-limit functions and vector fields to their degree of homogeneity and the local behavior of the dilations near the equilibrium and infinity. Specifically, we develop new finite time and fixed time semistability results for a class dynamical systems possessing a continuum of equilibria which are homogeneous in the bi-limit with respect to a semi-Euler vector field. Finally, we use these results to develop thermodynamically inspired distributed consensus control protocols for multiagent network systems for achieving coordination tasks in fixed time.
|
| |
| 16:45-17:00, Paper WeC15.6 | Add to My Program |
| Source-Seekers with Obstacle Avoidance for Nonholonomic Vehicles with Perception: A Model-Free Geometric Hybrid Control Approach |
|
| Zhang, Xiyuan | University of California San Diego |
| Abdelgalil, Mahmoud | University of California, San Diego |
| Poveda, Jorge I. | University of California, San Diego |
Keywords: Optimization algorithms, Learning, Hybrid systems
Abstract: We study the problem of steering nonholonomic vehicles toward the source of an unknown signal, accessible only through noisy measurements, while navigating cluttered environments using onboard perception instead of position data. Such stabilization cannot be robustly achieved with smooth feedback laws due to topological obstructions and geometric constraints induced by the obstacles and the nonholonomic dynamics. To address this, we propose a model-free hybrid controller based on Lie-bracket averaging, integrating perception maps from a transformer-based mixture-of-experts model with geometric control. Perception maps provide camera-derived state information, while the hybrid controller ensures robust source seeking. We show that, under bounded and even adversarial perception errors, the closed-loop system achieves semi-global practical asymptotic stability and guarantees obstacle avoidance. Experiments in dense environments demonstrate consistent collision-free trajectories and reliable convergence, closely matching the ground truth.
|
| |
| WeC16 Regular Session, Grand Salon 24 |
Add to My Program |
| Reinforcement Learning III |
|
| |
| Chair: Wongpiromsarn, Tichakorn (Nok) | Iowa State University |
| Co-Chair: Xu, Zhe | Arizona State University |
| |
| 15:30-15:45, Paper WeC16.1 | Add to My Program |
| Encoding High-Level Knowledge in Offline Multi-Agent Reinforcement Learning Using Reward Machines |
|
| Meshkat Alsadat, Shayan | Arizona State University |
| Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning
Abstract: Offline reinforcement learning (RL) learns policies from fixed-size datasets without interacting with the environment, while multi-agent reinforcement learning (MARL) faces challenges from large joint state-action spaces and agent interdependencies. Most offline MARL methods apply regularizations, ignoring system-wide dependencies, risking extrapolation errors. We propose Automata-Guided Multi-Agent Offline RL with Reward Machine (AGMORL), a novel framework extending automata learning to offline MARL with reward machines. AGMORL uses a deterministic finite automaton to learn the reward machine from a dataset, capturing team dynamics and agent interactions, while guiding individual policies to avoid out-of-distribution actions by encoding dataset knowledge. Unlike other methods, it avoids secondary components like generative models. We provide convergence guarantees to an optimal policy and show AGMORL outperforms state-of-the-art offline MARL methods.
|
| |
| 15:45-16:00, Paper WeC16.2 | Add to My Program |
| Enhancing Multi-Agent Reinforcement Learning by Mining Decomposed Causal Signal Temporal Logic Formula |
|
| Partovi Aria, Hadi | Arizona State University |
| Xu, Zhe | Arizona State University |
Keywords: Reinforcement learning, Agents-based systems
Abstract: Reinforcement Learning (RL) has seen significant success in training agents to perform complex sequential tasks. However, challenges remain in multi-agent contexts where agents must coordinate to satisfy temporal and causal specifications. This paper introduces MASTL-CIRL (Multi-Agent Signal Temporal Logic for Causal Inference in Reinforcement Learning), a framework that extends single-agent causal STL inference to multi-agent systems. Our framework mines interpretable causal specifications that capture temporal dependencies while enabling decentralized execution through principled task decomposition. Experimental results in cooperative multi-agent environments demonstrate that our method outperforms traditional approaches, achieving higher sample efficiency, more robust policies, and better generalization to novel scenarios. The decomposed formulas retain causal interpretability while significantly reducing communication requirements between agents during execution.
|
| |
| 16:00-16:15, Paper WeC16.3 | Add to My Program |
| Incorporating System-Level Safety Requirements in Perception Models Via Reinforcement Learning |
|
| Fan, Weisi | Iowa State Univeristy |
| Lane, Jesse | Iowa State University |
| Liu, Qisai | Iowa State University |
| Sarkar, Soumik | Iowa State University |
| Wongpiromsarn, Tichakorn (Nok) | Iowa State University |
Keywords: Reinforcement learning, Autonomous systems, Machine learning
Abstract: Control of autonomous systems is often designed under the assumption of accurate perception. However, perception models are typically trained in isolation using metrics such as accuracy, precision, and recall, which fail to capture the downstream safety consequences of perception errors. Conventional loss functions, including cross-entropy and negative log-likelihood, treat all misclassifications equally and ignore how perception errors propagate into decision-making and control. To address this limitation, we propose a safety-driven training paradigm that explicitly integrates system-level safety objectives into the optimization of perception models. Formally specified safety requirements, expressed using the rulebook formalism, are translated into quantitative violation scores that reflect the severity of perception-induced failures. These scores are incorporated into a reinforcement learning–based fine-tuning process, enabling perception models to adjust their internal representations to reduce high-risk system-level violations. Experiments conducted in the high-fidelity CARLA simulator show that, compared to state-of-the-art perception models trained with conventional objectives, our approach reduces average safety violations by up to 50% under challenging driving scenarios. The proposed framework provides a principled and controller-agnostic mechanism for aligning perception training with formal safety specifications, offering a practical pathway toward safer learning-enabled autonomous systems.
|
| |
| 16:15-16:30, Paper WeC16.4 | Add to My Program |
| Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning |
|
| Baheri, Ali | Rochester Institute of Technology |
Keywords: Reinforcement learning, Cooperative control, Distributed control
Abstract: Cooperative multi-agent reinforcement learning (MARL) demands principled mechanisms to align heterogeneous policies while preserving the capacity for specialized behavior. We introduce a novel consensus framework that defines the team strategy as the entropic-regularized p-Wasserstein barycenter of agents’ joint state–action visitation measures. By augmenting each agent’s policy objective with a soft penalty proportional to its Sinkhorn divergence from this barycenter, the proposed approach encourages coherent group behavior without enforcing rigid parameter sharing. We derive an algorithm that alternates between Sinkhorn-barycenter computation and policy-gradient updates, and we prove that, under standard Lipschitz and compactness assumptions, the maximal pairwise policy discrepancy contracts at a geometric rate. Empirical evaluation for a cooperative navigation case study demonstrates that our OT-barycenter consensus outperforms an independent learners baseline in convergence speed and final coordination success.
|
| |
| 16:30-16:45, Paper WeC16.5 | Add to My Program |
| Dynamics-Augmented Reinforcement Learning for Partially Measurable Nonlinear Systems |
|
| Patil, Aditya Satish | University of Minnesota |
| Sun, Zongxuan | University of Minnesota |
| Kim, Kenneth | DEVCOM Army Research Laboratory |
| Kweon, Chol-Bum | DEVCOM Army Research Laboratory |
Keywords: Reinforcement learning, Data driven control
Abstract: Designing effective controllers for nonlinear systems without access to accurate models or full state measurements remains a core challenge in control engineering. While model-free reinforcement learning methods such as Deep Deterministic Policy Gradient (DDPG) can achieve high performance when full state information is available, their effectiveness typically degrades under partial state measurement. This paper proposes a two-phase control framework that decouples dynamics representation learning from policy optimization. A key novelty of our approach is the explicit use of measurement history to learn a predictive model of system dynamics, enabling a compact, dynamics-aware embedding to be constructed offline. This learned representation is then used to augment the input to a DDPG agent, facilitating stable and sample-efficient control under partial measurement. We evaluate the proposed approach on a high speed trajectory tracking task for a nonlinear electrohydraulic system. Results show that the method achieves desired tracking and rapid convergence under partial state measurement, outperforming recurrent reinforcement learning approaches in both sample efficiency and final tracking performance.
|
| |
| 16:45-17:00, Paper WeC16.6 | Add to My Program |
| Expert-Augmented Distributed Multi-Agent Reinforcement Learning with Jensen–Shannon Regularization: A Policy Gradient Approach |
|
| Wang, Dongming | University of California, Riverside |
| Zhu, Yuhan | University of California, Riverside |
| Zhang, Yanyu | University of California, Riverside |
| Ren, Wei | University of California, Riverside |
Keywords: Reinforcement learning, Networked control systems, Distributed control
Abstract: In this paper, we propose an expert-augmented distributed multi-agent reinforcement learning (MARL) framework that integrates local prior knowledge from demonstrations, pretrained models, or transfer learning, while relying only on local observations and one-hop neighbor communication. To quantify the quality of expert policies, we introduce a Jensen–Shannon divergence–based metric. In the general case, we establish that the algorithm retains sublinear convergence to a neighborhood of the optimum, with irreducible bias from local communication constraints, and achieves faster convergence in practice. In particular, when the expert policy is sufficiently close to the optimal, our algorithm establishes local linear convergence. Experiments in Level-Based Foraging environments support our theoretical findings, providing empirical validation. Overall, this framework offers a principled method to incorporate expert knowledge into distributed MARL, with promising potential applications in robotics and autonomous systems where demonstrations are available.
|
| |
| WeC17 Regular Session, Churchill A1 |
Add to My Program |
| Biological Systems I |
|
| |
| Chair: Del Vecchio, Domitilla | Massachusetts Institute of Technology |
| Co-Chair: Gracy, Sebin | South Dakota School of Mines and Technology |
| |
| 15:30-15:45, Paper WeC17.1 | Add to My Program |
| Game-Theoretic Social Distancing in Competitive Bi-Virus SIS Epidemics |
|
| Catalano, Benjamin | South Dakota School of Mines |
| Paarporn, Keith | University of Colorado, Colorado Springs |
| Gracy, Sebin | South Dakota School of Mines and Technology |
Keywords: Biological systems, Autonomous systems, Network analysis and control
Abstract: The numerous elements that drive infectious dis- eases are highly complex, especially social behaviors that evolve in tandem with the spreading of diseases. Moreover, recent studies highlight the importance of understanding how multiple strains will simultaneous spread through a population (e.g.Delta and Omicron variants of SARS-CoV-2). In this paper, we propose a bi-virus SIS epidemics model that is coupled with individual social distancing behaviors. The behaviors are governed by replicator equations from evolutionary game-theory. The prevalence of the strains impact individuals’ choices to social distance or not, and, in turn, their choices affect the spreading of the bi-virus epidemic. Our analysis identifies the system’s equilibria and their local stability properties, which reveals several isolated fixed points with varying levels of social distancing. We find that the only outcomes where co-existence of the two strains is possible are on line segments of equilibria that emerge when the reproduction numbers of both strains are equivalent, and under suitable conditions on our model parameters, said line segments are locally exponentially stable. We demonstrate our findings with several numerical simulations.
|
| |
| 15:45-16:00, Paper WeC17.2 | Add to My Program |
| Stochastic Analysis of Plasmids Competition in a Bacterial Switch |
|
| Ruolo, Iacopo | Massachusetts Institute of Technology |
| Lu, Eric | Massachusetts Institute of Technology |
| Del Vecchio, Domitilla | Massachusetts Institute of Technology |
Keywords: Biological systems, Cellular dynamics, Stochastic systems
Abstract: Recombinase-based toggle switches recently emerged as synthetic gene circuits for encoding reversible cellular memory in bacteria, yet an understanding of their dynamics has been lacking. In particular, when the state is flipped via external inputs, it is possible that plasmids with identical origins of replication coexist, leading to competition for plasmid replication machinery. The role of these dynamics in such circuits has not been analyzed. In this work, we model intracellular plasmid dynamics using deterministic and stochastic approaches to examine how competition by different plasmids for the same replication machinery shapes the long-term outcome of switching events. While deterministic analysis predicts monostability, analysis of a stochastic chemical reaction model reveals the existence of two absorbing sets in which only one of the plasmid species exists. These results indicate that the final outcome of a switching event depends on the ratio of incompatible plasmids within the cell and on the relative binding constants for the replication machinery, potentially leading to failure of state maintenance. This analysis investigates the characteristics of recombinase-based toggle switches accounting for plasmid competition and identifies key parameters that must be optimized to achieve long-term memory.
|
| |
| 16:00-16:15, Paper WeC17.3 | Add to My Program |
| Prescribed-Time Control of an Agent-Based SIWS Epidemic Model |
|
| Abbasi, Zohreh | University of Waterloo |
| Liu, Xinzhi | University of Waterloo |
Keywords: Biological systems, Stability of nonlinear systems, Control applications
Abstract: This paper proposes a multi-agent approach to a multilayer, interconnected, and heterogeneous SIWS (Susceptible-Infected-Water-Susceptible) epidemic model incorporating population flow. The layers represent city-to-city connectivity, water resources, and designated leaders. A layer-specific prescribed-time synchronization strategy is applied to ensure infection eradication across cities and water systems within a user-defined time horizon. A saturated control input model is considered to account for limited medical resources and to enforce positive infection rates, addressing real-world constraints. Theoretical results are validated through a case study modeled on Louisiana, U.S., which demonstrates effective epidemic suppression within the prescribed time frame.
|
| |
| 16:15-16:30, Paper WeC17.4 | Add to My Program |
| Feedback Regulation of Protein Synthesis with State-Dependent Delays |
|
| Chatterjee, Poulami | University of Delaware |
| Singh, Abhyudai | University of Delaware |
Keywords: Biological systems, Stochastic systems, Modeling
Abstract: Cells must maintain stable protein levels despite the stochastic nature of gene expression, as large fluctuations can disrupt cellular function and decision-making. Mechanisms such as negative feedback and delays in protein activation help buffer this fluctuation, but the role of state-dependent delays remains unclear. We develop a stochastic model where proteins are produced in bursts as inactive molecules, pass through intermediate activation steps, and then degrade. The activation delay depends on the current active protein level, creating a state-dependent feedback loop. We derive analytical expressions linking delay structure and feedback strength to fluctuations in active protein levels, quantified by the Fano factor. Our results show that state-dependent delays can reduce fluctuations below the baseline from burst frequency, and stochastic simulations confirm these predictions. Incorporating negative feedback in burst production further suppresses variability while maintaining stable system behavior. These findings highlight how temporal, state-dependent regulation stabilizes protein expression and can inform the design of robust synthetic gene circuits.
|
| |
| 16:30-16:45, Paper WeC17.5 | Add to My Program |
| Bayesian Nonlinear State-Space Modeling and Chance-Constrained Predictive Control for Hemodynamic Resuscitation |
|
| Estiri, Elham | Kent State University |
| Mirinejad, Hossein | Kent State University |
Keywords: Predictive control for nonlinear systems, Biomedical, Stochastic optimal control
Abstract: This paper presents a unified framework for uncertainty-aware modeling and stochastic control of fluid resuscitation. A Bayesian nonlinear state–space model (BNSSM) is developed to generate calibrated multi-step predictions of mean arterial pressure (MAP) while jointly propagating aleatoric and epistemic uncertainty. Monte Carlo rollouts of the learned model are integrated into a chance-constrained stochastic model predictive control (CC-SMPC) formulation with joint uncertainty, which enforces safety bounds on MAP and infusion rates. The accuracy of the BNSSM was evaluated on real hemorrhage–resuscitation data, demonstrating its ability to capture physiologically consistent MAP dynamics. Furthermore, the proposed CC-SMPC controller outperformed its deterministic counterpart in fluid dose adjustment, achieving safer and more precise regulation of MAP. These results highlight the framework’s ability to jointly address epistemic and aleatoric uncertainty, providing a more reliable foundation for closed-loop fluid management in critical care.
|
| |
| WeC18 Regular Session, Churchill A2 |
Add to My Program |
| Constrained Control III |
|
| |
| Chair: Nicotra, Marco M | University of Colorado Boulder |
| |
| 15:30-15:45, Paper WeC18.1 | Add to My Program |
| From Bundles to Backstepping: Geometric Control Barrier Functions for Safety-Critical Control on Manifolds |
|
| de Sa, Massimiliano | California Institute of Technology |
| Ong, Pio | California Institute of Technology |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Algebraic/geometric methods, Lyapunov methods
Abstract: Control barrier functions (CBFs) have a well-established theory in Euclidean spaces, yet still lack general formulations and constructive synthesis tools for systems evolving on manifolds common in robotics and aerospace applications. In this paper, we develop a general theory of geometric CBFs on bundles and, for control-affine systems, recover the standard optimization-based CBF controllers and their smooth analogues. Then, by generalizing kinetic energy-based CBF backstepping to Riemannian manifolds, we provide a constructive CBF synthesis technique for geometric mechanical systems, as well as easily verifiable conditions under which it succeeds. Further, this technique utilizes mechanical structure to avoid computations on higher-order tangent bundles. We demonstrate its application to an underactuated satellite on SO(3).
|
| |
| 15:45-16:00, Paper WeC18.2 | Add to My Program |
| Importance of the Input Cost When Designing Control Barrier Function Safety Filters |
|
| Freire, Victor | University of Colorado Boulder |
| Nicotra, Marco M | University of Colorado Boulder |
Keywords: Constrained control
Abstract: Control barrier function (CBF)-based safety filters seek to enforce constraints by ``minimally adjusting'' a nominal control input. This minimal adjustment is usually measured using a simple Euclidean norm in input space. In this work, we study how the use of weighed norms can affect the performance of CBF safety filters. Specifically, we seek to minimize the impact of the input error on the nominal performance, as opposed to minimizing the input error itself. Numerical simulations show that the proper weighing of the cost function of the CBF quadratic program can have a profound effect on the overall performance of the closed-loop system.
|
| |
| 16:00-16:15, Paper WeC18.3 | Add to My Program |
| Probabilistic Control Barrier Functions: Safety in Probability for Discrete-Time Stochastic Systems |
|
| Mestres, Pol | California Institute of Technology |
| Werner, Blake | California Institute of Technology |
| Cosner, Ryan | California Institute of Technology |
| Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control
Abstract: Control systems operating in the real world face countless sources of unpredictable uncertainties. These random disturbances can render deterministic guarantees inapplica- ble and cause catastrophic safety failures. To overcome this, this paper proposes a method for designing safe controllers for discrete-time stochastic systems that retain probabilistic guarantees of safety. To do this we modify the traditional notion of a control barrier function (CBF) to explicitly account for these stochastic uncertainties and call these new modified functions probabilistic CBFs. We show that probabilistic CBFs can be used to design controllers that guarantee safety over a finite number of time steps with a prescribed probability. Next, by leveraging various uncertainty quantification methods, such as concentration inequalities and the scenario approach, we provide a variety of sufficient conditions that result in computationally tractable controllers with tunable probabilistic guarantees across a plethora of practical scenarios. Finally, we showcase the applicability of our results in simulation and hardware for the control of a quadruped robot.
|
| |
| 16:15-16:30, Paper WeC18.4 | Add to My Program |
| Safe Trajectory Planning with Bernstein Polynomials under Control Barrier Function Constraints |
|
| Lee, Jungeun | UNIST |
| Lee, Seongjae | Ulsan National Institute of Science and Technology (UNIST) |
| Jeon, Jeong hwan | Ulsan National Institute of Science and Technology |
Keywords: Constrained control, Predictive control for nonlinear systems, Optimal control
Abstract: Ensuring safe trajectory planning of autonomous mobile robots in environments with static and dynamic obstacles is a critical challenge for real-world deployment. Existing approaches, such as nonlinear model predictive control (NMPC) with control barrier functions (CBFs), can provide safety guarantees. However, their computational complexity scales with the prediction horizon, which can hinder real-world feasibility. We propose an NMPC algorithm that combines discrete-time CBFs with Bernstein polynomial parameterization. By constraining a constant number of Bernstein control points within a dynamically generated safe corridor, the resulting trajectory is guaranteed to remain forward invariant within the safe set. This formulation also reduces computational complexity compared with conventional approaches. Simulation results in both static and dynamic obstacle environments show that the proposed method achieves a high success rate while maintaining safety margins comparable to existing baselines. At the same time, computation time is significantly reduced and remains nearly constant as the prediction horizon increases. These results demonstrate the efficiency, robustness, and scalability of the proposed approach for safe trajectory planning in diverse scenarios.
|
| |
| 16:30-16:45, Paper WeC18.5 | Add to My Program |
| Barrier-Riccati Synthesis for Nonlinear Safe Control with Expanded Region of Attraction |
|
| Almubarak, Hassan | King Fahd University of Petroleum and Minerals |
| AL-Sunni, Maitham | Carnegie Mellon University |
| Dubbin, Justin | Georgia Institute of Technology |
| Sadegh, Nader | Georgia Inst. of Tech |
| Dolan, John | Carnegie Mellon University |
| Theodorou, Evangelos A. | Georgia Institute of Technology |
Keywords: Constrained control, Stability of nonlinear systems, Optimal control
Abstract: We present a Riccati-based framework for safety-critical nonlinear control that integrates the barrier states (BaS) methodology with the State-Dependent Riccati Equation (SDRE) approach. The BaS formulation embeds safety constraints into the system dynamics via auxiliary states, enabling safety to be treated as a control objective. To overcome the limited region of attraction in linear BaS controllers, we extend the framework to nonlinear systems using SDRE synthesis applied to the barrier-augmented dynamics and derive a matrix inequality condition that certifies forward invariance of a large region of attraction and guarantees asymptotic safe stabilization. The resulting controller is computed online via pointwise Riccati solutions. We validate the method on an unstable constrained system and cluttered quadrotor navigation tasks, demonstrating improved constraint handling, scalability, and robustness near safety boundaries. This framework offers a principled and computationally tractable solution for synthesizing nonlinear safe feedback in safety-critical environments.
|
| |
| 16:45-17:00, Paper WeC18.6 | Add to My Program |
| Autonomy Architectures for Safe Planning in Unknown Environments under Budget Constraints |
|
| Cherenson, Daniel | University of Michigan |
| Agrawal, Devansh Ramgopal | University of Michigan |
| Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Constrained control, Autonomous robots
Abstract: Mission planning can often be formulated as a constrained control problem under multiple path constraints (i.e., safety constraints) and budget constraints (i.e., resource expenditure constraints). In a priori unknown environments, verifying that an offline solution will satisfy the constraints for all time can be difficult, if not impossible. We present ReRoot, a novel sampling-based framework that enforces safety and budget constraints for nonlinear systems in unknown environments. The main idea is that ReRoot grows multiple reverse RRT* trees online, starting from renewal sets, i.e., sets where the budget constraints are renewed. The trees provide paths of minimal budget expenditure back to the renewal set, aiding the construction of safe dynamically feasible backup trajectories and providing a principled backup policy when integrated into the Gatekeeper safety verification architecture. We demonstrate our approach in simulation with a fixed-wing UAV in a GNSS-denied environment with a budget constraint on localization error that can be renewed at visual landmarks.
|
| |
| WeC19 Regular Session, Churchill B1 |
Add to My Program |
| Optimal Control II |
|
| |
| Chair: Drgona, Jan | Johns Hopkins University |
| Co-Chair: Cowlagi, Raghvendra V. | Worcester Polytechnic Institute |
| |
| 15:30-15:45, Paper WeC19.1 | Add to My Program |
| Safe Online Control-Informed Learning |
|
| Zhou, Tianyu | Purdue University |
| Liang, Zihao | Purdue University |
| Lu, Zehui | Independent Researcher |
| Mou, Shaoshuai | Purdue University |
Keywords: Optimal control, Constrained control, Data driven control
Abstract: This paper proposes a Safe Online Control- Informed Learning framework for safety-critical autonomous systems. The framework unifies optimal control, parameter estimation, and safety constraints into an online learning process. It employs an extended Kalman filter to incrementally update system parameters in real time, enabling robust and data-efficient adaptation under uncertainty. A softplus barrier function enforces constraint satisfaction during learning and control while eliminating the dependence on high-quality initial guesses. Theoretical analysis establishes convergence and safety guarantees, and the framework’s effectiveness is demonstrated on cart-pole and robot-arm systems.
|
| |
| 15:45-16:00, Paper WeC19.2 | Add to My Program |
| Trajectory Optimization for Cooperative Navigation by Solving the HJB Equation Using PINNs |
|
| Wojciechowski, Maria R. | Worcester Polytechnic Institute |
| Deshamouni, Jayaratna | Shrewsbury High School |
| Steffens, Michael | Draper |
| Cowlagi, Raghvendra V. | Worcester Polytechnic Institute |
Keywords: Optimal control, Cooperative control, Neural networks
Abstract: Cooperative navigation is a methodology where multiple mobile vehicles share navigational aiding information with the aim of improving the precision and accuracy of localization. This methodology can enable teams of autonomous mobile vehicles to navigate in, say, undersea and subterranean environments where conventional localization methods based on global satellite navigation systems and/or vision do not suffice. In such environments, information can be shared when vehicles are in close proximity. To this end, one approach is to assign the role of a cooperative navigation aid (CNA) to specific vehicles in the team, and to then optimize the trajectories of the remaining vehicles to achieve sufficient proximity to the CNA while conducting their tasks. We formulate this problem as an optimal control problem, where proximity to a moving CNA constitutes a "benefit" to be maximized while traversing between two prespecified fixed locations in the environment. We develop a physics-informed neural network (PINN)-based solution of the Hamilton-Jacobi-Bellman (HJB) equation for the problem of optimizing a weighted composite objective of maximizing the aid received from a CNA while also minimizing the time of travel. We describe the details of the PINN architecture and demonstrate its effectiveness via numerical simulations.
|
| |
| 16:00-16:15, Paper WeC19.3 | Add to My Program |
| Adaptive Input Shaper Design for Unknown Second-Order Systems with Real-Time Parameter Estimation |
|
| Aung, Nyi Nyi | Louisiana State University |
| Wight, Bradley | Louisiana State University |
| Stein, Adrian | Louisiana State University |
Keywords: Optimal control, Estimation, Adaptive control
Abstract: This work addresses the problem of designing an input shaper for an unknown system, where the switch time is highly sensitive to the system’s natural frequency. Applying the input-shaped reference either before or after the switch time typically results in poor performance. To overcome this challenge, a feedforward control method with an online parameter estimator is proposed for black-box second-order systems, in which the system parameters are estimated in real time and the input shaper is adapted accordingly. This approach allows the input-shaped reference to be applied even if the first switch time is missed, as the method accounts for the system’s periodic switch time and enables periodic application of the input shaper. Simulation results demonstrate accurate online parameter estimation and reference tracking with complete vibration suppression, achieved through the proposed adaptive input shaper design.
|
| |
| 16:15-16:30, Paper WeC19.4 | Add to My Program |
| Regret of H_infty Preview Controllers |
|
| Liu, Jietian | University of Michigan |
| Seiler, Peter | University of Michigan, Ann Arbor |
Keywords: Optimal control, H-infinity control, Linear systems
Abstract: This paper studies preview control in both the H_infty and regret-optimal settings. The plant is modeled as a discrete-time, linear time-invariant system subject to external disturbances. The performance baseline is the optimal non-causal controller that has full knowledge of the disturbance sequence. We first review the construction of the H_infty preview controller with p-steps of disturbance preview. We then show that the closed-loop H_infty performance of this preview controller converges as pto infty to the performance of the optimal non-causal controller. Furthermore, we prove that the optimal regret of the preview controller converges to zero. These results demonstrate that increasing preview length allows controllers to asymptotically achieve non-causal performance in both the H_infty and regret frameworks. A numerical example illustrates the theoretical results.
|
| |
| 16:30-16:45, Paper WeC19.5 | Add to My Program |
| Combined Learning and Control: A New Paradigm for Optimal Control with Unknown Dynamics |
|
| Kounatidis, Panagiotis | Cornell University |
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Optimal control, Learning
Abstract: In this paper, we present the combined learning-and-control (CLC) approach, which is a new way to solve optimal control problems with unknown dynamics by unifying model-based control and data-driven learning. The key idea is simple: we design a controller to be optimal for a proxy objective built on an available model while penalizing mismatches with the real system, so that the resulting controller is also optimal for the actual system. Building on the original CLC formulation, we demonstrate the framework to the linear–quadratic regulator problem and make three advances: (i) we show that the CLC penalty is a sequence of stage-specific weights rather than a single constant; (ii) we identify when these weights can be set in advance and when they must depend on the (unknown) dynamics; and (iii) we develop a lightweight learning loop that tunes the weights directly from data without abandoning the benefits of a model-based design. We provide a complete algorithm and an empirical study against common baseline methods. The results clarify where prior knowledge suffices and where learning is essential, and they position CLC as a practical, theoretically grounded bridge between classical optimal control and modern learning methods.
|
| |
| 16:45-17:00, Paper WeC19.6 | Add to My Program |
| Zero-Shot Transferable Solution Method for Parametric Optimal Control Problems |
|
| Li, Xingjian | University of Texas at Austin |
| Kan, Kai Fung (Kelvin) | UCLA |
| Verma, Deepanshu | Clemson University |
| Kumar, Krishna | University of Texas at Austin |
| Osher, Stanley | University of California, Los Angeles |
| Drgona, Jan | Johns Hopkins University |
Keywords: Optimal control, Machine learning, Neural networks
Abstract: This paper presents a transferable solution method for optimal control problems with varying objectives using function encoder (FE) policies. Traditional optimization-based approaches must be re-solved whenever objectives change, resulting in prohibitive computational costs for applications requiring frequent evaluation and adaptation. The proposed method learns a reusable set of neural basis functions that spans the control policy space, enabling efficient zero-shot adaptation to new tasks through either projection from data or direct mapping from problem specifications. The key idea is an offline–online decomposition: basis functions are learned once during offline imitation learning, while online adaptation requires only lightweight coefficient estimation. Numerical experiments across diverse dynamics, dimensions, and cost structures show our method delivers near-optimal performance with minimal overhead when generalizing across tasks, enabling semi-global feedback policies suitable for real-time deployment.
|
| |
| WeC20 Regular Session, Churchill B2 |
Add to My Program |
| Model Predictive Control I |
|
| |
| Chair: Trimboli, Michael | University of Colorado, Colorado Springs |
| Co-Chair: Schitz, Philipp | German Aerospace Center (DLR) |
| |
| 15:30-15:45, Paper WeC20.1 | Add to My Program |
| A Comparison of MPC Architectures for a Heat Pump |
|
| Quah, Titus | University of California Santa Barbara |
| Bortoff, Scott A. | Mitsubishi Electric Research Laboratories |
| Rawlings, James B. | University of California, Santa Barbara |
Keywords: Predictive control for linear systems, Building and facility automation, Process Control
Abstract: Modern heat pumps are strongly interactive multivariable systems, requiring rigorous application of multivariable control theory. Important factors to consider are robustness with respect to plant uncertainty, and enforcement of process constraints. This paper summarizes the design of two Model Predictive Control (MPC) architectures for a heat pump application, and compares their design methodologies and performance to a conventional selector-based multi-loop PID architecture. One MPC architecture is an H-infinity loop-shaped MPC that uses inverse optimal control to realize the robustifing compensator as a constrained optimization to enforce constraints. The second is an offset-free MPC architecture that preserves output tracking despite plant-model mismatch and unmeasured disturbances and can be tuned using operational data. We compare the two MPCs and the PID on a constrained heat-pump model, assessing closed-loop transients, robustness margins, and tuning complexity. In simulation, all controllers track set-points with similar performance. During sensible heat disturbance, a temperature limit prevents full rejection; even so, both MPC designs counterintuitively reduce the outdoor fan speed -- which, through coupled system interactions, increases net heat removal and yields improved disturbance rejection without violating constraints. As for robustness, all three designs meet disk-margin targets. For tuning complexity, offset-free MPC > H-infinity loop-shaped MPC > PID. Overall, on this plant, robustness is a tie; offset-free MPC delivers the strongest constrained disturbance handling but requires the most tuning, H-infinity loop-shaped MPC is the middle ground, and PID is simplest.
|
| |
| 15:45-16:00, Paper WeC20.2 | Add to My Program |
| Adaptive-R Dual-Mode Model Predictive Control |
|
| Undare, Suchita Anil | University of Colorado Colorado Springs |
| Karami, Kiana | Penn State Harrisburg |
| Trimboli, Michael | University of Colorado, Colorado Springs |
Keywords: Predictive control for linear systems, Constrained control, Optimal control
Abstract: We introduce Adaptive-R Dual-Mode Model Predictive Control (Adaptive-R DMPC), a systematic framework in which the input weight R is scheduled based on set membership over nested regions χCPI (constrained), χADAP (adaptive), and χMPI (invariant). Conventional DMPC reduces computation by switching from constrained MPC to LQR, but this switch at the terminal set may induce abrupt changes in the control input at the MPC–LQR transition, which can result in non- smooth state evolution and thereby motivate the proposed adaptive formulation. In our approach, the input weight R is scheduled inside an intermediate region χADAP, blending between a conservative and nominal value so that the feedback gain converges smoothly to KLQR without solving QPs in this region. This construction preserves recursive feasibility and stability while markedly reducing the number of QP solves and total computation time.
|
| |
| 16:00-16:15, Paper WeC20.3 | Add to My Program |
| Inexact-ADMM for Embedded Linear Model Predictive Control |
|
| Adegbege, Ambrose Adebayo | The College of New Jersey |
| Oluleti, Victor Pelumi | Obafemi Awolowo University |
| Harish, Nia | The College of New Jersey |
Keywords: Predictive control for linear systems, Constrained control, Optimization algorithms
Abstract: We consider an inexact variant of the Alternating Direction Method of Multipliers (ADMM) for quadratic programming problems commonly encountered in embedded model predictive control (MPC). The inexact-ADMM method retains the convergence properties and behavior of standard-ADMM but with reduced complexity of the first inner subproblem, involving only simple over-relaxed Uzawa steps. We establish a convergence rate of O(1/k) for the primal-dual gap, where k is the iteration count, and time complexity per iteration of O(N), where N is the prediction horizon. We showcase the efficiency and performance of the inexact-ADMM compared to the standard ADMM using a well-known numerical example.
|
| |
| 16:15-16:30, Paper WeC20.4 | Add to My Program |
| Robust Distributed Model Predictive Control under Limited Communication Rates and DoS Attacks |
|
| Cao, Xinyuan | Shanghai Jiao Tong University |
| Dai, Yufan | University of Victoria |
| Li, Shaoyuan | Shanghai Jiao Tong University |
| Shi, Yang | University of Victoria |
Keywords: Predictive control for linear systems, Distributed control, Quantized systems
Abstract: This paper proposes a distributed model predictive control (DMPC) strategy for multi-agent systems (MASs) under the combined challenges of disturbances, limited network communication rates, and denial of service (DoS) attacks. In contrast to existing approaches, this paper integrates robust resilient control strategies and quantized communication mechanisms within the DMPC framework to jointly address the multiple challenges. Specifically, tightened robustness constraints are first introduced to handle the external disturbances. Then, a lengthened and quantized transmission method is developed, where the lengthened strategy preserves system coordination against DoS attacks and the quantized encoder/decoder scheme significantly reduces communication cost and suppresses quantization error. Furthermore, a rigorous theoretical analysis is provided to establish the recursive feasibility and ensure stability of the closed-loop system. Finally, simulation results validate the performance of the proposed framework.
|
| |
| 16:30-16:45, Paper WeC20.5 | Add to My Program |
| Robust Helicopter Ship Deck Landing with Guaranteed Timing Using Shrinking-Horizon Model Predictive Control |
|
| Schitz, Philipp | German Aerospace Center (DLR) |
| Mercorelli, Paolo | Leuphana University of Lüneburg |
| Dauer, Johann C. | DLR (German Aerospace Center) |
Keywords: Predictive control for linear systems, Flight control, Autonomous systems
Abstract: Autonomous helicopter ship deck landing requires precise timing, robustness to disturbances, and strict constraint satisfaction. This paper presents a trajectory planning approach based on robust shrinking-horizon Model Predictive Control (SHMPC) with move blocking, enabling computationally efficient planning over long horizons while guaranteeing maneuver completion within a specified time. A simplified control-oriented helicopter model is introduced for trajectory generation, while attitude stabilization is handled by an inner-loop controller. Robustness is achieved using a tube-based formulation with a disturbance-observer-based ancillary controller. A terminal set designed via backward reachability guarantees that the helicopter reaches a state from which a safe touchdown can occur within a prescribed time window. Simulations using a nonlinear model of DLR’s midiARTIS demonstrator under strong wind and ship motion demonstrate accurate tracking, constraint satisfaction, and real-time feasibility.
|
| |
| WeC21 Regular Session, Churchill C1 |
Add to My Program |
| Optimization III |
|
| |
| Chair: Lin, Zongli | University of Virginia |
| Co-Chair: Canova, Marcello | The Ohio State University |
| |
| 15:30-15:45, Paper WeC21.1 | Add to My Program |
| Non-Ergodic Convergence Algorithms for Distributed Consensus and Coupling-Constrained Optimization |
|
| Qiu, Chenyang | University of Virginia |
| Lin, Zongli | University of Virginia |
Keywords: Optimization algorithms, Large-scale systems
Abstract: We study distributed convex optimization with two ubiquitous forms of coupling: consensus constraints and global affine equalities. We first design a linearized method of multipliers for the consensus optimization problem. Without smoothness or strong convexity, we establish non-ergodic sublinear rates of order O(1/sqrt{k}) for both the objective optimality and the consensus violation. Leveraging duality, we then show that the economic dispatch problem admits a dual consensus formulation, and that applying the same algorithm to the dual economic dispatch yields non-ergodic O(1/sqrt{k}) decay for the error of the summation of the cost over the network and the equality-constraint residual under convexity and Slater’s condition. Numerical results on the IEEE 118-bus system demonstrate faster reduction of both objective error and feasibility error relative to the state-of-the-art baselines, while the dual variables reach network-wide consensus.
|
| |
| 15:45-16:00, Paper WeC21.2 | Add to My Program |
| DC Fast Charging Station Queue Minimization through Dynamic Pricing |
|
| Abbasi, Mohammad Hossein | Clemson University |
| Zhang, Jiangfeng | Clemson University |
| Krovi, Venkat | Clemson University |
Keywords: Optimization algorithms, Lyapunov methods, Optimization
Abstract: This paper presents a novel approach to minimizing queues in DC fast charging stations (FCSs). The number of queuing electric vehicles (EVs) is explicitly modeled as a decision variable, defined as a function of departing EVs after completing their charging sessions. To enhance profitability, the objective function also accounts for FCS operating costs, while charging prices are dynamically adjusted based on user price sensitivity. The optimization problem is formulated within the Lyapunov Optimization (LO) framework, which decomposes it into one time step subproblems and eliminates the need for forecasting EV arrivals, while guaranteeing global optimality. Rigorous analysis establishes the theoretical guarantees of the proposed method. Furthermore, a Pareto Front study is conducted to support parameter tuning. Simulation results show that an FCS with four charging ports can serve 100 EVs per day with an average of only eight vehicles in the queue, illustrating the strength of the proposed strategy.
|
| |
| 16:00-16:15, Paper WeC21.3 | Add to My Program |
| Optimal Multimarginal Schrödinger Bridge: Minimum Spanning Tree Over Measure-Valued Vertices |
|
| Bondar, Georgiy Antonovich | UC Santa Cruz |
| Halder, Abhishek | Iowa State University |
Keywords: Optimization algorithms, Machine learning, Computational methods
Abstract: The Multimarginal Schrödinger Bridge (MSB) finds the optimal coupling among a collection of random vectors with known statistics and a known correlation structure. In the MSB formulation, this correlation structure is specified a priori as an undirected connected graph with measure-valued vertices. In this work, we formulate and solve the problem of finding the optimal MSB in the sense we seek the optimal coupling over all possible graph structures. We find that computing the optimal MSB amounts to solving the minimum spanning tree problem over measure-valued vertices. We show that the resulting problem can be solved in two steps. The first step constructs a complete graph with edge weight equal to a sum of the optimal value of the corresponding bimarginal SB and the entropies of the endpoints. The second step solves a standard minimum spanning tree problem over that complete weighted graph. Numerical experiments illustrate the proposed solution.
|
| |
| 16:15-16:30, Paper WeC21.4 | Add to My Program |
| Temporal Variabilities Limit Convergence Rates in Gradient-Based Online Optimization |
|
| Van Scoy, Bryan | Miami University |
| Bianchin, Gianluca | University of Louvain |
Keywords: Optimization algorithms, Optimization, Linear systems
Abstract: This paper investigates the fundamental performance limits of gradient-based algorithms for time-varying optimization. Leveraging the internal model principle and root locus techniques, we show that temporal variabilities impose intrinsic limits on the achievable rate of convergence. For a problem with condition ratio kappa and temporal variability described by a model of degree n, we show that the worst-case convergence rate of any minimal-order gradient-based algorithm (having access to a model of temporal variation) is rho_text{TV} = (frac{kappa-1}{kappa+1})^{1/n}. This bound reveals a fundamental tradeoff between problem conditioning, temporal complexity, and rate of convergence. We further construct explicit controllers that attain the bound for low-degree models of time variation.
|
| |
| 16:30-16:45, Paper WeC21.5 | Add to My Program |
| Quadratic Surrogate Attractor for Particle Swarm Optimization |
|
| Clemente, Maurizio | The Ohio State University |
| Canova, Marcello | The Ohio State University |
Keywords: Optimization algorithms, Optimization, Numerical algorithms
Abstract: This paper presents a particle swarm optimization algorithm that leverages surrogate modeling to replace the conventional global best solution with the minimum of an n-dimensional quadratic form, providing a better-conditioned dynamic attractor for the swarm. This refined convergence target, informed by the local landscape, enhances resilience against premature convergence and noisy environments while incurring only minimal computational overhead. The surrogate-augmented approach is evaluated against the standard algorithm through a numerical study on a set of benchmark optimization functions that exhibit diverse landscapes. To ensure statistical significance, 400 independent runs are conducted for each function and algorithm, and the results are analyzed based on their statistical characteristics and corresponding distributions. The quadratic surrogate attractor consistently outperforms the conventional algorithm across all tested functions. The improvement is particularly pronounced for quasi-convex functions, where the surrogate model can exploit the underlying convex-like structure of the landscape.
|
| |
| 16:45-17:00, Paper WeC21.6 | Add to My Program |
| Speed Limitations of Gradient-Based Algorithms for Heterogeneous Optimization |
|
| Wu, Wuwei | City University of Hong Kong |
| Wang, Miaomiao | City University of Hong Kong |
| Colaneri, Patrizio | Politecnico Di Milano |
| Chen, Jie | City University of Hong Kong |
Keywords: Optimization algorithms, Optimization, Robust control
Abstract: Modern large-scale optimization problems often involve heterogeneous components with diverse properties essential to the performance of optimization algorithms, which pose challenges for designing fast and efficient algorithms. This paper tackles the heterogeneity issue through a control-theoretic lens by modeling gradient-based optimization algorithms as feedback systems and utilizing frequency-domain tools, including Nevanlinna-Pick interpolation theory. This perspective enables a unified treatment of parallel and serial gradient-based algorithms, and allows us to establish the fastest achievable convergence rates for both architectures under the circle and Zames-Falb criteria. The results reveal a fundamental trade-off: when parallel computation is available, parallel algorithms achieve superior real-time performance, whereas in resource-constrained settings, serial algorithms offer faster iteration rates. The proposed approach provides a systematic foundation for analyzing and designing high-performance optimization algorithms in heterogeneous computing environments.
|
| |
| WeC22 Invited Session, Churchill C2 |
Add to My Program |
| Estimation and Control of Distributed Parameter Systems III |
|
| |
| Chair: Hu, Weiwei | University of Georgia |
| Co-Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
| Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
| Organizer: Hu, Weiwei | University of Georgia |
| |
| 15:30-15:45, Paper WeC22.1 | Add to My Program |
| Backstepping Stabilization of infty+infty Continua of Linear Hyperbolic Systems (I) |
|
| Humaloja, Jukka-Pekka | Technical University of Crete |
| Bekiaris-Liberis, Nikolaos | Technical University of Crete |
Keywords: Distributed parameter systems
Abstract: We consider the boundary stabilization problem for continua of PDE systems that are viewed as the limits of n+m linear hyperbolic systems, as n and m tend to infinity. We construct a full-state feedback law via introduction of a continuum-PDE backstepping transformation. The design procedure gives rise to two kernel equations evolving on a 4-D domain, whose well-posedness is established. We show stability of the continuum closed-loop system via construction of a novel Lyapunov functional. Effectiveness of the control design is illustrated in a numerical example.
|
| |
| 15:45-16:00, Paper WeC22.2 | Add to My Program |
| Optimal Sensor Location for Predictive Iterative Learning Control of Distributed Parameter Systems (I) |
|
| Patan, Maciej | University of Zielona Gora |
| Klimkowicz, Kamil | University of Zielona Gora |
| Patan, Krzysztof | University of Zielona Gora |
| Rogers, Eric | University of Southampton |
Keywords: Iterative learning control, Distributed parameter systems, Estimation
Abstract: This study aims to develop a robust computational scheme to address the optimal tracking control problem for repetitive distributed parameter systems, in situations where the controlled quantity is not directly observable. In such a case, the reliability of the model predictions becomes a crucial factor affecting the quality of control design in consecutive process replications. To maximize the accuracy of prediction, the optimal sensor location problem is considered. It consists in selecting gaged sites from among all available sites so that the suitable criterion defined on the Fisher information matrix associated with the model parameters is minimal. To effectively solve it, a relaxed convex optimization problem is formulated, and then suitable optimality conditions are provided. In result, the existing efficient convex optimization solvers can be easily applied. Then, the measurement locations providing the most informative system observations are further incorporated into the predictive control scheme based on the iterative learning control. The proposed approach is verified on the example of repetitive control for a thermophotovoltaic cell.
|
| |
| 16:00-16:15, Paper WeC22.3 | Add to My Program |
| Concurrent Design of Event-Triggered and Gain Scheduled Control for 2x2 Coupled Hyperbolic PDEs (I) |
|
| Somathilake, Eranda | Department of Mechanical and Aerospace Engineering, University of California San Diego |
| Diagne, Mamadou | University of California San Diego |
|
|
| |
| 16:15-16:30, Paper WeC22.4 | Add to My Program |
| Existence and Uniqueness of the Solution to a Class of Fredholm Integral Equations Related to Difference Equations (I) |
|
| Bresch-Pietri, Delphine | Mines Paris -- PSL |
| Auriol, Jean | Université Paris-Saclay, CNRS, CentraleSupélec, Laboratoire Des Signaux Et Systèmes |
Keywords: Delay systems, Distributed parameter systems
Abstract: This paper addresses the existence and uniqueness of the solution to a class of Fredholm integral equations associated with scalar Linear Integral Delay Equations (LIDEs). Based on an operator-theoretic framework involving transport partial differential equations, we provide general conditions that guarantee well-posedness of these equations. Leveraging this result, we introduce a constructive approach for the numerical computation of the Lyapunov matrix corresponding to scalar LIDEs with commensurate delays. Specifically, the Lyapunov matrix equations are reformulated as the Fredholm integral equation under consideration, for which the proposed conditions are satisfied under exponential stability of the LIDE. The resulting integral equation can then be discretized, and the linear system efficiently solved to compute the Delay Lyapunov matrix. Numerical examples illustrate the effectiveness and applicability of this methodology.
|
| |
| 16:30-16:45, Paper WeC22.5 | Add to My Program |
| SEC-Two: Secure Estimation Over Two Channels (I) |
|
| Porzio, Jacopo | KTH Royal Institute of Technology |
| Kanellopoulos, Aris | KTH Royal Institute of Technology |
| Alisic, Rijad | Massachusetts Institute of Technology |
| Dán, György | KTH - Royal Institute of Technology |
| Sandberg, Henrik | KTH Royal Institute of Technology |
Keywords: Filtering, Emerging control applications, Fault detection
Abstract: We characterize the impact of single-time-step sensor attacks on Kalman filtering and fixed-point optimal smoothing for state estimation in linear time-invariant systems over a finite time window. We introduce a single noiseless, unattacked, full-state (perfect) measurement into the estimation process to reduce the attack's impact on the estimation process and increase the probability of detection under static chi-squared tests. In the scalar case, we show that smoothing previous estimates with the perfect measurement to update detection statistics achieves higher attack detectability at the attack time than online Kalman filter--based statistics. Numerical experiments suggest that our findings generalize to multi-dimensional systems.
|
| |
| 16:45-17:00, Paper WeC22.6 | Add to My Program |
| Safe Trajectory Tracking of the Stefan Problem with Second-Order Moving Boundary Dynamics |
|
| Koga, Shumon | Kobe University |
| Krstic, Miroslav | University of California, San Diego |
Keywords: Distributed parameter systems, Constrained control, Process Control
Abstract: This paper considers a safe trajectory tracking of the Stefan problem with a second-order moving boundary dynamics. The model is given by a parabolic Partial Differential Equation (PDE) defined on a time-varying domain of moving boundary governed by a second-order Ordinary Differential Equation (ODE) associated with the Neumann boundary condition. A feedforward control is designed by a series expansion approach to solve the inverse Stefan problem under given reference trajectory of the moving boundary, and the convergence of infinite series is proven. A trajectory tracking controller is derived based on an energy-shaping, which ensures the safety of the model constraint in the closed-loop system. The closed-loop system is also shown to be globally exponentially stable with respect to the tracking error by performing PDE backstepping transformation and Lyapunov analysis. Numerical simulation illustrates an effective tracking performance of the proposed method under a sinusoidal reference trajectory.
|
| |