ACC 2026 Program | Wednesday May 27, 2026


WeP1 Plenary Session, Grand Ballroom C-D	Add to My Program
Geometric and Analytic Facets of Feedback Design: Elementary Control Meeting Policy Optimization and Modern Application

Chair: Frew, Eric W.	University of Colorado, Bolder
Co-Chair: Beard, Randal W.	Brigham Young Univ

08:30-10:00, Paper WeP1.1	Add to My Program
Geometric and Analytic Facets of Feedback Design: Control Theory Meets Policy Optimization and Modern Applications

Mesbahi, Mehran	University of Washington


WeA03 Tutorial Session, Grand Salon 3	Add to My Program
COIN: COmmunication-Aware INtegrated Learning, Estimation, and Control

Chair: Johansson, Karl H.	KTH Royal Institute of Technology
Co-Chair: Mamduhi, Mohammad H.	University of Birmingham
Organizer: Maity, Dipankar	University of North Carolina at Charlotte
Organizer: Goswami, Debdipta	The Ohio State University
Organizer: Mamduhi, Mohammad H.	University of Birmingham
Organizer: Johansson, Karl H.	KTH Royal Institute of Technology

10:30-12:00, Paper WeA03.1	Add to My Program
COIN: COmmunication-Aware INtegrated Learning, Estimation, and Control (I)

Maity, Dipankar	University of North Carolina at Charlotte
Goswami, Debdipta	The Ohio State University
Mamduhi, Mohammad H.	University of Birmingham
Johansson, Karl H.	KTH Royal Institute of Technology
Keywords: Networked control systems, Identification for control, Quantized systems Abstract: Future autonomous systems ranging from multi-robot networks to distributed satellite constellations must operate reliably under limited, delayed, and lossy communications. Traditional control and learning frameworks, however, assume perfect information flow, leaving a fundamental gap between the theory and the realities of modern cyber-physical systems. This tutorial introduces a unified framework for communication-constrained learning, estimation, and control, where sensing, estimation, learning, and decision-making are tightly coupled through shared, resource-limited communication channels. We begin with the theoretical foundations of system identi- fication under communication constraints, highlighting recent advances in operator-theoretic and learning-enabled modeling that yield provable performance guarantees even with degraded data. The second part focuses on communication-aware control design, covering sequential and joint control–estimation–communication co-design, adaptive quantization, and risk-aware robust control under bandwidth-limited, delay-prone, and unreliable communication. Finally, we demonstrate how these theories enable resilient guidance and control in robotics applications where nonlinear dynamics, limited bandwidth, and high-latency links pose critical challenges to autonomy. The tutorial combines theory, algorithms, and case studies to provide attendees with (i) a principled understanding of the quality–quantity trade-off in data-driven control, (ii) practical tools for implementing communication-aware controllers, and (iii) insights into emerging aerospace, autonomous cars, and robotic applications where these ideas are reshaping design paradigms.


WeA04 Regular Session, Grand Salon 4	Add to My Program
Aerospace I

Chair: Bisheban, Mahdis	University of Calgary
Co-Chair: Atkins, Ella M.	University of Michigan

10:30-10:45, Paper WeA04.1	Add to My Program
Safety-Critical Input-Constrained Nonlinear Intercept Guidance in Multiple Engagement Zones

Ranjan, Praveen Kumar	University of Texas at San Antonio
Sinha, Abhinav	The University of Cincinnati
Cao, Yongcan	University of Texas, San Antonio
Keywords: Aerospace, Control applications Abstract: This paper presents an input-constrained nonlinear guidance law to address the problem of intercepting a stationary target in contested environments with multiple defending agents. Contrary to prior approaches that rely on explicit knowledge of defender strategies or utilize conservative safety conditions based on a defender's range, our work characterizes defender threats geometrically through engagement zones that delineate inevitable-interception regions. Outside these engagement zones, the interceptor remains invulnerable. The proposed guidance law switches between a repulsive safety maneuver near these zones and a pursuit maneuver outside their influence. To deal with multiple engagement zones, we employ a smooth minimum function (log-sum-exponent approximation) that aggregates threats from all the zones while prioritizing the most critical threats. Input saturation is modeled and embedded in the non-holonomic vehicle dynamics so the controller respects actuator limits while maintaining stability. Numerical simulations with several defenders demonstrate the proposed method’s ability to avoid engagement zones and achieve interception across diverse initial conditions.

10:45-11:00, Paper WeA04.2	Add to My Program
Inversion-Free Adaptive Control of a Bicopter with Unknown Mass and Inertia

Portella Delgado, Jhon Manuel	University of Maryland Baltimore County
Goel, Ankit	University of Maryland Baltimore County
Keywords: Adaptive control, Lyapunov methods, Aerospace Abstract: This paper develops an inversion-free adaptive controller for stabilizing and tracking the trajectory of a bicopter system. In a bicopter system, the inertial parameters, that is, mass and moment of inertia, parameterize the input map. Since the classical adaptive backstepping technique requires inverting the input map, which would contain the parameter estimates, the stability of the closed-loop system cannot be guaranteed because the inversion of these estimates may result in division by zero. This paper proposes a novel technique to circumvent the inversion of parameter estimates in the control law. The resulting controller requires only the sign of the unknown parameters, which are trivially known to be positive in the case of a bicopter. The proposed controller is validated in simulation for a smooth and nonsmooth trajectory-tracking problem.

11:00-11:15, Paper WeA04.3	Add to My Program
An Adaptive Lyapunov-Based MPC Framework for Quadcopter Attitude Control with Real-World Experimental Validation

Protoulis, Teo	University of West Attica
Alexandridis, Alex	University of West Attica
Keywords: Aerospace, Predictive control for nonlinear systems, Lyapunov methods Abstract: In this work, a nonlinear adaptive Lyapunov-based model predictive control (NALMPC) framework for attitude regulation of quadcopters is presented. Through backstepping, a Lyapunov controller is synthesized that accounts for angular displacement constraints, while considering uncertain moments of inertia. The derived feedback laws are utilized to form a contractive constraint that is integrated into the optimal control problem (OCP) of the MPC protocol. Additionally, the moment of inertia adaptation laws are integrated into the OCP, thus resulting to a predictive model that is updated in real-time, based on the operating conditions. A numerical comparative case study against a standard nonlinear MPC (NMPC) formulation is presented, where NALMPC outperforms NMPC in terms of tracking error and computational load. Finally, experimental validation on a real-world quadcopter demonstrates the efficacy of the proposed framework in terms of tracking and computational efficiency.

11:15-11:30, Paper WeA04.4	Add to My Program
Trajectory Planning for Contingency Landing Using Optimal Control

Kim, Heejin	Virginia Tech
Atkins, Ella M.	University of Michigan
Keywords: Optimal control, Aerospace, Flight control Abstract: This paper presents an optimal control framework for emergency landing trajectory generation under loss-of-thrust conditions. The method incorporates population density maps to minimize ground risk while ensuring feasibility with a six-degree-of-freedom nonlinear dynamics model. Trajectories are initialized using either discrete search or Dubins paths. Time-dependent and time-independent cost functions are evaluated. Using real population density data from the New York City region, results show that excluding a time cost favors longer trajectories that bypass densely populated regions, while including time in the cost function reduces flight duration at the expense of greater population exposure. Closed-loop simulations confirm feasibility and consistently lower costs compared to discrete search and Dubins solutions. The study highlights the importance of accurate initialization for optimal control as small errors in reference trajectory position or yaw can result in an adverse optimal control gradient and in turn infeasible results. Convergence and computational considerations suggest optimal control be deployed in preflight planning where computational and supervisory requirements can be safely accommodated.

11:30-11:45, Paper WeA04.5	Add to My Program
Realtime Wind Estimation Using Low Cost Quadrotor Uncrewed Aerial Vehicles

Udagedara, Hiranya	University of Calgary
Bisheban, Mahdis	University of Calgary
Keywords: Kalman filtering, Estimation, Modeling Abstract: In environmental monitoring as well as emergency response applications such as wildfires, wind velocity measurement is essential. Quadrotor UAVs have become popular platforms for wind velocity estimation due to their maneuverability, compact size, and cost-effectiveness. Numerous studies use the Extended Kalman Filter (EKF) to estimate the wind velocity based on the quadrotor dynamic model. However, EKF performance is constrained by its reliance on linearized approximations of the nonlinear quadrotor dynamics around current states, limiting accuracy in highly nonlinear scenarios, including windy conditions. This study proposes the use of an Unscented Kalman Filter (UKF), a nonlinear estimator to provide accurate wind estimations while maintaining the trajectory of the quadrotor UAV. The quadrotor is modeled on the Special Euclidean group SE(3) and the approach is evaluated through numerical simulations using a geometric controller to maintain quadrotor flight paths, under different wind conditions. The results indicate that the UKF consistently outperforms the EKF, reducing wind velocity estimation error by 11.8% under near-linear flight conditions, and 25.5% under strongly nonlinear conditions. This demonstrates the potential of the UKF as a reliable estimator for highly nonlinear scenarios, capable of maintaining the trajectory with minimal deviation while providing accurate wind velocity estimations.


WeA05 Tutorial Session, Grand Salon 6	Add to My Program
Control Barrier Functions for Flight-Critical Systems in Aerospace Applications

Chair: Hussain, Heather	The Boeing Company
Co-Chair: Cohen, Max	North Carolina State University
Organizer: Cohen, Max	North Carolina State University
Organizer: Hussain, Heather	The Boeing Company

10:30-12:00, Paper WeA05.1	Add to My Program
Control Barrier Functions for Flight-Critical Systems in Aerospace Applications (I)

Cohen, Max	North Carolina State University
Hussain, Heather	The Boeing Company
Molnar, Tamas G.	Wichita State University
van Wijk, David E. J.	California Institute of Technology
Gaudio, Joseph	Boeing
Fisher, Peter	Massachusetts Institute of Technology
Menner, Marcel	Aurora Flight Sciences (A Boeing Company)
Annaswamy, Anuradha M.	Massachusetts Inst. of Tech
Autenrieb, Johannes	German Aerospace Center (DLR)
Lavretsky, Eugene	The Boeing Co
Wise, Kevin A.	Boeing Defense and Space Systems
Ratliff, Ryan T.	The Boeing Company
Ames, Aaron D.	California Institute of Technology
Keywords: Constrained control, Aerospace Abstract: We present a tutorial focused on the applications of control barrier functions (CBFs) to flight-critical aerospace systems. This tutorial is motivated by the need for mathematically justified approaches to state and input limiting on various aerospace systems, much of which is currently accomplished using heuristic techniques. Here, we illustrate how CBFs -- originally motivated by applications in robotics -- provide a natural framework for constrained control of aerospace systems. Our tutorial encompasses both the theoretical foundations of CBFs as well as real-world aerospace applications of CBFs.


WeA06 Regular Session, Grand Salon 7	Add to My Program
Game Theory I

Chair: Vasconcelos, Marcos M.	Florida State University
Co-Chair: Shishika, Daigo	George Mason University

10:30-10:45, Paper WeA06.1	Add to My Program
Linear Programming Approach to Deceptive Path Planning Game with Goal Selection

Rostobaya, Violetta	George Mason University
Guan, Yue	Georgia Institute of Technology
Berneburg, James	George Mason University
Shishika, Daigo	George Mason University
Keywords: Game theory, Agents-based systems, Information theory and control Abstract: In adversarial settings, a mobile agent may strategically plan its motion to influence an opponent’s inference about its intended goal. We study deceptive path planning in a scenario where a mobile agent aims to reach a privately selected goal while an adversarial observer allocates limited defensive resources based on the observed trajectory. Unlike classical path-planning and goal-recognition approaches that model observers as passive inference process, our game-theoretic formulation models them as strategic decision-makers. For the resulting dynamic asymmetric-information game, we develop an efficient solution method that combines a linear programming formulation with the Double Oracle algorithm. To evaluate performance, we introduce metrics that quantify both the risk and the effectiveness of deception and provide illustrative numerical examples.

10:45-11:00, Paper WeA06.2	Add to My Program
Game-To-Real Gap: Quantifying the Effect of Model Misspecification in Network Games

Ferguson, Bryce L.	Dartmouth College
Maheshwari, Chinmay	Johns Hopkins University
Wu, Manxi	University of California Berkeley
Sastry, Shankar	Univ. of California at Berkeley
Keywords: Game theory, Agents-based systems, Network analysis and control Abstract: Game-theoretic models and solution concepts provide rigorous tools for predicting collective behavior in multi-agent systems. In practice, however, different agents may rely on different game-theoretic models to design their strategies. As a result, when these heterogeneous models interact, the realized outcome can deviate substantially from the outcome each agent expects based on its own local model. In this work, we introduce the game-to-real gap, a new metric that quantifies the impact of such model misspecification in multi-agent environments. The game-to-real gap is defined as the difference between the utility an agent actually obtains in the multi-agent environment (where other agents may have misspecified models) and the utility it expects under its own game model. Focusing on quadratic network games, we show that misspecifications in either (i) the external shock or (ii) the player interaction network can lead to arbitrarily large game-to-real gaps. We further develop novel network centrality measures that allow exact evaluation of this gap in quadratic network games. Our analysis reveals that standard network centrality measures fail to capture the effects of model misspecification, underscoring the need for new structural metrics that account for this limitation. Finally, through illustrative numerical experiments, we show that existing centrality measures in network games may provide a counterintuitive understanding of the impact of model misspecification.

11:00-11:15, Paper WeA06.3	Add to My Program
Privacy-Preserving Nash Equilibrium Synthesis with Partially Ordered Temporal Objectives

Probine, Caleb	The University of Texas at Austin
Kulkarni, Abhishek	Vijil, Inc
Topcu, Ufuk	The University of Texas at Austin
Keywords: Game theory, Autonomous systems, Automata Abstract: Nash equilibrium is a central solution concept for reasoning about self-interested agents. We study the synthesis of Nash equilibria in two-player deterministic games on graphs, where players have private, partially-ordered preferences on temporal goals. Unlike prior work, which assumes preferences are common knowledge, we develop a communication protocol for equilibrium synthesis in settings where players' preferences are private information. In the protocol, players communicate to synthesize equilibria by exchanging information about when they can force desirable outcomes. We incorporate privacy by ensuring the protocol stops before enough information is revealed to expose players' preferences. We prove completeness by showing that, when no player halts communication, the protocol either returns an equilibrium or certifies that none exist. We then prove privacy by showing that, with stopping, the messages a player sends are always consistent with multiple preference relations and thus do not reveal some given secret regarding a player's true preference ordering. Experiments show that we can synthesize non-trivial equilibria while preserving privacy of preferences, showing the protocol’s potential for applications in strategy synthesis with constrained information sharing.

11:15-11:30, Paper WeA06.4	Add to My Program
Designing Inferable Signaling Schemes for Bayesian Persuasion

Probine, Caleb	The University of Texas at Austin
Karabag, Mustafa O.	The University of Texas at Austin
Topcu, Ufuk	The University of Texas at Austin
Keywords: Game theory, Learning, Information theory and control Abstract: In Bayesian persuasion, an informed sender, who observes a state, commits to a randomized signaling scheme that guides a self-interested receiver's actions. Classical models assume the receiver knows the commitment. Instead, we study a setting where the receiver infers the scheme from repeated interactions. We bound the sender’s performance loss relative to the known-commitment case by a term that grows with the signal space size and shrinks as the receiver’s optimal actions become more distinct. We then lower bound the samples required for the sender to approximately achieve their known-commitment utility in the inference setting. We show that the sender in persuasion requires more samples than the leader in a Stackelberg game, which includes commitment but lacks signaling. Motivated by these bounds, we propose two methods for designing inferable signaling schemes, one being stochastic gradient descent (SGD) on the sender’s inference-setting utility, and the other being optimization with a boundedly-rational receiver model. SGD performs best in low-interaction regimes, but applying the boundedly-rational model and tuning the rationality constant remains a flexible method for designing inferable schemes. Applied in a safety-alert example, SGD finds schemes that have fewer signals and make citizens’ optimal actions more distinct than in the known-commitment case.

11:30-11:45, Paper WeA06.5	Add to My Program
Deceptive Planning Exploiting Inattention Blindness

Karabag, Mustafa O.	The University of Texas at Austin
Milzman, Jesse	DEVCOM Army Research Laboratory
Topcu, Ufuk	The University of Texas at Austin
Keywords: Markov processes, Game theory, Information theory and control Abstract: We study decision-making with rational inattention in settings where agents have perception constraints. In such settings, inaccurate prior beliefs or models of others may lead to inattention blindness, where an agent is unaware of its incorrect beliefs. We model this phenomenon in two-player zero-sum stochastic games, where Player 1 has perception constraints and Player 2 deceptively deviates from its security policy presumed by Player 1 to gain an advantage. We formulate the perception constraints as an online sensor selection problem, develop a value-weighted objective function for sensor selection capturing rational inattention, and propose the greedy algorithm for selection under this monotone objective function. When Player 2 does not deviate from the presumed policy, this objective function provides an upper bound on the expected value loss compared to the security value where Player 1 has perfect information of the state. We then propose a myopic decision-making algorithm for Player 2 to exploit Player 1's beliefs by deviating from the presumed policy and, thereby, improve upon the security value. Numerical examples illustrate how Player 1 persistently chooses sensors that are consistent with its priors, allowing Player 2 to systematically exploit its inattention.

11:45-12:00, Paper WeA06.6	Add to My Program
Log-Linear Learning for Coordination with Heterogeneous Bounded Rationalities

Rajab, Fathy Omar	Florida State University
Anubi, Olugbenga Moses	Florida State University
Vasconcelos, Marcos M.	Florida State University
Keywords: Game theory, Networked control systems, Iterative learning control Abstract: This work investigates a two-player coordination game in which the players exhibit heterogeneous levels of bounded rationality. We analyze the log-linear learning dynamics, where the probability distribution used to select which of the agents gets to revise its strategy is fixed but not necessarily uniform. The stationary distribution of the resulting Markov chain on the strategy profile space is derived in closed-form as a function of the rationalities and the agent selection probabilities. We proceed by showing that adjusting the selection probabilities can be used to bias the stationary distribution toward the potential-maximizing state. However, this optimization comes at the cost of a reduced convergence rate, whereas the uniform selection probabilities uniquely maximizes the convergence speed irrespective of the players’ rationality levels. To address this trade-off, a Pareto-optimal probability selection rule is proposed, balancing the distributional bias with convergence rate. Moreover, it is shown that in coordination games, high levels of rationality sometimes accelerate convergence, whereas in other cases they may paradoxically hinder the convergence rate of the log-linear learning dynamics.


WeA07 Invited Session, Grand Salon 9	Add to My Program
Autonomous Risk-Aware Perception, Planning, and Control

Chair: Motee, Nader	Lehigh University
Co-Chair: Liu, Guangyi	Amazon Robotics
Organizer: Liu, Guangyi	Amazon Robotics
Organizer: Li, Na	Harvard University
Organizer: Topcu, Ufuk	The University of Texas at Austin
Organizer: Zavlanos, Michael M.	Duke University
Organizer: Motee, Nader	Lehigh University

10:30-10:45, Paper WeA07.1	Add to My Program
Risk-Aware Safety Filters with Poisson Safety Functions and Laplace Guidance Fields (I)

Bahati, Gilbert	California Institute of Technology
Bena, Ryan	California Institute of Technology
Wilkinson, Meg	California Institute of Technology
Mestres, Pol	California Institute of Technology
Cosner, Ryan	California Institute of Technology
Ames, Aaron D.	California Institute of Technology
Keywords: Vision-based control, Autonomous systems, Constrained control Abstract: Robotic systems navigating in real-world settings require a semantic understanding of their environment to properly determine safe actions. This work aims to develop the mathematical underpinnings of such a representation---specifically, the goal is to develop safety filters that are risk-aware. To this end, we take a two step approach: encoding an understanding of the environment via Poisson’s equation, and associated risk via Laplace guidance fields. That is, we first solve a Dirichlet problem for Poisson’s equation to generate a safety function that encodes system safety as its 0-superlevel set. We then separately solve a Dirichlet problem for Laplace’s equation to synthesize a safe guidance field that encodes variable levels of caution around obstacles---by enforcing a tunable flux boundary condition. The safety function and guidance fields are then combined to define a safety constraint and used to synthesize a risk-aware safety filter which, given a semantic understanding of an environment with associated risk levels of environmental features, guarantees safety while prioritizing avoidance of higher risk obstacles. We demonstrate this method in simulation and discuss how a priori understandings of obstacle risk can be directly incorporated into the safety filter to generate safe behaviors that are risk-aware.

10:45-11:00, Paper WeA07.2	Add to My Program
A Formal Gatekeeper Framework for Safe Dual Control with Active Exploration (I)

Naveed, Kaleb Ben	University of Michigan,
Agrawal, Devansh Ramgopal	University of Michigan
Panagou, Dimitra	University of Michigan, Ann Arbor
Keywords: Constrained control, Robust adaptive control, Autonomous robots Abstract: Planning safe trajectories under model uncertainty is a fundamental challenge. Robust planning ensures safety by considering worst-case realizations, yet ignores uncertainty reduction and leads to overly conservative behavior. Actively reducing uncertainty on-the-fly during a nominal mission defines the dual control problem. Most approaches address this by adding a weighted exploration term to the cost, tuned to trade off the nominal objective and uncertainty reduction, but without formal consideration of when exploration is beneficial. Moreover, safety is enforced in some methods but not in others. We propose a framework that integrates robust planning with active exploration under formal guarantees as follows: The key innovation and contribution is that exploration is pursued only when it provides a verifiable improvement without compromising safety. To achieve this, we utilize our earlier work on gatekeeper as an architecture for safety verification, and extend it so that it generates both safe and informative trajectories that reduce uncertainty and the cost of the mission, or keep it within a user-defined budget. The methodology is evaluated via simulation case studies on the online dual control of a quadrotor under parametric uncertainty.

11:00-11:15, Paper WeA07.3	Add to My Program
Bayesian Risk-Aware CBFs for Discrete-Time Stochastic Systems with Learned Dynamics (I)

Hoxha, Bardh	Toyota NA R&D
Black, Mitchell	Toyota NA R&D
Majd, Keyvan	Toyota NA R&D
Okamoto, Hideki	Toyota NA R&D
Fainekos, Georgios	Toyota NA R&D
Prokhorov, Danil	Toyota NA R&D
Keywords: Stochastic systems, Statistical learning, Robust control Abstract: We study safety for stochastic systems under sampled–data control and learned dynamics. We develop Bayesian Risk‑Aware Control Barrier Functions (RA‑CBFs) for discrete time. First, we give two guarantees for barrier crossing over a finite horizon: (i) a time‑uniform martingale bound using Ville’s inequality and a predictable variance budget, and (ii) a tighter pathwise bound that recovers the continuous‑time RA‑CBF margin via DDS time change and the reflection principle. Second, we propagate posterior uncertainty in drift and diffusion into an upper confidence bound (UCB) on the generator and derive a closed-form inter-sample robustness margin. Third, we synthesize a minimally invasive controller via a convex QP that enforces risk and uncertainty thresholds jointly. The result is a high-confidence, finite-horizon safety bound and a practical sampled-data controller that, under the stronger pathwise noise assumption, recovers the continuous-time RA-CBF margin while avoiding supermartingale-based S-CBF conditions.

11:15-11:30, Paper WeA07.4	Add to My Program
Delay-Independent Safe Control with Neural Networks: Positive Lur’e Certificates for Risk-Aware Autonomy (I)

Montazeri Hedesh, Hamidreza	Northeastern University
Siami, Milad	Northeastern University
Keywords: Neural networks, Delay systems, Uncertain systems Abstract: We present a risk-aware safety certification method for autonomous, learning-enabled control systems. Focusing on two realistic risks, state/input delays and interval matrix uncertainty, we model the neural network (NN) controller with local sector bounds and exploit positivity structure to derive linear, delay-independent certificates that guarantee local exponential stability across admissible uncertainties. To benchmark performance, we adopt and implement a state-of-the-art IQC NN-verification pipeline. In representative cases, our positivity-based tests run orders of magnitude faster than SDP-based IQC while certifying regimes the latter cannot, providing scalable safety guarantees that complement risk-aware control.

11:30-11:45, Paper WeA07.5	Add to My Program
Optimism As Risk-Seeking in Multi-Agent Reinforcement Learning (I)

Zhang, Runyu	Massachusetts Institute of Technology
Li, Na	Harvard University
Ozdaglar, Asu	MIT
Shamma, Jeff S.	University of Illinois at Urbana-Champaign
Zardini, Gioele	Massachusetts Institute of Technology
Keywords: Reinforcement learning, Decentralized control, Cooperative control Abstract: Risk sensitivity has become a central theme in reinforcement learning (RL), where convex risk measures and robust formulations provide principled ways to model preferences beyond expected return. Recent extensions to multi-agent RL (MARL) have largely emphasized the risk-averse setting, prioritizing robustness to uncertainty. In cooperative MARL, however, such conservatism often leads to suboptimal equilibria, and a parallel line of work has shown that optimism can promote cooperation. Existing optimistic methods, though effective in practice, are typically heuristic and lack theoretical grounding. Building on the dual representation for convex risk measures, we propose a principled framework that interprets risk-seeking objectives as optimism.We introduce optimistic value functions, which formalize optimism as divergence-penalized risk-seeking evaluations. Building on this foundation, we derive a policy-gradient theorem for optimistic value functions, including explicit formulas for the entropic risk/KLpenalty setting, and develop decentralized optimistic actorcritic algorithms that implement these updates. Empirical results on cooperative benchmarks demonstrate that riskseeking optimism consistently improves coordination over both risk-neutral baselines and heuristic optimistic methods. Our framework thus unifies risk-sensitive learning and optimism, offering a theoretically grounded and practically effective approach to cooperation in MARL.

11:45-12:00, Paper WeA07.6	Add to My Program
Risk-Budgeted Control Framework for Balanced Performance and Safety in Autonomous Vehicles

Chang, Pei Yu	The Ohio State University
Renganathan, Vishnu	The Ohio State University
Ahmed, Qadeer	The Ohio State University
Keywords: Autonomous systems, Automotive systems, Automotive control Abstract: This paper presents a hybrid control framework with a risk-budgeted monitor for safety-certified autonomous driving. A sliding-window monitor tracks insufficient barrier residuals and triggers switching from a relaxed control barrier function (R-CBF) to a more conservative conditional value-at-risk CBF (CVaR-CBF) when the safety margin deteriorates. Two real-time triggers are considered: feasibility-triggered (FT), which activates CVaR-CBF when the R-CBF problem is reported infeasible, and quality-triggered (QT), which switches when the residual falls below a prescribed safety margin. The framework is evaluated with model predictive control (MPC) under vehicle localization noise and obstacle position uncertainty across multiple AV-pedestrian interaction scenarios with 1,500 Monte Carlo runs. In the most challenging case with 5 m pedestrian detection uncertainty, the proposed method achieves a 94--96% collision-free success rate over 300 trials while maintaining the lowest mean cross-track error (CTE = 3.2--3.6 m), indicating faster trajectory recovery after obstacle avoidance and a favorable balance between safety and performance.


WeA08 Regular Session, Grand Salon 10-13	Add to My Program
Distributed Control

Chair: Li, Jing Shuang (Lisa)	University of Michigan
Co-Chair: Matni, Nikolai	University of Pennsylvania

10:30-10:45, Paper WeA08.1	Add to My Program
Distributed Continuous-Time Control Via System Level Synthesis

Du, Yaozhi	University of Michigan
Li, Jing Shuang (Lisa)	University of Michigan
Keywords: Distributed control, Linear systems, Optimal control Abstract: This paper designs H2 and H-Infinitydistributed controllers with local communication and local disturbance rejection. We propose a two-step procedure: first, select closed-loop poles; then, optimize over parameterized controllers. We build on the system level synthesis (SLS) parameterization --- primarily used in the discrete-time setting --- and extend it to the general continuous-time setting. We verify our approach in simulation on a 9-node grid governed by linearized swing equations, where our distributed controllers achieve performance comparable to that of optimal centralized controllers while facilitating local disturbance rejection.

10:45-11:00, Paper WeA08.2	Add to My Program
Scalable Distributed Nonlinear Control under Flatness-Preserving Coupling

Yang, Fengjun	University of Pennsylvania
Welde, Jake	Cornell University
Matni, Nikolai	University of Pennsylvania
Keywords: Distributed control, Feedback linearization, Large-scale systems Abstract: We study distributed control for a network of nonlinear, differentially flat subsystems subject to dynamic coupling. Although differential flatness simplifies planning and control for isolated subsystems, the presence of coupling can destroy this property for the overall joint system. Focusing on subsystems in pure-feedback form, we identify a class of compatible lower-triangular dynamic couplings that preserve flatness and guarantee that the flat outputs of the subsystems remain the flat outputs of the coupled system. Further, we show that the joint flatness diffeomorphism can be constructed from those of the individual subsystems and, crucially, its sparsity structure reflects that of the coupling. Exploiting this structure, we synthesize a distributed tracking controller that computes control actions from local information only, thereby ensuring scalability. We validate our proposed framework on a simulated example of planar quadrotors dynamically coupled via aerodynamic downwash, and show that the distributed controller achieves accurate trajectory tracking.

11:00-11:15, Paper WeA08.3	Add to My Program
Explicit Distributed MPC: Reducing Computation and Communication Load by Exploiting Facet Properties

Brahmbhatt, Parth	University of Wisconsin Madison
Ganesh, Hari	Indian Institute of Technology Gandhinagar,
Avraamidou, Styliani	University of Wisconsin Madison
Keywords: Distributed control, Cooperative control, Decentralized control Abstract: Classical Distributed Model Predictive Control (DiMPC) requires multiple iterations to achieve convergence, leading to high computational and communication burdens. This work focuses on the improvement of an iteration-free distributed MPC methodology that minimizes computational effort and communication load. The aforementioned methodology leverages multiparametric programming to compute explicit control laws offline for each subsystem, enabling real-time control without iterative data exchanges between subsystems. Extending our previous work on iteration-free DiMPC, here we introduce a FAcet-based Critical region Exploration Technique for iteration-free DiMPC (FACET-DiMPC) that further reduces computational complexity by leveraging facet properties to do targeted critical region exploration. Simulation results demonstrate that the developed method achieves comparable control performance to centralized methods, while significantly reducing communication overhead and computation time. In particular, the proposed methodology offers substantial efficiency gains in terms of the average computation time reduction of 98% compared to classic iterative DiMPC methods and 42% compared to iteration-free DiMPC methods, making it well-suited for real-time control applications with tight latency and computation constraints.

11:15-11:30, Paper WeA08.4	Add to My Program
A Graph-Based Classification Approach for the Adaptive Decomposition of Model Predictive Control Problems

Cooper, Elliot	University of Michigan
Allman, Andrew	University of Michigan
Keywords: Distributed control, Machine learning, Adaptive control Abstract: Distributed model predictive control (DMPC) has emerged as a computationally efficient alternative to centralized model predictive control (CMPC) by decomposing the centralized optimal control problem into a collection of constituent subsystem controllers. The subsystem controllers can be solved in parallel and coordinated to reach a consensus solution, significantly reducing computational cost at the potential expense of control quality relative to CMPC. In this paper, we demonstrate that state measurements and operational objectives, which serve as parameters in the optimal control problem of each distributed subsystem controller, are strongly correlated with the quality of control achieved by a particular DMPC architecture. We propose a novel graph-based classifier that selects the most suitable DMPC architecture based on these time-varying parameters and integrate this selection process into the online control loop. This adaptive approach dynamically responds to process variations, enhancing control performance and mitigating the potential loss in control quality typically associated with DMPC.

11:30-11:45, Paper WeA08.5	Add to My Program
Discrete-Time Robust Cooperative Output Regulation by Dynamic Measurement Output Feedback

Gul, Kursad Metehan	Utah State University
Sarsilmaz, Selahattin Burak	Utah State University
Keywords: Distributed control, Output regulation, LMIs Abstract: Recent work [1] by the authors has considered robust cooperative output regulation of discrete-time uncertain heterogeneous (in dimension) multi-agent systems (MASs) using the distributed internal model approach, assuming that each follower has full access to its own state. This paper extends our discrete-time robust cooperative output regulation results to the case wherein followers have partial access to their states. Specifically, a detectability condition is added, and each follower employs a local state observer. We first show that the solvability of the robust cooperative output regulation problem (RCORP) with the internal model-based distributed dynamic measurement output feedback control law of interest reduces to the solvability of stabilization problems, including a structured stabilization problem. Then, we leverage our recent results, which cover the existence and design of the corresponding structured control gain, and extend both global and agent-wise local conditions to solve the RCORP via distributed dynamic measurement output feedback. Finally, we provide a numerical example of a MAS comprising three quadrotors and one uninhabited ground vehicle to illustrate the resulting agent-wise local design method.

11:45-12:00, Paper WeA08.6	Add to My Program
Geometry-Aware Decentralized Sinkhorn for Wasserstein Barycenters

Baheri, Ali	Rochester Institute of Technology
Millard, David	Rochester Institute of Technology
Vahid, Alireza	Rochester Institute of Technology
Keywords: Decentralized control, Distributed control, Communication networks Abstract: Distributed systems require fusing heterogeneous local probability distributions into a global summary over sparse and unreliable communication networks. Traditional consensus algorithms, which average distributions in Euclidean space, ignore their inherent geometric structure, leading to misleading results. Wasserstein barycenters offer a geometry-aware alternative by minimizing optimal transport costs, but their entropic approximations via the Sinkhorn algorithm typically require centralized coordination. This paper proposes a fully decentralized Sinkhorn algorithm that reformulates the centralized geometric mean as an arithmetic average in the log-domain, enabling approximation through local gossip protocols. Agents exchange log-messages with neighbors, interleaving consensus phases with local updates to mimic centralized iterations without a coordinator. To optimize bandwidth, we integrate event-triggered transmissions and b-bit quantization, providing tunable trade-offs between accuracy and communication while accommodating asynchrony and packet loss. Under mild assumptions, we prove convergence to a neighborhood of the centralized entropic barycenter, with bias linearly dependent on consensus tolerance, trigger threshold, and quantization error. Complexity scales near-linearly with network size. Simulations confirm near-centralized accuracy with significantly fewer messages, across various topologies and conditions.


WeA09 Invited Session, Grand Salon 12	Add to My Program
Battery Systems, Estimation & Energy Management

Chair: Chen, Jun	Oakland University
Co-Chair: Gupta, Shobhit	General Motors
Organizer: Chen, Jun	Oakland University
Organizer: Gupta, Shobhit	General Motors
Organizer: Chang, Insu	General Motors LLC
Organizer: Wang, Zejiang	The University of Texas at Dallas

10:30-10:45, Paper WeA09.1	Add to My Program
Dense Extended Kalman Filter for Simultaneous Cell State-Parameter Estimation (I)

Nuculaj, Luke	Oakland University
Chen, Jun	Oakland University
Keywords: Kalman filtering, Estimation, Energy systems Abstract: This work investigates the use of dense extended Kalman filter (DEKF) to simultaneously estimate both the state-of-charge (SOC) and equivalent-circuit parameters of serial-connected, lithium-ion battery cells under limited voltage measurement. Stability analyses are conducted to certify the reliability of the DEKF for simultaneous state-parameter estimation under certain mild assumptions. To begin, the local asymptotic stability of the expected dense error in the presence of nonlinearity is verified by selection of an appropriate Lyapunov function. Then the expected sparse error is shown to be marginally stable in the absence of nonlinearities by proving that the spectral radius of the sparse expected error dynamics is exactly equal to 1. Finally, a sufficient condition for sparse stability in the presence of nonlinearity is obtained via selection of an appropriate Lyapunov function. Simulation trials under several different scenarios are executed, confirming that the associated error dynamics are indeed stable for the duration of the simulation time.

10:45-11:00, Paper WeA09.2	Add to My Program
Battery Remaining-Useful-Life Estimation with Probabilistic Transformers (I)

Chen, Evan	Purdue University
Chellapandi, Vishnu Pandi	Cummins Inc
Borhan, Hoseinali	Cummins Inc
Brinton, Christopher	Purdue University
Keywords: Neural networks, Optimization, Adaptive control Abstract: Accurate remaining useful life (RUL) prediction is essential for battery safety and reliability, yet point forecasts can be misleading under nonstationarity and domain shift. We propose QE–RULMamba, a probabilistic, transformer-based state-space forecaster that ensembles multiple experts under a fixed parameter budget and enforces quantile diversity to capture prognostic uncertainty. From capacity forecasts, we compute actionable RUL via a lower-confidence-bound (LCB) search that respects simple temporal and throughput constraints and exposes a tunable risk knob for operations. Evaluated on the NASA and TJU datasets, QE–RULMamba achieves consistently lower MAE/RMSE and higher R^2 than competitive sequence models and a non-quantile ensemble, while yielding calibrated intervals, tight in-distribution and appropriately wider under shift. The resulting confidence-aware RUL curves track oracle behavior in stable regimes and become deliberately conservative when degradation dynamics are uncertain, enabling practical triggers (e.g., early inspection when bounds approach the end-of-life threshold) and underscoring the value of uncertainty-aware forecasting for real-world battery management.

11:00-11:15, Paper WeA09.3	Add to My Program
A Multi-Objective Optimization Based Energy Management System (EMS) for Improving Battery Health in Battery-Supercapacitor Electric Vehicles (I)

Upadhyaya, Ashruti	Indian Institute of Technology, Guwahati
Mahanta, Chitralekha	Indian Institute of Technology, Guwahati
Keywords: Control applications, Automotive control, Hybrid systems Abstract: This paper proposes a novel multi-objective optimization based energy management system (EMS) for battery/ Supercapacitor (SC) Electric Vehicles (EVs). Two conflicting objectives, viz. minimization of battery state of charge (SOC) degradation and reduction of SC power loss are considered for allocating power optimally between battery and SC. The choice of objective functions is based on the fact that battery is the main energy source of the EV and SC is used for supporting the battery during the journey. The primary objective is to minimize battery degradation and reduce overall system losses by minimizing SC power loss. This will prevent overburdening of the auxiliary energy source i.e. SC and improve system performance. The proposed optimal control problem is solved by applying Dynamic Programming (DP) method. A Pareto front is generated displaying the trade-offs between the two objective functions which can play an important role in benchmarking other real-time control techniques. A throughput based dynamic model is also utilized for studying the battery State of Health (SOH) degradation in varying conditions. Simulation study and numerical investigations are conducted by altering the priority of the two objective functions and performance of the proposed technique is evaluated. The proposed Energy Management System (EMS) is compared with the EMS using Filter based Strategy (FBS) and battery only system.

11:15-11:30, Paper WeA09.4	Add to My Program
High-Fidelity Battery Capacity Estimation Method Using Recurrent Neural Network and Data Mapping Strategy (I)

Chang, Insu	General Motors LLC
Shieh, Su-Yang	General Motors LLC
Gupta, Shobhit	General Motors LLC
Zanardelli, Wesley	General Motors LLC
Keywords: Energy systems, Estimation, Neural networks Abstract: This paper presents a data-driven framework for accurate battery capacity estimation, addressing challenges of large storage requirements and time-scale discrepancies. The proposed method converts high-frequency time-series data into compact snapshots using a histogram-based data mapping strategy and optimal bin-sizing, reducing data storage significantly. A recurrent neural network (RNN) is employed to model capacity evolution, and the network architecture is optimized using a random search method. Evaluation on an open source NASA dataset achieved less than 1 % validation error, and the benefit scales across other battery chemistries. These results confirm the scalability and adaptability of the method for diverse applications.

11:30-11:45, Paper WeA09.5	Add to My Program
Accelerating Battery Charging Via Bayesian Optimization with Risk-Seeking Sampling

Zhang, Yi	Georgia Institute of Technology
Wang, Xizhe	Tsinghua University
Jiang, Benben	Tsinghua University
Keywords: Energy systems, Optimization algorithms, Machine learning Abstract: The global transition towards electrified transportation has increased the demand for fast and reliable charging technologies, which are essential for the widespread adoption of electric vehicles. However, enabling rapid charging without accelerating battery degradation remains a major technical challenge. Conventional methods for optimizing charging protocols, such as model-based approaches and grid search, are constrained by inaccurate battery models and the high cost of extensive experimental characterization. Bayesian optimization offers an efficient alternative, as it can learn and optimize unknown objectives with limited sampling. Nevertheless, standard acquisition functions are often designed for general-purpose tasks and may not align with the performance oriented nature of battery fast charging. This research gap motivates the development of the acquisition function better suited to high performance charging applications. We therefore propose a novel acquisition function that prioritizes best case outcomes by identifying and exploring high risk, high reward regions in the objective landscape. Experimental results on the mathematical benchmark function and the PET based battery simulator validate the effectiveness of our proposed approach.

11:45-12:00, Paper WeA09.6	Add to My Program
A Nonlinear Geometric Approach for Energy Management of a Cascaded Multilevel Inverter Integrating the Battery

Buenfil Solis, Guillermo	CentraleSupélec
Desreveaux, Anatole	CNAM
Iovine, Alessio	CNRS
Pasillas-Lepine, William	CNRS
Keywords: Control applications, Power electronics, Automotive systems Abstract: Multilevel inverters have recently emerged as a promising alternative for traction and stationary applications, offering advantages such as reduced losses and lower current distortion compared to conventional two-level inverters. To fully harness these benefits, suitable control strategies are required to drive the system towards desired operating states, along with a battery management system to preserve the health of the battery pack. In this work, we consider a cascaded multilevel inverter that directly integrates batteries as the energy source and propose a control technique based on a geometric approach. The method simultaneously achieves accurate machine control and balances the state of charge across modules, thereby mitigating non-uniform aging of the battery pack. The effectiveness of the proposed approach is validated through simulation studies.


WeA10 Regular Session, Grand Salon 15	Add to My Program
Linear Systems I

Chair: Zare, Armin	University of Texas at Dallas
Co-Chair: Zaccarian, Luca	LAAS-CNRS

10:30-10:45, Paper WeA10.1	Add to My Program
H2-Optimal Parameter Tuning Via Structured Feedback-Gain Basis Decomposition

Hartman, Daniel	University of Texas at Dallas
Zare, Armin	University of Texas at Dallas
Keywords: Linear parameter-varying systems, Optimal control, Optimization algorithms Abstract: Parameter tuning in structured control design faces inherent challenges due to nonconvex feasible sets, sensitivity to initialization, and the need for stabilizing starting points. We revisit this problem by recasting parameter tuning as a structured state-feedback synthesis that expands the gain over sparsity-preserving bases. We formulate a quadratic objective in the state and augmentation terms and develop two augmented-Lagrangian algorithms that efficiently compute optimal parameters in the presence of application-driven bound constraints. This framework accommodates additional convex constraints when needed, delivers high-quality warm starts for subspace gradient methods, and applies broadly whenever the dynamics depend affinely on the unknown parameters. We provide examples that illustrate the effectiveness of our approach and the proposed algorithms in solving parameter tuning and structured control design problems.

10:45-11:00, Paper WeA10.2	Add to My Program
Computation of Minimal Kernel Representation of a Discrete LTI Behavior

Dilip, Sanand	IIT Kharagpur
Khare, Swanand	Indian Institute of Technology Kharagpur
Keywords: Linear systems, Computational methods Abstract: We propose a novel linear algebraic approach to construct a minimal kernel representation of a discrete LTI behavior from an arbitrary kernel representation. We also present an algorithm to generate data for a restricted behavior from its kernel representation. Using this data, we construct certain Hankel matrices and obtain a minimal kernel representation from left kernels of these Hankel matrices via a modified algorithm of [Markovsky and Dorfler, IEEE Transactions on Automatic Control, vol. 68, no. 3, pp. 1667–1677, 2022]. As a consequence, we give a constructive algorithm to compute a minimal basis of a finitely generated module over a single variable polynomial ring.

11:00-11:15, Paper WeA10.3	Add to My Program
Identifying Network Structure of Linear Dynamical Systems: Observability and Edge Misclassification

Gill, Jaidev	University of Michigan, Ann Arbor
Li, Jing Shuang (Lisa)	University of Michigan
Keywords: Linear systems, Networked control systems, Identification Abstract: This work studies the limitations of uniquely identifying the structure (i.e., topology) of a networked linear system from partial measurements of its nodal dynamics. In general, many networks can be consistent with these measurements; this is a consideration often neglected by standard network inference methods. We show that the space of these networks are related through the nullspace of the observability matrix for the true network. We establish relevant metrics to investigate this space, including an analytic characterization of the most structurally dissimilar network that can be inferred, as well as the possibility of mis-inferring presence or absence of edges. In simulations, we find that when observing over 6% of nodes in random network models (e.g., Erdos-Renyi and Watts-Strogatz), approximately 99% of edges are correctly classified. Extending this discussion, we construct a family of networks that keep measurements epsilon-close to each other, and connect the identifiability of these networks to the spectral properties of an augmented observability Gramian.

11:15-11:30, Paper WeA10.4	Add to My Program
A Periodic Output Feedback Stabilization Scheme with Non-Simultaneous Sensing and Actuation

Manca, Giorgio	University of Rome, Tor Vergata
Sassano, Mario	University of Rome, Tor Vergata
Zaccarian, Luca	LAAS-CNRS
Keywords: Hybrid systems, Linear systems, Stability of hybrid systems Abstract: In this work, an output feedback control scheme is proposed for the stabilization of a continuous-time LTI system characterized by non-simultaneous sensing and actuation. Using a Gramian-based framework, a periodically repeating time window is divided into a sensing phase, where the state is reconstructed by filtering the output, and an actuation phase, where the state is driven to a target point balancing the control effort and the stabilization goal. The overall control architecture is cast as a well-posed hybrid dynamical system, providing a convenient tool for proving robust global exponential stability.

11:30-11:45, Paper WeA10.5	Add to My Program
Robust Controllable Set Computation Using Constrained Convex Generators

Silvestre, Daniel	NOVA University of Lisbon
P. Vinod, Abraham	Mitsubishi Electric Research Laboratories
Keywords: Linear systems, Predictive control for linear systems, Autonomous systems Abstract: Robust Controllable (RC) sets enable safe control of dynamical systems under constraints and uncertainty. Existing approaches typically rely on polytopic representations for the computation of these sets, which suffer from conservativeness and scalability issues. Recently, Constrained Convex Generators (CCGs) were proposed to allow set-based control and analysis in presence of both ellipsoidal and polytopic state-input constraints. However, the computation of RC sets using CCGs is currently hindered because the Pontryagin difference set operation has not been developed for CCGs. In this paper, we provide theory and algorithms to address this challenge, and enable safe control under uncertainty using RC sets and CCGs. Specifically, we propose an inner approximation for the RC set using a CCG description. We show in simulations that the proposed approach improves accuracy and memory usage when compared to computations with polytopic approximations.


WeA11 Regular Session, Grand Salon 16	Add to My Program
Mechanical Systems

Chair: Gilbert, Hunter B.	Louisiana State University
Co-Chair: Diaz-Mercado, Yancy	University of Maryland

10:30-10:45, Paper WeA11.1	Add to My Program
Sequential Model Calibration of Vapor Compression Cycles Using Approximate Bayesian Computation

Zinage, Shrenik	Purdue University, West Lafayette
Dixit, Vaibhav	Meta
Chakrabarty, Ankush	Mitsubishi Electric Research Laboratories (MERL)
Qiao, Hongtao	Mitsubishi Electric Research Laboratories (MERL)
Laughman, Christopher R.	Mitsubishi Electric Research Labs
Bilionis, Ilias	Purdue University
Deshpande, Vedang M.	Mitsubishi Electric Research Laboratories
Keywords: Mechanical systems/robotics, Estimation, Differential-algebraic systems Abstract: Physics-based simulation models are essential for the development of vapor compression cycles (VCCs), the core technology behind most modern refrigeration and air conditioning systems. These models enable robust control, monitoring, fault detection and diagnostics, and digital twin technologies. However, nonlinear dynamics, high-dimensional parameter and state spaces, numerical stiffness, and the limited integration of conventional modeling environments with scientific machine learning workflows create significant challenges for efficient, transferable, and automated calibration. Existing approaches typically rely on deterministic methods and lack mechanisms for principled knowledge transfer across calibration tasks, while also failing to quantify epistemic and aleatoric uncertainty. To address these limitations, we propose a Bayesian calibration framework for VCC systems that explicitly quantifies various sources of uncertainty in model predictions. The framework supports transferability across datasets by sequentially updating informative priors from previously inferred posteriors. We implement and evaluate this approach on a commercially available high-fidelity Julia based dynamic VCC model, demonstrating its ability to successfully estimate key system parameters.

10:45-11:00, Paper WeA11.2	Add to My Program
Adaptive Motion Planning Via Contact-Based Intent Inference for Human-Robot Collaboration

Song, Jiurun	Texas A&M University
Liang, Xiao	Texas A&M University
Zheng, Minghui	Texas A&M University
Keywords: Mechanical systems/robotics, Robotics, Human-in-the-loop control Abstract: Human-robot collaboration (HRC) requires robots to adapt their motions to human intent to ensure safe and efficient cooperation in shared spaces. Although large language models (LLMs) provide high-level reasoning for inferring human intent, their application to reliable motion planning in HRC remains challenging. Physical human-robot interaction (pHRI) is intuitive but often relies on continuous kinesthetic guidance, which imposes burdens on operators. To address these challenges, a contact-informed adaptive motion-planning framework is introduced to infer human intent directly from physical contact and employ the inferred intent for online motion correction in HRC. First, an optimization-based force estimation method is proposed to infer human-intended contact forces and locations from joint torque measurements and a robot dynamics model, thereby reducing cost and installation complexity while enabling whole-body sensitivity. Then, a torque-based contact detection mechanism with link-level localization is introduced to reduce the optimization search space and to enable real-time estimation. Subsequently, a contact-informed adaptive motion planner is developed to infer human intent from contacts and to replan robot motion online, while maintaining smoothness and adapting to human corrections. Finally, experiments on a 7-DOF manipulator demonstrate the accuracy of the proposed force estimation method with a mean absolute joint-torque estimation error of 0.665 Nm, and verify the effectiveness of the contact-informed adaptive motion planner under perception uncertainty in HRC.

11:00-11:15, Paper WeA11.3	Add to My Program
Input Shaping for Point-To-Point Motion with a Continuum Robot Arm

Hernandez Ibarra, Rodolfo Alejandro	Tecnológico De Monterrey (ITESM)
Baker, Karan	Louisiana State University
Molaei, Parsa	Louisiana State University
Stein, Adrian	Louisiana State University
Gilbert, Hunter B.	Louisiana State University
Keywords: Mechanical systems/robotics, Mechatronics, Flexible structures Abstract: A cable-driven continuum robot arm is an underactuated mechanism and may suffer residual vibration at the end of a rest-to-rest maneuver. In this work, a time-delay filter is applied as an input shaper to the system to eliminate the excitation of vibratory modes. A non-robust and a robust time-delay filter are designed based on a linear system model and demonstrate improved response compared to a velocity-driven pulse input. Experimental results using the continuum robot validate the application of the input shaper, with reduced overshoot and settling time exemplifying the reduction in oscillation at the end of the maneuver. It is also shown that utilizing the robust shaper further improves the response of the arm in comparison to applying the non-robust shaper. These results are significant towards the precise and robust implementation of continuum robots in applications involving arbitrary end-effector trajectories.

11:15-11:30, Paper WeA11.4	Add to My Program
A Hierarchical Model-Free Controller for Stable Limit Cycle Generation in Underactuated Mechanical Systems

Tahir, Muhammad Rafey	National University of Sciences and Technology, Islamabad
Murtaza, Muhammad Ali	Information Technology University
Hayat, Rameez	National University of Science and Technology, Islamabad
Hutchinson, Seth	Northeastern University
Ali, Usman	De Montfort University
Keywords: Mechanical systems/robotics, Hierarchical control, Adaptive control Abstract: This paper presents a hierarchical model-free control framework for underactuated mechanical systems aimed at generating stable limit cycles. Our proposed control architecture consists of a high-level controller that generates reference trajectories while the low-level controller guarantees accurate tracking of these trajectories on the unactuated coordinates. Our proposed controller has desirable robustness and stability properties and we show its effectiveness through demonstrations on various underactuated mechanical systems such as rotary inverted pendulum (RIP), a segway, an acrobot and a leg-foot model on deformable ground.

11:30-11:45, Paper WeA11.5	Add to My Program
Task-Space Singularity Avoidance for Control Affine Systems Using Control Barrier Functions

Forghani, Kimia	University of Maryland College Park
Raval, Suraj	University of Maryland, College Park
Mair, Lamar	Manipulation and Particle Research, Weinberg Medical Physics, Inc
Krieger, Axel	John Hopkins University
Diaz-Mercado, Yancy	University of Maryland
Keywords: Mechanical systems/robotics, Robotics, Numerical algorithms

11:45-12:00, Paper WeA11.6	Add to My Program
A Class of Axis–Angle Attitude Control Laws for Rotational Systems

Gonçalves, Francisco	Washington State University
Bena, Ryan	California Institute of Technology
Perez Arancibia, Nestor Osvaldo	Washington State University (WSU)
Keywords: Robotics, Mechatronics, Control applications Abstract: We introduce a new class of attitude control laws for rotational systems; the proposed framework generalizes the use of the Euler axis–angle representation beyond quaternion-based formulations. Using basic Lyapunov stability theory and the notion of extended class K function, we developed a method for determining and enforcing the global asymptotic stability of the single fixed point of the resulting closed-loop (CL) scheme. In contrast with traditional quaternion-based methods, the introduced generalized axis–angle approach enables greater flexibility in the design of the control law, which is of great utility when employed in combination with a switching scheme whose transition state depends on the angular velocity of the controlled rotational system. Through simulation and real-time experimental results, we demonstrate the effectiveness of the developed formulation. According to the recorded data, in the execution of high-speed tumble-recovery maneuvers, the new method consistently achieves shorter stabilization times and requires lower control effort relative to those corresponding to the quaternion-based and geometric-control methods used as benchmarks.


WeA12 Invited Session, Grand Salon 18	Add to My Program
Optimization for Energy and Infrastructure Networks

Chair: Moura, Scott	University of California, Berkeley
Co-Chair: De Castro, Ricardo	University of California, Merced
Organizer: Wang, Ruiting	University of California, Berkeley
Organizer: Moura, Scott	University of California, Berkeley
Organizer: De Castro, Ricardo	University of California, Merced

10:30-10:45, Paper WeA12.1	Add to My Program
Joint Price and Power MPC for Peak Power Reduction at Workplace EV Charging Stations (I)

Cambronne, Thibaud	University of California, Berkeley
Bobick, Samuel	University of California, Berkeley
Zeng, Wente	TotalEnergies S.E
Moura, Scott	University of California, Berkeley
Keywords: Smart grid, Human-in-the-loop control Abstract: Demand charge, a utility fee based on an electricity customer's peak power consumption, often constitutes a significant portion of costs for commercial electric vehicle (EV) charging station operators. This paper explores control methods to reduce peak power consumption at workplace EV charging stations in a joint price and power optimization framework. We optimize a menu of price options to incentivize users to select controllable charging service. Using this framework, we propose a model predictive control approach to reduce both demand charge and overall operator costs. Through a Monte Carlo simulation, we find that our algorithm outperforms a state-of-the-art benchmark optimization strategy and can significantly reduce station operator costs.

10:45-11:00, Paper WeA12.2	Add to My Program
Energy Management Strategies for Electric Aircraft Charging Leveraging Active Landside Vehicle-To-Grid (I)

Vehlhaber, Finn Niklas	Eindhoven University of Technology
Salazar, Mauro	Eindhoven University of Technology
Keywords: Transportation networks, Energy systems, Smart grid Abstract: The deployment of medium-range battery electric aircraft is a promising pathway to improve the environmental footprint of air mobility. Yet such a deployment would be accompanied by significant electric power requirements at airports due to aircraft charging. Given the growing prevalence of electric vehicles and their bi-directional charging capabilities—so-called vehicle-to-grid (V2G)—we study energy buffer capabilities of parked electric vehicles to alleviate pressure on grid connections. To this end, we present energy management strategies for airports providing cost-optimal apron and landside V2G charge scheduling. Specifically, we first formulate the optimal energy management problem of joint aircraft charging and landside V2G coordination as a linear program, whereby we use partial differential equations to model the aggregated charging dynamics of the electric vehicle fleet. Second, we consider a shuttle flight network with a single hub of a large Dutch airline, real-world grid prices, and synthetic parking garage occupancy data to test our framework. Our results show that V2G at even a single airport can indeed reduce energy costs to charge the aircraft fleet: Compared to a baseline scenario without V2G, the proposed concept yields cost savings of up to 32 %, depending on the schedule and amount of participating vehicles, and has other potential beneficial effects on the local power grid, e.g., the reduction of potential power peaks.

11:00-11:15, Paper WeA12.3	Add to My Program
Optimal Surface Power Allocation for Sustainable Urban Electric Vehicle Growth (I)

Fierro Ulloa, Joel Ignacio	Centre Inria De l'Université Grenoble Alpes
Canudas de Wit, Carlos	CNRS, GIPSA-Lab
Keywords: Optimization, Multivehicle systems, Large-scale systems Abstract: The rapid transition to electric mobility requires not only the deployment of charging infrastructure but also the strategic allocation of power within urban electricity networks. While most existing studies address charger siting and network coverage, this work focuses on the complementary problem of how much power should be distributed across different areas of a city — a challenge we define as the Surface Power Allocation Problem (SPAP). We introduce a macroscopic modeling framework in which the urban landscape is partitioned into Urban Surface Units (USUs), each treated as a virtual charging node with aggregated demand. Using an origin–destination graph of mobility flows and a dynamic nonlinear representation of electric vehicle state-of-charge evolution, we estimate spatial power requirements consistent with long-term sustainability objectives. The resulting optimization problem is formulated as a large-scale nonlinear program, which we solve through a relaxation and gradient-based approach to provide computationally efficient solutions.

11:15-11:30, Paper WeA12.4	Add to My Program
Regulating EV Charging Markets for Fairness: Incentives for Pricing and Capacity Decisions (I)

Wang, Ruiting	University of California, Berkeley
Hu, Kita	University of California, Berkeley
Yu, Yitong	Nanyang Technological University, Singapore
Moura, Scott	University of California, Berkeley
Keywords: Transportation networks, Optimization algorithms, Game theory Abstract: The transition to electric mobility calls for charging infrastructure that is both efficient and socially equitable. This paper examines fairness in electric vehicle (EV) charging station pricing and capacity through a game-theoretic perspective. We model a non-cooperative market in which competing charging service providers set prices and capacities while customers choose stations based on generalized cost, leading to a market equilibrium. We then benchmark this decentralized outcome against an idealized planner solution that jointly optimizes efficiency and equity. To align market outcomes with socially desirable goals, we design targeted incentives that guide operators toward more fair charger placement. Case studies demonstrate that unregulated competition tends to exacerbate disparities in charger access across demographic groups, whereas carefully calibrated incentives can reduce inequities without significant efficiency loss. The framework provides insights for policymakers on reconciling free-market dynamics with the broader societal goals of fairness in electrified mobility systems.

11:30-11:45, Paper WeA12.5	Add to My Program
Energy-Aware Integrated Proactive Maintenance Planning and Production Scheduling (I)

Li, Hongliang	Pennsylvania State University
Pangborn, Herschel	The Pennsylvania State University
Kovalenko, Ilya	Pennsylvania State University
Keywords: Manufacturing systems, Hierarchical control, Optimal control Abstract: Demand-side energy management, such as the real-time pricing (RTP) program, offers manufacturers opportunities to reduce energy costs by shifting production to low-price hours. However, this strategy is challenging to implement when machine degradation is considered, as degraded machines have decreased processing capacity and increased energy consumption. Proactive maintenance (PM) can restore machine health but requires production downtime, creating a challenging trade-off: scheduling maintenance during low-price periods sacrifices energy savings opportunities, while deferring maintenance leads to capacity losses and higher energy consumption. To address this challenge, we propose a hierarchical bi-level control framework that jointly optimizes PM planning and runtime production scheduling, considering the machine degradation. A higher-level optimization, with the lower-level model predictive control (MPC) embedded as a subproblem, determines PM plans that minimize total operational costs under day-ahead RTP. At runtime, the lower-level MPC executes closed-loop production scheduling to minimize energy costs under realized RTP, meeting delivery targets. Simulation results from a lithium-ion battery pack assembly line case study demonstrate that the framework strategically shifts PM away from bottlenecks and high-price hours, meeting daily production targets while reducing energy costs.

11:45-12:00, Paper WeA12.6	Add to My Program
Fairness-Aware Management of Electric Vehicle Charging Stations (I)

Monteiro, Paulo	UC Merced
Williams, Chase	University of California, Merced
Moyalan, Joseph	University of California, Merced
Chen, YangQuan	University of California, Merced
De Castro, Ricardo	University of California, Merced
Keywords: Optimization, Optimization algorithms, Energy systems Abstract: This paper presents an optimization framework for scheduling electric vehicle (EV) charging at public stations, with the aim of minimizing overall user dissatisfaction while taking into account arrival time, charging duration, and power demand. To promote fairness, the framework differentiates between high-priority users (those without access to home or workplace charging) and low-priority users (those with such access). In addition, it ensures contiguous charging time slots by incorporating trigger functions into the optimization model. The problem is formulated as a Mixed Integer Linear Program (MILP). The effectiveness of the proposed approach is demonstrated through simulations using EV charger data from the UC Merced campus parking lot.


WeA13 Regular Session, Grand Salon 19	Add to My Program
Autonomous Systems I

Chair: Chen, Fei	Massachusetts Institute of Technology
Co-Chair: Samanipour, Pouya	University of Kentucky

10:30-10:45, Paper WeA13.1	Add to My Program
Verification Framework for the Union of Control Barrier Functions

Jiang, Chuanrui	Washington University in St. Louis
Clark, Andrew	Washington University in St. Louis
Keywords: Autonomous systems, Constrained control, Switched systems Abstract: Control Barrier Functions (CBFs) have been proposed to ensure safety of autonomous systems. This paper considers control policies that switch between CBF constraints. Under this approach, we represent a complex non-convex safe region as a union of sets that are computationally tractable to verify. We denote this framework as union-CBFs and make the following contributions. First, considering switching CBF-QP controllers, we propose a sufficient condition that ensures (i) the system undergoes a finite number of switches in any finite time interval and ensures (ii) the forward invariance of the closed-loop system in between switches. Second, we consider two types of switching strategies and propose union-CBFs conditions for each strategy to satisfy (i) and (ii). Third, we formulate Sum-of-Squares (SOS) algorithms to verify the conditions. The experiments show that our union-CBFs framework results in a larger safe region compared to high-degree polynomial CBFs. We also show the efficiency of the verification algorithms using a polynomial system model.

10:45-11:00, Paper WeA13.2	Add to My Program
ReLU Barrier Functions for Nonlinear Systems with Constrained Control: A Union of Invariant Sets Approach

Samanipour, Pouya	Univ. of Kentucky
Poonawala, Hasan A.	Univ. of Kentucky
Keywords: Autonomous systems, Formal verification/synthesis, Hybrid systems Abstract: Certifying safety for nonlinear systems with polytopic input constraints is challenging because control barrier function (CBF) synthesis must ensure control admissibility under input saturation. We propose an approximation-verification pipeline that performs convex barrier synthesis on piecewise-affine (PWA) surrogates and certifies safety for the original nonlinear system through facet-wise verification. To reduce conservatism while preserving tractability, we use a two-slope Leaky ReLU surrogate for the extended class-K function alpha(.) and combine multiple certificates using Union of Invariant Sets (UIS). Counterexamples are handled through local uncertainty updates. Simulations on pendulum and cart-pole systems with input saturation show larger certified invariant sets than linear-alpha designs while maintaining tractable computation time.

11:00-11:15, Paper WeA13.3	Add to My Program
Cooperative Visual-Inertial Navigation with Inter-Robot Coordinate Alignment

Liu, Jiayang	The University of Melbourne
Zhou, Yizhi	Geroge Mason University
Keywords: Estimation, Autonomous robots, Kalman filtering Abstract: This paper focuses on multi-robot navigation in GPS-denied environments, where accurate localization is essential for tasks such as environmental monitoring and search-and-rescue. While Visual–Inertial Navigation Systems (VINS) provide lightweight 6-DoF state estimation for single robots, extending them to multi-robot settings introduces the critical challenge of inter-robot frame alignment. Existing approaches often assume ground-truth initialization or overlook this issue, limiting their practicality. We propose a cooperative multi-robot VINS framework that augments the inter-robot transformation as part of the state and jointly estimates it online together with each robot’s trajectory. Crucially, our method leverages common observations as additional constraints, ensuring consistent frame alignment and enabling effective cross-robot fusion. Monte Carlo simulations on 3-D trajectories demonstrate that the proposed framework achieves lower error and improved consistency compared with single-robot baselines, validating its effectiveness for real-world multi-robot localization.

11:15-11:30, Paper WeA13.4	Add to My Program
Warm-Starting Optimization-Based Motion Planning for Robotic Manipulators Via Point Cloud-Conditioned Flow Matching

Tian, Sibo	Texas A&M University
Zheng, Minghui	Texas A&M University
Liang, Xiao	Texas A&M University
Keywords: Emerging control applications, Mechanical systems/robotics, Intelligent systems Abstract: Rapid robot motion generation is critical in Human-Robot Collaboration (HRC) systems, as robots need to respond to dynamic environments in real time by continuously observing their surroundings and replanning their motions to ensure both safe interactions and efficient task execution. Current sampling-based motion planners face challenges in scaling to high-dimensional configuration spaces and often require post-processing to interpolate and smooth the generated paths, resulting in time inefficiency in complex environments. Optimization-based planners, on the other hand, can incorporate multiple constraints and generate smooth trajectories directly, making them potentially more time-efficient. However, optimization-based planners are sensitive to initialization and may get stuck in local minima. In this work, we present a novel learning-based method that utilizes a Flow Matching model conditioned on a single-view point cloud to learn near-optimal solutions for optimization initialization. Our method does not require privileged knowledge of the environment, such as obstacle locations and geometries, and can generate feasible trajectories directly from single-view depth camera input. Simulation studies on a UR5e robotic manipulator in cluttered workspaces demonstrate that the proposed generative initializer achieves a high success rate on its own, significantly improves the success rate of trajectory optimization compared with traditional and learning-based benchmark initializers, requires fewer optimization iterations, and exhibits strong generalization to unseen environments.

11:30-11:45, Paper WeA13.5	Add to My Program
Context-Aware LLM-Based Safe Control against Latent Risks

Deng, Xiyu	Carnegie Mellon University
Luu, Quan	Purdue University
Ho, Van	JAIST
Nakahira, Yorie	Carnegie Mellon University
Keywords: Autonomous systems, Uncertain systems, Optimization Abstract: Autonomous control systems face significant challenges in executing complex tasks under latent risks—risks arising from occluded agents or partially observable dynamics. To address this, we propose a framework that integrates Large Language Models (LLMs), numerical optimization, and optimization-based control to enable context-aware subtask generation and refinement under latent risks. The framework operates across multiple timescales: (i) in-context learning with LLMs for leveraging past experience, (ii) multi-turn Chain-of-Thought reasoning with numerical optimization for refining subtask parameters, and (iii) real-time feedback control via model predictive control (MPC). Complex tasks are decomposed into subtasks that explicitly account for latent risks, refined through multi-turn optimization in physics-based simulation, and improved over time by accumulating failure-related examples for in-context learning. We validate the framework through simulation-based case studies involving robots and autonomous vehicles using GPT-4o, demonstrating its feasibility and potential to complement MPC with language-driven reasoning for safer decision-making in latent-risk scenarios.

11:45-12:00, Paper WeA13.6	Add to My Program
AuDeRe: Automated Strategy Decision and Realization in Robot Planning and Control Via LLMs

Meng, Yue	MIT
Chen, Fei	Massachusetts Institute of Technology
Chen, Yongchao	Harvard University
Fan, Chuchu	Massachusetts Institute of Technology
Keywords: Autonomous systems, Robotics Abstract: Recent advancements in large language models (LLMs) have shown significant promise in various domains, especially robotics. However, most prior LLM-based work in robotic applications either directly predicts waypoints or applies LLMs within fixed tool integration frameworks, offering limited flexibility in exploring and configuring solutions best suited to different tasks. In this work, we propose a framework that leverages LLMs to select appropriate planning and control strategies based on task descriptions, environmental constraints, and system dynamics. These strategies are then executed by calling the available comprehensive planning and control APIs. Our approach employs iterative LLM-based reasoning with performance feedback to refine the algorithm selection. We validate our approach through extensive experiments across tasks of varying complexity, from simple tracking to complex planning scenarios involving spatiotemporal constraints. The results demonstrate that using LLMs to determine planning and control strategies from natural language descriptions significantly enhances robotic autonomy while reducing the need for extensive manual tuning and expert knowledge. Furthermore, our framework maintains generalizability across different tasks and notably outperforms baseline methods that rely on LLMs for direct trajectory, control sequence, or code generation.


WeA14 Regular Session, Grand Salon 21	Add to My Program
Data Driven Control

Chair: Iwata, Takumi	Hiroshima University
Co-Chair: Ibuki, Tatsuya	Meiji University

10:30-10:45, Paper WeA14.1	Add to My Program
Recursive Gaussian Process Based Safety Assurance Exploration Control in Unknown Environments

Kanou, Kento	Meiji University
Ibuki, Tatsuya	Meiji University
Keywords: Data driven control, Machine learning, Constrained control Abstract: This article presents a novel method for environmental exploration that takes safety into account in unknown areas by using recursive Gaussian process regression (RGPR). Safety in unknown environments is ensured by an RGPR-based control barrier function, which is constructed non-parametrically from the posterior mean of RGPR. Simultaneously, an exploration function inspired by a control Lyapunov function is presented. This function utilizes the posterior variance of RGPR to achieve efficient exploration. Moreover, the use of RGPR mitigates the cubic growth in computational cost inherent to conventional Gaussian process regression, thereby enabling online learning. To validate the effectiveness of the proposed method, an experiment using differential-drive robots equipped with a LiDAR sensor is conducted.

10:45-11:00, Paper WeA14.2	Add to My Program
A Small-Gain Look at Cyber-Physical Security

Chakraborty, Sayan	New York University
Jiang, Zhong-Ping	New York University
Keywords: Data driven control, Optimal control, Networked control systems Abstract: This paper studies the resilience of cyber-physical systems under denial-of-service attacks. We develop a novel framework for resilient control that avoids the need for detailed information about the system or attacker dynamics by treating the plant–attacker interaction as an interconnected system. Using small-gain analysis and switching systems theory, we derive explicit resilience conditions, and employ reinforcement learning to synthesize an optimal policy directly from input–state data, estimating the required small-gain bounds in a data-driven manner. A numerical example illustrates the effectiveness of the proposed approach.

11:00-11:15, Paper WeA14.3	Add to My Program
Robust Data-Driven Receding-Horizon Control for LQR with Input Constraints

Zheng, Jian	Northeastern University
Sznaier, Mario	Northeastern University
Keywords: Data driven control, Optimal control, Predictive control for linear systems Abstract: This letter presents a robust data-driven receding-horizon control framework for the discrete-time linear quadratic regulator (LQR) with input constraints. Unlike earlier data-driven approaches that design a controller from initial data and apply it unchanged throughout the trajectory, our method exploits all available execution data in a receding-horizon manner, thereby capturing additional information about the unknown system and enabling less conservative performance. Existing data-driven LQR model predictive control methods rely on over-approximations of the consistency set and ell_2 descriptions of noise. In contrast, the proposed approach uses exact descriptions of the consistency set under ell_infty-bounded noise, leveraging duality to recast the problem into a tractable convex optimization. Further, the proposed controller renders the closed-loop system input-to-state stable. Simulation results demonstrate the effectiveness of the method.

11:15-11:30, Paper WeA14.4	Add to My Program
Gain-Scheduled Data-Enabled Predictive Control: A DeePC Approach for Nonlinear Systems

Guerrero, Margarita A.	KTH Royal Institute of Technology
Lakshminarayanan, Braghadeesh	KTH Royal Institute of Technology
Rojas, Cristian R.	KTH Royal Institute of Technology
Keywords: Data driven control, Predictive control for nonlinear systems, Linear parameter-varying systems Abstract: Model predictive control is a well established control technology for trajectory tracking. Its use requires the availability of an accurate model of the plant, but obtaining such a model is often time consuming and costly. Data-Enabled Predictive Control (DeePC) addresses this shortcoming in the linear time-invariant setting, by skipping the model building step and instead relying directly on input-output data. Unfortunately, many real systems are nonlinear and exhibit strong operating-point dependence. Building on classical linear parameter-varying control, we introduce DeePC-GS, a gain-scheduled extension of DeePC for unknown, regime-varying systems. The key idea is to allow DeePC to switch between different local Hankel matrices—selected online via a measurable scheduling variable—thereby uniting classical gain scheduling tools with identification-free, data-driven MPC. We test the effectiveness of our DeePC-GS formulation on a nonlinear ship-steering benchmark, demonstrating that it outperforms state-of-the-art data-driven MPC while maintaining tractable computation.

11:30-11:45, Paper WeA14.5	Add to My Program
Data-Fused MPC with Guarantees: Application to Flying Humanoid Robots

Gorbani, Davide	Italian Institute of Technology
Elobaid, Mohamed	King Abdullah University of Science and Technology
L'Erario, Giuseppe	Istituto Italiano Di Tecnologia
Mohamed, Hosameldin Awadalla Omer	Italian Institute of Technology
Pucci, Daniele	Istituto Italiano Di Tecnologia
Keywords: Data driven control, Robotics, Predictive control for linear systems Abstract: This paper introduces a Data-Fused Model Predictive Control (DFMPC) framework that combines physics-based models with data-driven representations of unknown dynamics. Leveraging Willems’ Fundamental Lemma and an artificial equilibrium formulation, the method enables tracking of changing, potentially unreachable setpoints while explicitly handling measurement noise through slack variables and regularization. We provide guarantees of recursive feasibility and practical stability under input–output constraints for a specific class of reference signals. The approach is validated on the iRonCub flying humanoid robot, integrating analytical momentum models with data-driven turbine dynamics. Simulations show improved tracking and robustness compared to a purely model-based MPC, while maintaining real-time feasibility.

11:45-12:00, Paper WeA14.6	Add to My Program
Data Informativity for Analysis and Design of Positive Systems

Iwata, Takumi	Hiroshima University
Azuma, Shun-ichi	Kyoto University
Nagahara, Masaaki	Hiroshima University
Peaucelle, Dimitri	LAAS-CNRS, Université De Toulouse
Ebihara, Yoshio	Kyushu University
Keywords: Compartmental and Positive systems, Data driven control, Constrained control Abstract: This paper investigates data informativity of positive systems using linear programming (LP). The concept called data informativity represents the sufficiency of a given dataset to solve analysis/design problems. In this paper, we provide the necessary and sufficient conditions for the data-driven analysis and design problems of positive systems to be solvable. Moreover, we clarify that these conditions are characterized by LP problems.


WeA15 Regular Session, Grand Salon 22	Add to My Program
Lyapunov Methods

Chair: Goel, Ankit	University of Maryland Baltimore County
Co-Chair: Allen, Brendon C.	Auburn University

10:30-10:45, Paper WeA15.1	Add to My Program
Sampling-Based Efficient Receding Horizon Control Barrier Functions

Li, Anni	The University of North Carolina at Charlotte
Xiao, Wei	WPI
Keywords: Lyapunov methods, Constrained control, Optimal control Abstract: Control Barrier Functions (CBFs) are widely used in guaranteeing safety for nonlinear systems. They can conservatively map a constrained optimal control problem into a sequence of point-wise Quadratic Programs (QPs) for affine control systems. One of the challenges with the CBF-based point-wise optimization is its myopic property: optimality and safety are only considered at the current time instant. This could make the system overly aggressive and susceptible to infeasibility in the optimization. To address this issue, we propose a sampling method for CBFs to formulate a receding horizon QP. Specifically, at each iteration, we first sample a trajectory based on the current state by a tractable planning method, and then incorporate it into nonlinear dynamics and constraints to obtain piecewise linear approximations that are then enforced by CBFs within the planning horizon. The eventual optimization problem is reformulated into a sequence of receding horizon QPs instead of complex nonlinear programs. We show that the proposed framework can still guarantee safety while staying close to the (optimal) planning trajectory with minimum deviation. We illustrate our approach on traffic merging and robot obstacle avoidance, and compare it with other methods to show its advantage in efficiency and guarantees.

10:45-11:00, Paper WeA15.2	Add to My Program
Accelerating Lyapunov-Stable Neural Control Using Fulfillment Priority Logic

Abdelgawad, Abdelrahman	Boston University
El Mabsout, Bassel	Boston University
Wang, Zili	Boston University
Mancuso, Renato	Boston University
Andersson, Sean B.	Boston University
Tron, Roberto	Boston University
Keywords: Lyapunov methods, Formal verification/synthesis, Neural networks Abstract: We present a two-stage approach for learning stability-certified neural controllers that achieves a reduction of up to ~95% in training time compared to the state-of-the-art baseline, which introduced monotonic neural Lyapunov architectures. Our method combines monotonic neural Lyapunov functions with fulfillment priority logic (FPL) to efficiently initialize controllers before formal verification. Traditional approaches for jointly learning controllers and neural Lyapunov functions require computationally expensive mixed-integer linear programming (MILP) or satisfiability modulo theory (SMT) solvers at each training iteration, often taking several hours to converge. We address this bottleneck by leveraging FPL to perform early joint initialization of the controller and Lyapunov networks. Building on the monotonic neural network architecture from the baseline, which guarantees non-negativity and a unique global minimum by construction, our method focuses on efficiently satisfying the remaining property of decreasing along trajectories. Existing works focus on maximizing the region of attraction/convergence of the learned controller. In contrast, leveraging FPL allows us to (1) increase learning efficiency substantially and (2) focus on complementary performance metrics, such as convergence rate and control effort minimization, thereby adding significant specification flexibility. In this paper, we encode an approximate Lyapunov-decrease condition in FPL to pre-train the controller and Lyapunov networks, then apply a MILP-based verification/refinement step. This decouples efficient learning from certificate enforcement and allows the FPL specification to include auxiliary objectives (e.g., convergence rate and control effort), whose influence persists through the final MILP pass. The resulting controllers converge rapidly while admitting formal Lyapunov certificates on standard nonlinear control benchmarks.

11:00-11:15, Paper WeA15.3	Add to My Program
Augmented Lyapunov-Net: Incorporating Adversarial Verifier for the Construction of Lyapunov Functions Using Neural Networks

Zhao, Yuxuan	Hong Kong University of Science and Technology
Wang, Junkai	Georgia Institute of Technology
Zhang, Fumin	Hong Kong University of Science and Technology
Keywords: Lyapunov methods, Neural networks, Learning Abstract: Lyapunov functions are pivotal for analyzing the stability of dynamical systems. While neural networks have emerged as effective approximators, existing methods often struggle with hard-to-fit regions and scale poorly to high-dimensional systems. To address these limitations, this paper introduces the Augmented Lyapunov-Net, a novel framework that integrates a modified CounterExample Guided Inductive Synthesis (CEGIS) architecture. Our framework features a Learner, implemented by the Lyapunov-Net, and an Adversarial Verifier, which actively employs gradient-free optimization to pinpoint states that violate the Lie derivative condition. These strategically discovered counterexamples then guide a focused sampling procedure, enabling the Learner to efficiently refine the candidate function in critical regions. Numerical experiments on several benchmark systems demonstrate that the proposed method achieves higher approximation efficiency to higher-dimensional systems compared to the original Lyapunov-Net.

11:15-11:30, Paper WeA15.4	Add to My Program
Stability Preserving Safe Control of a Bicopter

Portella Delgado, Jhon Manuel	University of Maryland Baltimore County
Goel, Ankit	University of Maryland Baltimore County
Keywords: Lyapunov methods, Stability of nonlinear systems, Constrained control Abstract: This paper presents a control law for stabilization and trajectory tracking of a multicopter subject to safety constraints. The proposed approach guarantees forward invariance of a prescribed safety set while ensuring smooth tracking performance. Unlike conventional control barrier function methods, the constrained control problem is transformed into an unconstrained one using state-dependent mappings together with carefully constructed Lyapunov functions. This approach enables explicit synthesis of the control law, instead of requiring a solution of constrained optimization at each step. The transformation also enables the controller to enforce safety without sacrificing stability or performance. Simulation results for a polytopic reference trajectory confined within a designated safe region demonstrate the effectiveness of the proposed method.

11:30-11:45, Paper WeA15.5	Add to My Program
A Lyapunov and Concurrent Learning-Based Approach to Train the Weights of a DNN-Based Controller in Real-Time

Basyal, Sujata	Auburn University
Ting, Jonathan	Auburn University
Mishra, Kislaya	Auburn University
Allen, Brendon C.	Auburn University
Keywords: Lyapunov methods, Stability of nonlinear systems, Neural networks Abstract: Deep neural networks (DNNs) are a function approximation tool that are capable of approximating complex dynamic functions by learning complex relationships between the input-output data through training and optimization. Classical DNN training methods utilize numerical optimization tools to improve the function approximation performance. However, in most cases, DNN-based controllers are implemented in an openloop manner, meaning that the DNN model in the controller does not adapt its weights online and therefore fails to take into consideration any unexpected behavior that could hinder system stability and robustness. To tackle this issue, the authors recently leveraged Lyapunov-based methods to develop innovative realtime update laws that train a DNN’s weights in real-time, ensuring stability and improved performance. In this paper, the performance and stability result for DNN-based controllers are further improved by incorporating a Lyapunov-based concurrent learning (CL) inspired term within the real-time DNN update law. A rigorous Lyapunov-based stability analysis was performed to ensure that the proposed DNN-based controller and CL augmented update law results in global exponential convergence of the tracking errors and the DNN weight estimation errors towards an ultimate bound. Thus, for the first time, the proposed approach will ensure both trajectory tracking and convergence of the DNN weights towards their optimal values. Furthermore, simulations were performed that demonstrate the significant potential of the developed DNN-based control system.

11:45-12:00, Paper WeA15.6	Add to My Program
A Nonlinear Robust Controller Formulation for a Class of Systems Subject to Hysteresis

Hindistan, Cagri	Ege University
Selim, Erman	Ege University
Tatlicioglu, Enver	Ege University
Zergeroglu, Erkan	Gebze Technical University
Keywords: Lyapunov methods Abstract: This work presents the design and corresponding analysis of a robust tracking controller for a class of nonlinear systems subject to hysteresis. Specifically, the hysteresis effects are assumed to be in the form of the Bouc-Wen model and despite the lack of accurate knowledge of system dynamics and hysteresis model parameters, a continuous robust controller is designed via the use of the robustness of integral of sign of the error feedback. The stability of the closed--loop system and the convergence of the tracking error are ensured via Lyapunov based arguments. The tracking performance of the proposed control framework is investigated by numerical simulation studies for a second order system.


WeA16 Regular Session, Grand Salon 24	Add to My Program
Reinforcement Learning I

Chair: Mitra, Aritra	North Carolina State University
Co-Chair: Talebi, Shahriar	University of California, Los Angeles

10:30-10:45, Paper WeA16.1	Add to My Program
Training Task Reasoning LLM Agents for Multi-Turn Task Planning Via Single-Turn Reinforcement Learning

Hu, Hanjiang	Carnegie Mellon University
Liu, Changliu	Carnegie Mellon University
Li, Na	Harvard University
Wang, Yebin	Mitsubishi Electric Research Labs
Keywords: Reinforcement learning, Machine learning, Emerging control applications Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities in knowledge acquisition, reasoning, and tool use, making them promising candidates for autonomous agent applications. However, training LLM agents for complex multi-turn task planning faces significant challenges, including sparse episode-wise rewards, credit assignment across long horizons, and the computational overhead of reinforcement learning in multi-turn interaction settings. To this end, this paper introduces a novel approach that transforms multi-turn task planning into single-turn task reasoning problems, enabling efficient policy optimization through Group Relative Policy Optimization (GRPO) with dense and verifiable reward from expert trajectories. Our theoretical analysis shows that GRPO improvement on single-turn task reasoning results in higher multi-turn success probability under the minimal turns, as well as the generalization to subtasks with shorter horizons. Experimental evaluation on the complex task planning benchmark demonstrates that our 1.5B parameter model trained with single-turn GRPO achieves superior performance compared to larger baseline models up to 14B parameters, with success rates of 70% for long-horizon planning tasks with over 30 steps. We also theoretically and empirically validate the strong cross-task generalizability that the models trained on complex tasks can lead to the successful completion of all simpler subtasks.

10:45-11:00, Paper WeA16.2	Add to My Program
Robust Federated Q-Learning with Almost No Communication

Maity, Sreejeet	North Carolina State University, Raleigh
Mitra, Aritra	North Carolina State University
Keywords: Reinforcement learning, Machine learning, Optimization algorithms Abstract: We consider a federated reinforcement learning setting involving M agents, all of whom interact with a common Markov Decision Process (MDP). The agents exchange information via a central server to learn the optimal value function. Our goal is to understand to what extent one can hope for collaborative sample-complexity speedups in such a setting, when a small fraction of the agents are adversarial and can act arbitrarily. To that end, we propose Robust Fed-Q, a federated Q-learning algorithm that blends ideas from both model-based and model-free RL, along with the median-of-means device from robust statistics. We prove that despite corruption, with high-probability, Robust Fed-Q (i) guarantees emph{exact} convergence to the optimal value function in the limit of infinite samples, and (ii) enjoys near-optimal finite-time rates that benefit from collaboration. In addition, our approach requires just tilde{O}(1) rounds of communication to achieve each of the above guarantees, a feature of independent interest in FL where communication is the major bottleneck.

11:00-11:15, Paper WeA16.3	Add to My Program
The Deception Magic: Using Reward Machines for Privacy-Preserving Reinforcement Learning

Meshkat Alsadat, Shayan	Arizona State University
Xu, Zhe	Arizona State University
Keywords: Reinforcement learning, Markov processes Abstract: In reinforcement learning (RL) scenarios where agents must preserve privacy about their objectives, deceptive behavior becomes essential to prevent adversarial observers from inferring the agent’s true intentions. Unlike existing approaches that often assume Markovian reward functions or require model-based planning, we propose a model-free deceptive reinforcement learning framework using reward machines (RMs) that operates under adversarial settings. We introduce a belief-induced reward mechanism for deceptive RL using RMs, where agents balance ground truth task completion with entropy maximization over observer beliefs about candidate reward functions. RMs allow us to encode complex temporal dependencies in non-Markovian reward structures and guide the deceptive learning process through structured task representation. Our methods introduce Q-learning and actor-critic algorithms with reward machines to learn optimal deceptive policies that distract observers while maintaining task completion objectives, ensuring effective privacy preservation without compromising primary goals. We establish theoretical guarantees, demonstrating that our algorithms converge to optimal deceptive policies with privacy preservation properties. We further evaluate our methods against baselines in experimental case studies to demonstrate their effectiveness in balancing deception and task performance.

11:15-11:30, Paper WeA16.4	Add to My Program
Hereditary Geometric Meta-RL: Non-Local Generalization Via Task Symmetries

Nitschke, Paul	Harvard University
Talebi, Shahriar	University of California, Los Angeles
Keywords: Reinforcement learning, Mechanical systems/robotics, Robotics Abstract: Meta-Reinforcement Learning (Meta-RL) commonly generalizes via smoothness in the task encoding. While this enables local generalization around each training task, it requires dense coverage of the task space and leaves richer task space structure untapped. In response, we develop a geometric perspective that endows the task space with a "hereditary geometry" induced by the inherent symmetries of the underlying system. Concretely, the agent reuses a policy learned at the train time by transforming states and actions through actions of a Lie group. This converts Meta-RL into symmetry discovery rather than smooth extrapolation, enabling the agent to generalize to wider regions of the task space. We show that when the task space is inherited from the symmetries of the underlying system, the task space embeds into a subgroup of those symmetries whose actions are linearizable, connected, and compact--properties that enable efficient learning and inference at the test time. To learn these structures, we develop a differential symmetry discovery method. This collapses functional invariance constraints and thereby improves numerical stability and sample efficiency over functional approaches. Empirically, on a two-dimensional navigation task, our method efficiently recovers the ground-truth symmetry and generalizes across the entire task space, while a common baseline generalizes only near training tasks.

11:30-11:45, Paper WeA16.5	Add to My Program
Nash Q-Learning with Inferring Causal Signal Temporal Logic: A Study of Competitive Multi-Agent Systems

Partovi Aria, Hadi	Arizona State University
Xu, Zhe	Arizona State University
Keywords: Reinforcement learning Abstract: Multi-agent reinforcement learning with temporal logic presents unique challenges, particularly in competitive settings where agents pursue conflicting objectives. This paper presents NASTL-CIRL (Nash Signal Temporal Logic for Causal Inference in Reinforcement Learning), a novel approach that combines Nash Q-Learning with Causal Signal Temporal Logic (Causal STL) inference to guide agent behavior in competitive environments. Our approach enables agents to infer causal relationships through STL formulas that explain agent behavior and environmental dynamics, providing strategic advantages. Pacman Game scenario and Factory Assembly scenario are conducted to show the superior performance of the proposed NASTL-CIRL algorithm compared to baseline methods. Our results show that integrating causal STL inference with Nash equilibrium concepts leads to more structured and interpretable agent behaviors while maintaining competitive performance.

11:45-12:00, Paper WeA16.6	Add to My Program
Lipschitz-Aware Exploration for Safety in Reinforcement Learning

Jha, Mayank Shekhar	University of Lorraine
Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Marthi, Satya Vinay Chavan	CRAN
Theilliol, Didier	Universite De Lorraine
Keywords: Reinforcement learning, Neural networks, Optimal control Abstract: This paper develops a novel Lipschitz-aware safe exploration framework for reinforcement learning in environments with abrupt, unmodeled safety variations. Local Lipschitz constants of the safety function are estimated online using kernel density estimation (KDE), providing a data-driven measure of rapid changes in the safety landscape. These estimates are incorporated into a robust quadratic program (QP) with Lipschitz-aware control barrier function (CBF) constraints, yielding a safe exploration law that guarantees forward invariance of an enlarged safe set under probing noise. The exploration mechanism is then coupled with a safety-aware learning stage to obtain a unified safe RL framework. Simulations on an inverted pendulum illustrate the efficacy of the proposed approach.


WeA17 Invited Session, Churchill A1	Add to My Program
Healthcare and Medical Systems (I)

Chair: Pereira, Emily	Texas Tech University
Co-Chair: Hahn, Jin-Oh	University of Maryland
Organizer: Menezes, Amor A.	University of Florida
Organizer: Hahn, Jin-Oh	University of Maryland
Organizer: Medvedev, Alexander V.	Uppsala University
Organizer: Mesbah, Ali	University of California, Berkeley
Organizer: Pereira, Emily	Texas Tech University
Organizer: Zhang, Wenlong	Arizona State University

10:30-10:45, Paper WeA17.1	Add to My Program
Analytical Characterization of Inter-Spike Interval Statistics in Excitatory–inhibitory Neuronal Networks (I)

Gambrell, Oliver	University of Delaware
Bokes, Pavol	Comenius University
Singh, Abhyudai	University of Delaware
Keywords: Biological systems Abstract: A key component of intraneuronal communication is the modulation of postsynaptic firing frequencies by stochastic transmitter release from presynaptic neurons. The time interval between successive postsynaptic firings is called the inter-spike interval (ISI), and understanding its statistics is integral to neural information processing. We start with a model of an excitatory chemical synapse with postsynaptic neuron firing governed as per a classical integrate-and-fire model. Using a first-passage time framework, we derive exact analytical results for the ISI statistical moments, revealing parameter regimes driving precision in postsynaptic action potential timing. Next, we extended this analysis to include both an excitatory and an inhibitory presynaptic connection onto the same postsynaptic neuron. We consider both a fixed postsynaptic-firing threshold and a threshold that adapts based on the postsynaptic membrane potential history. Our analysis shows that the latter adaptive threshold can result in scenarios where increasing the inhibitory input frequency increases the postsynaptic firing frequency. Moreover, we characterize parameter regimes where ISI noise is hypo-exponential or hyper-exponential based on its coefficient of variation being less than or higher than one, respectively.

10:45-11:00, Paper WeA17.2	Add to My Program
Toward Skill-Informed Haptic Feedback for Human Motor Learning in High-Dimensional De-Novo Tasks (I)

Kamboj, Ankur	Michigan State University
Ranganathan, Rajiv	Michigan State University
Tan, Xiaobo	Michigan State University
Srivastava, Vaibhav	Michigan State University
Keywords: Human-in-the-loop control, Markov processes, Biomedical Abstract: This work addresses the challenge of designing haptic feedback for high-dimensional motor learning tasks, which is difficult due to the latent evolution of motor skill and the redundancy inherent in high-dimensional control spaces. We propose to formulate the feedback design problem as a Partially Observable Markov Decision Process (POMDP) and to compute an optimal nudging policy. In our framework, the POMDP's latent states represent the evolution of the learner's internal motor skill during a high-dimensional, de-novo (novel) motor learning task performed with a hand exoskeleton. Results from a human study (N=10) on novel motor skill acquisition show faster improvement in task performance in the participant group trained with our POMDP-derived nudging policy than in a group trained with no feedback. These results demonstrate the potential of skill-informed feedback design to accelerate learning and performance.

11:00-11:15, Paper WeA17.3	Add to My Program
Reduced-Input Model Predictive Control for Electrical Stimulation in Multi-Electrode Systems (I)

Hakam, Noor	North Carolina State University
Singh, Mayank	North Carolina State Univeristy
Xue, Xiangming	North Carolina State University
Lambeth, Krysten	North Carolina State University
Favorov, Oleg	University of North Carolina at Chapel Hill
Sharma, Nitin	North Carolina State University
Keywords: Biomedical, Model Validation, Human-in-the-loop control Abstract: Abstract--- Functional electrical stimulation (FES) with multi-electrode systems enables more flexible control of muscle activation but is complicated by nonlinear, redundant input–output relationships that often lead to unintended co-activation. We address this as a control allocation problem and introduce a selectivity-aware framework that integrates Dynamic Active Subspaces (DAS) with Model Predictive Control (MPC). By reducing the input space to dominant, task-relevant directions, the method simplifies optimization, reducing the computational intensity of the control problem, while preserving biological richness. We demonstrate the approach using simulations from an invasive sciatic nerve cuff in a pig model.

11:15-11:30, Paper WeA17.4	Add to My Program
Ergodic Quasilinearization and Control for Brain Dynamics (I)

Nasiriziba, Ilia	Department of Mechanical Engineering, University of Illinois at Urbana-Champaign
Singh, Matthew	University of Illinois, Urbana-Champaign; Beckman Institute for Advanced Science & Technology
Keywords: Control of networks, Large-scale systems, Stochastic optimal control Abstract: Controlling complex, nonlinear systems, such as the brain, presents a fundamental challenge that requires simplified models for practical controller design. Traditional approaches often fail when these systems operate far from steady states under noise and changing inputs. Different control strategies drive systems into distinct behavioral regimes, each requiring a specific approximation. Rather than imposing a single approximation, this work introduces ergodic quasilinearization (EQL), which automatically identifies the appropriate linear model for each operating scenario. EQL generates adaptive linear models whose parameters adjust based on the system's long-term statistical behavior under varying inputs and noise levels. These statistics are derived analytically from the steady-state equalities, eliminating the need for repeated computation of the full nonlinear dynamics. The effectiveness of EQL is demonstrated on large-scale brain network models, where traditional methods encounter difficulties due to complex nonlinearities and high dimensionality. Conventional linearization methods perform well under fixed conditions but lose accuracy when control strategies change the operating regime. In contrast, EQL maintains accuracy across diverse operating scenarios, supporting robust controller design for systems that rarely reach simple steady states. We demonstrate the power of EQL in predicting brain-model responses to complex stimulation protocols and in identifying an optimal open-loop control for reproducing target brain-activity patterns.

11:30-11:45, Paper WeA17.5	Add to My Program
Blood Pressure Prediction During Blood Transfusion: A Population-Informed Multi-Modal Sequential Inference Approach

Kao, Yi-Ming	University of Maryland
Rezaei, Parham	University of Maryland, College Park
Masoumi Shahrbabak, Sina	University of Maryland
Pepino, Jeremy	Massachusetts General Hospital
Shogren, Ian	Massachusetts General Hospital
Wang, Yang	Massachusetts General Hospital
Reisner, Andrew	Harvard Medical School
Hahn, Jin-Oh	University of Maryland
Keywords: Healthcare and medical systems, Biomedical Abstract: Blood pressure (BP) management is a critical component of blood transfusion, but no mature technology capable of predicting BP response to blood transfusion exists. This paper concerns the development and preliminary in vivo testing of a BP prediction method applicable to hemorrhage and blood transfusion. Key obstacles are (i) large inter-individual variability in the BP response to blood transfusion, (ii) unknown hemorrhage, and (iii) input/state-dependent observability. To cope with these challenges, we developed a multi-modal sequential inference-enabled BP prediction method built upon a mathematical model of patient physiology parameterized by population-informed prior. The method infers patient-specific physiological state and hemorrhage, and uses them to predict future BP in a patient receiving blood transfusion. The in vivo testing of the method using the data collected from large animals undergoing hemorrhage and blood transfusion showed that it could adequately predict mean arterial BP with median absolute errors for 5-min and 15-min predictions of 3.1 mmHg and 7.4 mmHg as well as adequately infer physiological state and hemorrhage: all the hemorrhage events were detected with <3.5 min delay, with median F1 score of 85%. In sum, the prediction of BP response to blood transfusion may be feasible, even in the presence of unknown hemorrhage.

11:45-12:00, Paper WeA17.6	Add to My Program
Symptom-Driven Personalized Proton Pump Inhibitors Therapy Using Bayesian Neural Networks and Model Predictive Control

Li, Yutong	University of Michigan, Ann Arbor
Kolmanovsky, Ilya V.	The University of Michigan
Keywords: Constrained control, Biomedical, Neural networks Abstract: Proton Pump Inhibitors (PPIs) are the standard of care for gastric acid disorders but carry significant risks when administered chronically at high doses. Precise long-term control of gastric acidity is challenged by the impracticality of invasive gastric acid monitoring beyond 72 hours and wide inter-patient variability. We propose a noninvasive, symptom-based framework that tailors PPI dosing solely on patient-reported reflux and digestive symptom patterns. A Bayesian Neural Network prediction model learns to predict patient symptoms and quantifies its uncertainty from historical symptom scores, meal, and PPIs intake data. These probabilistic forecasts feed a chance-constrained Model Predictive Control (MPC) algorithm that dynamically computes future PPI doses to minimize drug usage while enforcing acid suppression with high confidence—without any direct acid measurement. In silico studies over diverse dietary schedules and virtual patient profiles demonstrate that our learning-augmented MPC reduces total PPI consumption by 65% compared to standard fixed regimens, while maintaining acid suppression with at least 95% probability. The proposed approach offers a practical path to personalized PPI therapy, minimizing treatment burden and overdose risk without invasive sensors.


WeA18 Regular Session, Churchill A2	Add to My Program
Constrained Control I

Chair: Kiumarsi, Bahare	Michigan State University
Co-Chair: Molnar, Tamas G.	Wichita State University

10:30-10:45, Paper WeA18.1	Add to My Program
Safety-Certified Planning and Control in Dynamic Environments Via Model Predictive Control

Khaledi, Marjan	Michigan State University
Kiumarsi, Bahare	Michigan State University
Keywords: Constrained control, Optimization Abstract: This paper proposes a novel approach to safe navigation in environments with static and dynamic obstacles by embedding control barrier functions (CBFs) within the model predictive control (MPC) framework. Unlike conventional methods that rely on unbounded additive slack variables, the proposed approach enforces each CBF constraint separately, allowing individual flexibility through dedicated slack variables with bounded relaxation weights. These weights modulate the permissible degree of constraint relaxation, ensuring that any safety softening remains quantitatively bounded, systematically tunable, and theoretically consistent with the CBF-based safety guarantees. Furthermore, the feasibility of the proposed approach is guaranteed, and the effectiveness of our method is demonstrated through the simulation results.

10:45-11:00, Paper WeA18.2	Add to My Program
Lag-Compensating Control Barrier Functions for Feasible Safety-Critical Control

Chen, Yuchen	University of Michigan
Molnar, Tamas G.	Wichita State University
Orosz, Gabor	University of Michigan
Keywords: Constrained control, Lyapunov methods, Nonlinear output feedback Abstract: First-order lags are commonly used in control system design to approximate system delays, which can significantly compromise system safety. To maintain safety under lags, this paper introduces the concept of lag-compensating control barrier function (LCCBF). First, we propose a lag-eliminating state transformation that removes first-order lag dynamics from linear systems. Then, we use this transformation to construct an LCCBF from the CBF designed for the lag-free system. We prove that controllers synthesized via the LCCBF ensure safety under actuator lags, while preserving feasibility for the same input bounds that are used to synthesize controllers via the original CBF. Finally, we simulate a double integrator with actuator limits to validate that the LCCBF maintains both safety and feasibility under lags.

11:00-11:15, Paper WeA18.3	Add to My Program
Safety-Critical Control with Guaranteed Lipschitz Continuity Via Filtered Control Barrier Functions

Liu, Shuo	Boston University
Xiao, Wei	WPI
Belta, Calin	University of Maryland
Keywords: Constrained control, Lyapunov methods, Optimal control Abstract: In safety-critical control systems, ensuring both system safety and smooth control input is essential for practical deployment. Existing Control Barrier Function (CBF) frameworks, especially High-Order CBFs (HOCBFs), effectively enforce safety constraints, but also raise concerns about the smoothness of the resulting control inputs. While smoothness typically refers to continuity and differentiability, it does not by itself ensure bounded input variation. In contrast, Lipschitz continuity is a stronger form of continuity that not only is necessary for the theoretical guarantee of safety, but also bounds the rate of variation and eliminates abrupt changes in the control input. Such abrupt changes can degrade system performance or even violate actuator limitations, yet current CBF-based methods do not provide Lipschitz continuity guarantees. This paper introduces Filtered Control Barrier Functions (FCBFs), which extend HOCBFs by incorporating an auxiliary dynamic system—referred to as an input regularization filter—to produce Lipschitz continuous control inputs. The proposed framework ensures safety, control bounds, and Lipschitz continuity of the control inputs simultaneously by integrating FCBFs and HOCBFs within a unified quadratic program (QP). Theoretical guarantees are provided and simulations on a unicycle model demonstrate the effectiveness of the proposed method compared to standard and smoothness-penalized HOCBF approaches.

11:15-11:30, Paper WeA18.4	Add to My Program
Input-To-State Safe Backstepping: Robust Safety-Critical Control with Unmatched Uncertainties

Cohen, Max	North Carolina State University
Ong, Pio	California Institute of Technology
Ames, Aaron D.	California Institute of Technology
Keywords: Constrained control, Lyapunov methods Abstract: Guaranteeing safety in the presence of unmatched disturbances---uncertainties that cannot be directly canceled by the control input---remains a key challenge in nonlinear control. This paper presents a constructive approach to safety-critical control of nonlinear systems with unmatched disturbances. We first present a generalization of the input-to-state safety (ISSf) framework for systems with these uncertainties using the recently developed notion of an Optimal Decay CBF, which provides more flexibility for satisfying the associated Lyapunov-like conditions for safety. From there, we outline a procedure for constructing ISSf-CBFs for two relevant classes of systems with unmatched uncertainties: i) strict-feedback systems; ii) dual-relative-degree systems, which are similar to differentially flat systems. Our theoretical results are illustrated via numerical simulations of an inverted pendulum and planar quadrotor.

11:30-11:45, Paper WeA18.5	Add to My Program
Combinatorial Control Barrier Functions: Nested Boolean and P-Choose-R Compositions of Safety Constraints

Ong, Pio	California Institute of Technology
Lee, Haejoon	University of Michigan
Molnar, Tamas G.	Wichita State University
Panagou, Dimitra	University of Michigan, Ann Arbor
Ames, Aaron D.	California Institute of Technology
Keywords: Constrained control, Lyapunov methods Abstract: This paper investigates the problem of composing multiple control barrier functions (CBFs)---and matrix control barrier functions (MCBFs)---through logical and combinatorial operations. Standard CBF formulations naturally enable conjunctive (AND) combinations, but disjunctive (OR) and more general logical structures introduce nonsmoothness and possibly a combinatorial blow-up in the number of logical combinations. We introduce the framework of combinatorial CBFs that addresses p-choose-r safety specifications and their nested composition. The proposed framework ensures safety for the exact safe set in a scalable way, using the original number of primitive constraints. We establish theoretical guarantees on safety under these compositions, and we demonstrate their use on a patrolling problem in a multi-agent system.

11:45-12:00, Paper WeA18.6	Add to My Program
Reformulations of Quadratic Programs for Lipschitz Continuity

Agrawal, Devansh Ramgopal	University of Michigan
Lee, Haejoon	University of Michigan
Panagou, Dimitra	University of Michigan, Ann Arbor
Keywords: Constrained control, Optimization Abstract: Optimization-based controllers often lack regularity guarantees, such as Lipschitz continuity, when multiple constraints are present. When used to control a dynamical system, these conditions are essential to ensure the existence and uniqueness of the system's trajectory. Here we propose a general method to convert a Quadratic Program (QP) into a Second Order Cone Program (SOCP), which is shown to be Lipschitz continuous. Key features of our approach are that (i) the regularity of the resulting formulation does not depend on the structural properties of the constraints, such as the linear independence of their gradients; and (ii) it admits a closed-form solution under some assumptions, which is not available for general QPs with multiple constraints, enabling faster computation. We support our method with rigorous analysis and examples.


WeA19 Regular Session, Churchill B1	Add to My Program
Student Best Paper Award

Chair: Hale, Matthew	Georgia Institute of Technology
Co-Chair: Fekih, Afef	University of Louisiana at Lafayette


WeA20 Invited Session, Churchill B2	Add to My Program
Set-Based Methods in Dynamic Systems and Control I

Chair: Pangborn, Herschel	The Pennsylvania State University
Co-Chair: Ruths, Justin	University of Texas at Dallas
Organizer: Koeln, Justin	University of Texas at Dallas
Organizer: Pangborn, Herschel	The Pennsylvania State University
Organizer: Jain, Neera	Purdue University
Organizer: Ruths, Justin	University of Texas at Dallas

10:30-10:45, Paper WeA20.1	Add to My Program
A Set-Based Approach for Stable MPC with Minkowski Cost Functions (I)

Leung, Jordan	Mitsubishi Electric Research Laboratories
P. Vinod, Abraham	Mitsubishi Electric Research Laboratories
Keywords: Predictive control for linear systems, Constrained control, Optimal control Abstract: This paper considers the formulation and implementation of model predictive control (MPC) laws with cost functions defined as Minkowski functions of convex, compact sets. We propose constructive procedures for selecting stabilizing terminal elements based on a lambda-contractive set. In addition, we describe a set-based implementation of the exact MPC policy based on constrained zonotopes and a suboptimal MPC policy based on a polytopic set approximation. Our numerical simulations demonstrate that the proposed suboptimal set-based policy achieves performance comparable to the standard trajectory-based optimal control formulation, while requiring less computational effort for low-dimensional systems.

10:45-11:00, Paper WeA20.2	Add to My Program
Safe Control of Sampled-Data Systems Via Discretized Parametric Contracting Dynamics (I)

Marvi, Zahra	University of Minnesota
Bullo, Francesco	Univ of California at Santa Barbara
Alleyne, Andrew G.	University of Minnesota
Keywords: Constrained control, Hybrid systems, Lyapunov methods Abstract: This paper presents discrete control barrier proximal dynamics (D-CBPD), a computationally-light safe control method for sampled-data systems. Ensuring safety in such systems requires bridging the gap between continuous-time control design and discrete implementation. D-CBPD is a discretized contracting dynamics that solves a control barrier function (CBF)-based quadratic program (QP). We first characterize how discretized parametric contracting dynamics track their fixed point and derive an explicit upper bound on the tracking error. We then analyze their use as discrete controllers for continuous-time systems. Leveraging Lipschitz properties, we show that trajectories remain bounded under constant inputs at each interval and exhibit linear growth in passive and weakly contracting systems. This allows us to bound the deviation of the discrete control signal from the optimal continuous-time controller, both at sampling instants and during inter-sample evolution. Interpreting this deviation as an input disturbance enables analysis of the discrete-time controller as a continuous-time system subject to disturbance. Building on this, we introduce D-CBPD, a safe and scalable discrete controller that guarantees safety of a continuously evolving system with a bounded and adjustable violation margin, provide explicit condition for step size to ensure convergence, and demonstrate its effectiveness in thermal management of a simplified lithium-ion battery.

11:00-11:15, Paper WeA20.3	Add to My Program
Online Constraint Tightening for MPC Using Constrained Zonotope Reachability Analysis and Zonotope Over-Approximations (I)

Robbins, Joshua	The Pennsylvania State University
Glunt, Jonah	The Pennsylvania State University
Thompson, Andrew	The Pennsylvania State University
Pangborn, Herschel	The Pennsylvania State University
Keywords: Predictive control for nonlinear systems, Constrained control, Formal verification/synthesis Abstract: Robust model predictive control (MPC) generally relies on Pontryagin difference calculations to tighten constraints. In cases where the dynamics model or disturbances are not known a priori, it is desirable to perform these constraint tightening calculations online. This paper presents a constraint tightening method based on reachability analysis of error dynamics that is tractable to implement online. Constrained zonotopes are used to bound the error dynamics using polyhedral envelopes to account for dynamic nonlinearities. To leverage a recently developed method for inner-approximating Pontryagin differences where the subtrahend is a zonotope, a novel optimization-free method for over-approximating constrained zontopes as zonotopes is presented. The method is shown to consistently produce tighter over-approximations than existing optimization-free methods. The developed online constraint tightening procedure is evaluated in a numerical example where the MPC uses a linear time-varying model.

11:15-11:30, Paper WeA20.4	Add to My Program
Viscosity CBFs: Bridging the Control Barrier Function and Hamilton-Jacobi Reachability Frameworks in Safe Control Theory (I)

Hirsch, Dylan	UC San Diego (UCSD)
Fernández Fisac, Jaime	Princeton University
Herbert, Sylvia	UC San Diego (UCSD)
Keywords: Constrained control, Optimal control, Lyapunov methods Abstract: Control barrier functions (CBFs) and Hamilton-Jacobi reachability (HJR) are central frameworks in safe control. Traditionally, these frameworks have been viewed as distinct, with the former focusing on optimally safe controller design and the latter providing sufficient conditions for safety. A previous work introduced the notion of a control barrier value function (CB-VF), which is defined similarly to the other value functions studied in HJR but has certain CBF-like properties. In this work, we proceed the other direction by generalizing CBFs to non-differentiable ``viscosity'' CBFs. We show the connection between viscosity CBFs and CB-VFs, bridging the CBF and HJR frameworks. Through this bridge, we characterize the viscosity CBFs as precisely those functions which provide CBF-like safety guarantees (control invariance and smooth approach to the boundary). We then further show nice theoretical properties of viscosity CBFs, including their desirable closure under maximum and limit operations. In the process, we also extend CB-VFs to non-exponential anti-discounting and update the corresponding theory for CB-VFs along these lines.

11:30-11:45, Paper WeA20.5	Add to My Program
Synthesizing Provably Invariant Sets Via Stochastically Sampled Data (I)

Strong, Amy	Duke University
Kashani, Ali	University of New Mexico
Danielson, Claus	University of New Mexico
Bridgeman, Leila J.	Duke University
Keywords: Constrained control, Identification, Pattern recognition and classification Abstract: Positive invariant (PI) sets are essential for ensuring safety, i.e. constraint adherence, of dynamical systems. With the increasing availability of sampled data from complex (and often unmodeled) systems, it is advantageous to leverage these data sets for PI set synthesis. This paper uses data driven geometric conditions of invariance to synthesize PI sets from data. Where previous data driven, set-based approaches to PI set synthesis used deterministic sampling schemes, this work instead synthesizes PI sets from any pre-collected data set. Beyond a data set and Lipschitz continuity, no additional information about the system is needed. A tree data structure is used to partition the space and select samples used to construct the PI set, while Lipschitz continuity is used to provide deterministic guarantees of invariance. Finally, probabilistic bounds are given on the number of samples needed for the algorithm to determine a PI set of a certain volume.

11:45-12:00, Paper WeA20.6	Add to My Program
Guaranteed Privacy-Preserving Control of Discrete-Time Systems (I)

Khajenejad, Mohammad	The University of Tulsa
Keywords: Nonlinear output feedback, Observers for nonlinear systems, Uncertain systems Abstract: We introduce a guaranteed privacy-preserving controller for nonlinear discrete-time systems with bounded uncertainties. Moving beyond stochastic differential privacy, our design offers deterministic privacy through hard bounds on the proximity of set-valued estimates. The solution involves synthesizing a stabilizing controller for a perturbed framer system, where control gains and a privacy-inducing noise factor are co-optimized via semi-definite programming. This integrated approach ensures both input-to-state stable closed-loop dynamics and certified privacy. We also formalize the inherent performance-privacy trade-off by quantifying the accuracy loss due to privacy constraints. Simulations confirm that our method outperforms standard differential privacy techniques.


WeA21 Regular Session, Churchill C1	Add to My Program
Optimization I

Chair: Kalaimani, Rachel Kalpana	Indian Institute of Technology Madras
Co-Chair: Paternain, Santiago	Rensselaer Polytechnic Institute

10:30-10:45, Paper WeA21.1	Add to My Program
Resource Allocation under Stochastic Demands Using Shrinking Horizon Optimization

Tzikas, Alexandros	Stanford University
Ure, Nazim Kemal	Stanford University
Arief, Mansur	Stanford University
Kochenderfer, Mykel	Stanford University
Boyd, Stephen	Stanford University
Keywords: Optimization, Finance, Stochastic optimal control Abstract: We consider the problem of optimally allocating a limited number of resources across time to maximize revenue under stochastic demands. This formulation is relevant in various areas of control, such as supply chain, ticket revenue maximization, healthcare operations, and energy allocation in power grids. We propose a bisection method to solve the static optimization problem and extend our approach to a shrinking horizon algorithm for the sequential problem. The shrinking horizon algorithm computes future allocations after updating the distribution of future demands by conditioning on the observed values of demand. We illustrate the method on a simple synthetic example with jointly log-normal demands, showing that it achieves performance close to a bound obtained by solving the prescient problem.

10:45-11:00, Paper WeA21.2	Add to My Program
A Hybrid Systems Model of Feedback Optimization for Linear Systems: Convergence and Robustness

Chuy, Oscar Jed	Georgia Institute of Technology
Hale, Matthew	Georgia Institute of Technology
Sanfelice, Ricardo G.	University of California at Santa Cruz
Keywords: Optimization, Hybrid systems, Linear systems Abstract: Feedback optimization algorithms compute inputs to a system using real-time output measurements, which helps mitigate the effects of disturbances. However, existing work often models both system dynamics and computations in either discrete or continuous time, which may not accurately model some applications. In this work, we model linear system dynamics in continuous time, and we model the computations of inputs in discrete time. Therefore, we present a novel hybrid systems model of feedback optimization. We first establish the well-posedness of this hybrid model and establish completeness of solutions while ruling out Zeno behavior. Then we show the state of the system converges exponentially fast to a ball of known radius about a desired goal state. Next we analytically show that this system is robust to perturbations in (i) the values of measured outputs, (ii) the matrices that model the linear time-invariant system, and (iii) the times at which inputs are applied to the system. Simulation results confirm that this approach successfully mitigates the effects of disturbances.

11:00-11:15, Paper WeA21.3	Add to My Program
Resource-Aware Greedy Sensor Scheduling with Guarantees under Sequence Supermodular Objectives

Cho, Wooyeong	University of California, Los Angeles
Bhanot, Isean	University of California, Los Angeles (UCLA)
Mehta, Ankur	University of California Los Angeles
Keywords: Optimization, Linear systems, Large-scale systems Abstract: We address resource-aware optimal sensor scheduling for linear gaussian systems over a finite horizon, aiming to minimize the total number of sensor activations while ensuring a target estimation accuracy, measured by the log determinant of the posterior error covariance. Due to the NP hard nature of the problem and the temporal coupling over the time step, prior work has mainly focused on a time-invariant approximation of optimal sensor selection while comparing with greedy scheduling. In this paper, we extend this to time varying sensor scheduling in order to approximate the total number of sensor activations over the finite time horizon. We build a sequential analysis aligned with Kalman recursion and derive a theoretical utilization bound using greedy scheduling under a sequence-supermodular objective. Our simulations across diverse system settings demonstrate that the proposed bound reliably captures sensor usage while ensuring the desired accuracy.

11:15-11:30, Paper WeA21.4	Add to My Program
Proximal Dynamic Framework for Equilibrium Problems with Prescribed-Time Guarantees

Abdel Aal, Osama Fuad	MESA Lab at UC Merced
Ozbek, Necdet Sinan	TOBB University of Economics and Technology
Viola, Jairo	University of California, Merced
Chen, YangQuan	University of California, Merced
Keywords: Optimization, Optimization algorithms, Lyapunov methods Abstract: Fixed-point dynamical systems provide a powerful framework for representing and analyzing a wide variety of problems in control, optimization, and learning. Many fundamental tasks—such as solving nonlinear equations, minimizing objective functions, enforcing constraints, and coordinating distributed agents—can be formulated as fixed-point problems. Traditional methods often rely on asymptotic convergence and may not guarantee performance within a finite horizon. In this work, we first establish a framework for analyzing prescribed-time convergence of fixed-point dynamical systems, ensuring that trajectories reach a fixed point within a user-defined and uniformly bounded time, independent of initial conditions. Building on this foundation, we extend the framework to proximal dynamics for equilibrium problems, thereby unifying prescribed-time fixed-point analysis with proximal operator-based methods. Rigorous analysis confirms prescribed-time stability under smooth assumptions, and numerical experiments illustrate the effectiveness of the proposed approach.

11:30-11:45, Paper WeA21.5	Add to My Program
Zeroth Order Gradient Descent for Low-Rank Functions

Thakur, Aman	Indian Institute of Technology Madras
Kalaimani, Rachel Kalpana	Indian Institute of Technology Madras
Keywords: Optimization, Optimization algorithms, Computational methods Abstract: In this paper we propose a Zeroth-order gradient descent algorithm for cost functions with Low-rank structure, i.e., the gradients of the cost function exists in a lower-dimensional space. Low-rank structures have emerged as a recurring phenomenon in modern machine learning, deep learning, and recommendation systems. In higher-dimensional problems with costly gradient computations, such Low-rank structures have been exploited to design first-order algorithms with reduced computational cost. We propose a Zeroth-order algorithm to cater for scenarios where the cost function is not explicitly available and only function evaluations are possible by querying a noisy oracle. The algorithm first identifies the lower dimensional subspace where the gradients exists mostly. Then in order to compute an approximate gradient the queries to the oracle are done only in the directions corresponding to this lower dimensional subspace thereby reducing the oracle complexity. We provide convergence analysis for the case of strongly convex and non-convex functions.

11:45-12:00, Paper WeA21.6	Add to My Program
Online Optimization on Hadamard Manifolds: Curvature Independent Regret Bounds on Horospherically Convex Objectives

Sahinoglu, Emre	Northeastern University
Shahrampour, Shahin	Northeastern University
Keywords: Optimization algorithms, Optimization Abstract: We study online Riemannian optimization on Hadamard manifolds under the framework of horospherical convexity (h-convexity). Prior work mostly relies on the geodesic convexity (g-convexity), often leading to regret bounds scaling poorly with the manifold curvature. To address this limitation, we analyze Riemannian online gradient descent for h-convex and strongly h-convex functions and establish O(sqrt{T}) and O(log(T)) regret guarantees, respectively. These bounds are curvature-independent and match the results in the Euclidean setting. We validate our approach with experiments on the manifold of symmetric positive-definite (SPD) matrices equipped with the affine-invariant metric. In particular, we investigate online Tyler's M-estimation and online Fréchet mean computation, showing the application of h-convexity in practice.


WeA22 Invited Session, Churchill C2	Add to My Program
Estimation and Control of Distributed Parameter Systems I

Chair: Demetriou, Michael A.	Worcester Polytechnic Institute
Co-Chair: Hu, Weiwei	University of Georgia
Organizer: Demetriou, Michael A.	Worcester Polytechnic Institute
Organizer: Hu, Weiwei	University of Georgia

10:30-10:45, Paper WeA22.1	Add to My Program
Delay Compensation of Multi-Input Distinct Delay Nonlinear Systems Via Neural Operators (I)

Bajraktari, Filip	University of Belgrade, Serbia
Bhan, Luke	University of California, San Diego
Krstic, Miroslav	University of California, San Diego
Shi, Yuanyuan	University of California San Diego
Keywords: Delay systems, Machine learning, Lyapunov methods Abstract: In this work, we present the first stability results for approximate predictors in multi-input non- linear systems with distinct actuation delays. We show that if the predictor approximation satisfies a uniform (in time) error bound, semi-global practical stability is correspondingly achieved. For such approximators, the required uniform error bound depends on the desired region of attraction and the number of control inputs in the system. The result is achieved through transforming the delay into a transport PDE and con- ducting analysis on the coupled ODE-PDE cascade. To highlight the viability of such error bounds, we demonstrate our results on a class of approximators - neural operators - showcasing sufficiency for satisfying such a universal bound both theoretically and in simulation on a mobile robot experiment.

10:45-11:00, Paper WeA22.2	Add to My Program
Reduction of Traffic-Related Pollution through Green Infrastructure-Solutions (I)

Rarita', Luigi	University of Salerno
D'Apice, Ciro	University of Salerno
Manzo, Rosanna	University of Salerno
Piccoli, Benedetto	Rutgers University - Camden
Keywords: Differential-algebraic systems, Numerical algorithms, Simulation Abstract: This paper investigates the effect of green infrastructures in mitigating air pollution in urban environments. This complex issue needs a thorough examination of traffic-related emissions and the atmospheric dispersion of chemical species. A macroscopic second-order model is employed to simulate traffic conditions, followed by a microscopic approach to estimate emission levels. Finally, partial differential equations-based atmospheric dispersion, driven by chemistry ordinary differential equations for particulate/nitrogen oxides/ozone interactions, handles vertical transport under varying conditions. Numerical results show that different configurations of green barriers yield varying pollutant concentrations in metropolitan regions.

11:00-11:15, Paper WeA22.3	Add to My Program
Equation-Free Coarse Control of Distributed Parameter Systems Via Local Neural Operators (I)

Fabiani, Gianluca	Johns Hopkins University
Siettos, Constantinos	Università Degli Studi Di Napoli Federico II
Kevrekidis, Ioannis G.	Johns Hopkins University
Keywords: Distributed parameter systems, Machine learning, Computer-aided control design Abstract: The control of high-dimensional distributed parameter systems (DPS) remains a challenge when explicit coarse-grained equations are unavailable. Classical equation-free (EF) approaches rely on fine-scale simulators treated as black-box timesteppers. However, repeated simulations for steady-state computation, linearization, and control design are often computationally prohibitive, or the microscopic timestepper may not even be available, leaving us with data as the only resource. We propose a data-driven alternative that uses local neural operators, trained on spatiotemporal microscopic/mesoscopic data, to obtain efficient short-time solution operators. These surrogates are employed within Krylov subspace methods to compute coarse steady and unsteady-states, while also providing Jacobian information in a matrix-free manner. Krylov–Arnoldi iterations then approximate the dominant eigenspectrum, yielding reduced models that capture the open-loop slow dynamics without explicit Jacobian assembly. Both discrete-time Linear Quadratic Regulator (dLQR) and pole-placement (PP) controllers are based on this reduced system and lifted back to the full nonlinear dynamics, thereby closing the feedback loop. The framework is validated by stabilizing an unstable steady-state of the Liouville–Bratu PDE, demonstrating consistent performance between the learned surrogate and the true system, with quantified degradation under plant-model mismatch.

11:15-11:30, Paper WeA22.4	Add to My Program
Covariate-Based Localized Conditioning of Random Optimal Parameters in an Abstract Parabolic Model for the Transdermal Transport of Ethanol and a Transdermal Alcohol Biosensor (I)

Tan, Qinyang	University of Southern California
Chen, Hongyi	University of Southern California
Luczak, Susan	University of Southern California
Rosen, I. Gary	University of Southern California
Keywords: Distributed parameter systems, Identification, Uncertain systems Abstract: We study an inverse problem arising in the estimation of breath alcohol concentration (BrAC) from transdermal alcohol concentration (TAC) measured by wearable biosensors. The forward model is a parabolic PDE–ODE system describing ethanol diffusion through the skin, with random parameters reflecting inter-individual and environmental variability. To reduce uncertainty in BrAC estimation, we develop a framework that conditions the distribution of skin parameters on subject-specific covariates by exploiting the dependence of BrAC model parameters on those covariates. The key analytical tools are Banach space versions of the Implicit Function Theorem and sensitivity results expressed in terms of Frechet derivatives of parameter dependent semigroups. A central contribution is the rigorous proof of convergence of second derivatives under Galerkin approximation, ensuring that sensitivity and uncertainty quantification computed in finite dimensions faithfully approximate the infinite-dimensional setting. Together, these results establish a mathematically rigorous basis for population calibration and real-time BrAC reconstruction from TAC, with applications to clinical monitoring, addiction treatment, and digital health.

11:30-11:45, Paper WeA22.5	Add to My Program
Optimizing Pricing and Performance of Sensor Network for Optimal Estimation of Parabolic PDEs (I)

Demetriou, Michael A.	Worcester Polytechnic Institute
Keywords: Distributed parameter systems, Kalman filtering Abstract: This paper introduces economic aspects in the state estimator design for a class of infinite dimensional systems. At one end a spatially distributed sensor providing the best possible filter performance and associated with a highly reliable sensor is identified as a very expensive choice. At the other end, a network of inexpensive sensors with pointwise spatial distribution and characterized by reduced reliability are utilized as an alternative to the expensive sensor. Using a modification to centroidal Voronoi Tessellations to arrive at a network of pointwise sensors that collectively approximate the spatially varying expensive sensor, both the total price and the associated filter performance are examined to provide a joint economic and filter performance metric for sensor design in spatially distributed systems.

11:45-12:00, Paper WeA22.6	Add to My Program
Grad-Div Stabilized Mixed Finite Element Method for Control of Incompressible Flows (I)

Ravindran, S.S.	University of Alabama in Huntsville
Keywords: Control applications, Computational methods, Fluid flow systems Abstract: This paper studies inf-sup stable finite element discretizations for control of viscous incompressible flows with a grad-div stabilization. The proposed grad-div stabilization method augments the mixed Galerkin finite element with a term enhancing mass conservation and improving the performance of the algorithm. We analyse a fully discrete optimality system with grad-div stabilized finite element method. An optimal order error estimate of the approximations of the optimality system is derived. We formulate and solve computationally a control problem that involves velocity matching in a closed cavity. Numerical experiments show the feasibility and applicability of the grad-div stabilized finite element method for control of incompressible flows.


WeB03 Tutorial Session, Grand Salon 3	Add to My Program
Bridging Control with Neural Network Verifier alpha, Beta-CROWN

Chair: Hu, Bin	University of Illinois at Urbana-Champaign
Co-Chair: Zhang, Huan	UIUC
Organizer: Li, Haoyu	University of Illinois, Urbana-Champaign
Organizer: Zhong, Xiangru	University of Illinois Urbana-Champaign
Organizer: Cheng, Hao	University of Illinois Urbana-Champaign
Organizer: Hu, Bin	University of Illinois at Urbana-Champaign
Organizer: Zhang, Huan	UIUC

13:30-15:00, Paper WeB03.1	Add to My Program
Bridging Control with Neural Network Verifier Alpha-Beta-CROWN: A Tutorial (I)

Li, Haoyu	University of Illinois, Urbana-Champaign
Zhong, Xiangru	University of Illinois Urbana-Champaign
Cheng, Hao	University of Illinois Urbana-Champaign
Hu, Bin	University of Illinois at Urbana-Champaign
Zhang, Huan	UIUC
Keywords: Neural networks, Machine learning, Reinforcement learning Abstract: Learning-based methods for synthesizing controllers have gained popularity due to their high expressiveness and strong empirical performance. However, in safety-critical scenarios such as autonomous driving, robotics, and power systems, empirical performance alone is insufficient, and formal verification of controller properties such as stability and safety is highly desirable. Unfortunately, many prior verification approaches are either tied to specific structural assumptions on the system or certificate, making them difficult to transfer across settings, or suffer from poor scalability on higher-dimensional neural network systems. In this tutorial, we present a unified framework that aims at mitigating this gap via bridging control with the state-of-the-art neural network verifier α,β-CROWN (alpha-beta-CROWN). At its core, α,β-CROWN is a general-purpose bounding engine for general computation graphs: given an input domain, it can automatically produce certified bounds and explicit linear relaxation of the objective function. These certified bounds are useful on their own for tasks such as reachability analysis and local linear approximation, and they also provide the foundation for more complex routines that perform verification, satisfiability checking, and optimization. More specifically, many control problems reduce to verifying real-valued inequalities over a state domain (e.g., Lyapunov theory and barrier functions). Consequently, α,β-CROWN enables scalable verification of such conditions by computing tight bounds on these conditions and recursively partitioning and pruning subdomains based on the bounds. Thanks to GPU parallelization, this pipeline demonstrates superior scalability on verification and optimization problems that are challenging for traditional approaches. In this tutorial, we discuss the basics of α,β-CROWN, and introduce its application to various control-related tasks. Finally, we will discuss training frameworks for co-synthesizing controllers and corresponding verifier-friendly certificates.


WeB04 Regular Session, Grand Salon 4	Add to My Program
Aerospace II

Chair: Nguyen, Tam W.	Kyoto University
Co-Chair: Wan, Yan	University of Texas at Arlington

13:30-13:45, Paper WeB04.1	Add to My Program
Reliable Air-To-Air Communication for UAVs Using Directional Antennas

Zheng, Xinquan	San Diego State University
Zhang, Haomeng	San Diego State University
Xie, Junfei	San Diego State University
Chang, Jiajian	University of Texas at Arlington
Wan, Yan	University of Texas at Arlington
Keywords: Communication networks, Flight control, Stability of nonlinear systems Abstract: In unmanned aerial vehicle (UAV) air-to-air (A2A) communication with rigidly mounted directional antennas, flight-induced attitude variations can cause beam deflections, leading to degraded link quality or even complete signal loss. Existing studies typically assume level flight, thereby overlooking this critical effect. In this paper, we formulate communication reliability as a yaw alignment and pitch-constrained control problem, where yaw regulation ensures beam pointing and pitch is restricted within the half-power beamwidth (HPBW) to preserve link reliability. To address this formulation, we design a time-varying gain command-filtered backstepping controller that achieves accurate trajectory tracking while rigorously enforcing attitude constraints. Theoretical analysis shows that the closed-loop signals converge within a prescribed time and ultimately decay to zero, thus avoiding large transient responses. Simulation results demonstrate that the proposed method sustains stable and reliable communication under representative flight scenarios.

13:45-14:00, Paper WeB04.2	Add to My Program
Flight Control of a Fixed-Wing Aircraft with Angle-Of-Attack Sensor Failure

Richards, Riley J.	University of Michigan
Vander Schaaf, Jacob	University of Michigan
Islam, Syed Aseem Ul	University of Michigan
Bernstein, Dennis S.	Univ. of Michigan
Keywords: Flight control, Robust adaptive control, Fault tolerant systems Abstract: Autopilots for fixed-wing aircraft typically use a combination of inertial sensors, which include accelerometers and rate gyros, and non-inertial sensors, which include GPS and air-data sensors. By measuring airspeed, angle of attack, and sideslip angle, air-data sensors determine the magnitude and direction of the aircraft velocity vector in the aircraft frame. Since lift and drag are aerodynamic effects, air-data sensors are essential for estimating the aerodynamic forces and moments applied to the vehicle. Unfortunately, air-data sensors are susceptible to failure, with potentially catastrophic consequences. For a fixed-wing flight control system based on adaptive model predictive control, this paper assesses the performance degradation due to the failure of an angle-of-attack sensor.

14:00-14:15, Paper WeB04.3	Add to My Program
A RRT+B-Spline Planner Guaranteeing Safe Obstacle Navigation for UAVs

Anderson, Dean B.	Brigham Young University
Beard, Randal W.	Brigham Young Univ
Keywords: Flight control, Robotics Abstract: This paper addresses the problem of obstacle nav- igation and path planning for UAVs. This is accomplished by finding a path from start to end points through an obstacle field. To address this problem, a modified Rapidly Exploring Random Trees (RRT) algorithm is used to randomly create a tree of rectangular Safe Flight Corridors (SFCs). When a valid series of SFCs has been found from the start to the goal, a flight path that stays within that series is generated using a Basis Spline (B-Spline), which provides continuous position, velocity, and acceleration control. The path is generated by adjusting the B-Spline control points to minimize the path length objective function with additional constraints on the curvature of the B- Spline. Simulation results demonstrate the effectiveness of this algorithm for UAV path planning.

14:15-14:30, Paper WeB04.4	Add to My Program
A Quaternion-Based Attitude Tracking Control of UAVs Using Model Predictive Control and Prescribed Performance Approaches

Mohammadzamani, Fatemeh	Carleton University
Shen, Chao	Carleton University
Hashim, Hashim A	Carleton University
Keywords: Flight control, Predictive control for nonlinear systems, Robotics Abstract: This paper presents an attitude tracking control method for unmanned aerial vehicles (UAVs) using a unit-quaternion representation. Initially, a prescribed performance controller (PPC) is developed to ensure closed-loop stability, demonstrating that tracking errors are exponentially converging within predefined bounds. Subsequently, the control performance is enhanced through the application of the Lyapunov-based model predictive control (LMPC). Compared with existing methods, the combination of the LMPC and PPC (i) optimizes the tracking performance, (ii) accounts for design constraints, and (iii) addresses the nonlinearities of the system's dynamics. The recursive feasibility of LMPC is rigorously proved, and the asymptotic stability of the closed-loop system can be shown using Lyapunov analysis. Finally, the applicability and validity of the theoretical results are demonstrated in simulations. To highlight the performance of the LMPC controller, it compares to two baseline controllers, PPC and an output feedback controller (OFC) that is widely studied for UAVs. The simulation results demonstrate the advantages of LMPC in attitude tracking applications. In particular, the combination of LMPC and PPC reduces the transient tracking error by 60.0% compared to OFC and 8.2% compared to PPC and improves the steady-state error by 92.0% compared to OFC and 46.4% compared to PPC.

14:30-14:45, Paper WeB04.5	Add to My Program
Fast RLS Identification Leveraging the Linearized System Sparsity: Predictive Cost Adaptive Control for Quadrotors

Nguyen, Tam W.	Kyoto University
Keywords: Flight control, Indirect adaptive control, Optimal control Abstract: This paper presents a centralized predictive cost adaptive control (PCAC) strategy for the position and attitude control of quadrotors. PCAC is an optimal, prediction-based control method that uses recursive least squares (RLS) to identify model parameters online, enabling adaptability in dynamic environments. Addressing challenges with black-box approaches in systems with complex couplings and fast dynamics, this study leverages the unique sparsity of quadrotor models linearized around hover points. By identifying only essential parameters related to nonlinear couplings and dynamics, this approach reduces the number of parameters to estimate, accelerates identification, and enhances stability during transients. Furthermore, the proposed control scheme removes the need for an attitude setpoint, typically required in conventional cascaded control designs.

14:45-15:00, Paper WeB04.6	Add to My Program
Coordinated UAV Beamforming and Control for Directional Jamming and Nulling

Fotiadis, Filippos	The University of Texas at Austin
Sadler, Brian M.	The University of Texas at Austin
Topcu, Ufuk	The University of Texas at Austin
Keywords: Control over communications, Communication networks, Optimal control Abstract: Efficient mobile jamming against eavesdroppers in wireless networks necessitates accurate coordination between mobility and antenna beamforming. We study the coordinated beamforming and control problem for a UAV that carries two omnidirectional antennas, and which uses them to jam an eavesdropper while leaving a friendly client unaffected. The UAV can shape its jamming beampattern by controlling its position, its antennas' orientation, and the relative phasing for each antenna. We derive a closed-form expression for the antennas' phases that guarantees zero jamming impact on the client. In addition, we determine the antennas’ orientation and the UAV’s position that maximizes jamming impact on the eavesdropper through an optimal control problem, optimizing the orientation pointwise and the position through the UAV’s control input. Simulations show how this coordinated beamforming and control scheme enables directional GPS denial while guaranteeing zero interference towards a friendly direction.


WeB05 Tutorial Session, Grand Salon 6	Add to My Program
Redefining End-Of-Life: Intelligent Automation for Electronics Remanufacturing

Chair: Tian, Sibo	Texas A&M University
Co-Chair: Zheng, Minghui	Texas A&M University
Organizer: Zheng, Minghui	Texas A&M University
Organizer: Behdad, Sara	University of Florida

13:30-15:00, Paper WeB05.1	Add to My Program
Redefining End-Of-Life: Intelligent Automation for Electronics Remanufacturing Systems (I)

Tian, Sibo	Texas A&M University
Liang, Xiao	Texas A&M University
Behdad, Sara	University of Florida
Zheng, Minghui	Texas A&M University
Keywords: Manufacturing systems, Robotics, Machine learning Abstract: Remanufacturing is fundamentally more challenging than traditional manufacturing due to the significant uncertainty, variability, and incompleteness inherent in end-of-life (EoL) products. At the same time, it has become increasingly essential and urgent for facilitating a circular economy, driven by the growing volume of discarded electronic products and the escalating scarcity of critical materials. In this paper, we review the existing literature and examine the key challenges as well as emerging opportunities in intelligent automation for EoL electronics remanufacturing, providing a comprehensive overview of how robotics, control, and artificial intelligence (AI) can jointly enable scalable, safe, and intelligent remanufacturing systems. This paper starts with the definition, scope, and motivation of remanufacturing within the context of a circular economy, highlighting its societal and environmental significance. Then it delves into intelligent automation approaches for disassembly, inspection, sorting, and component reprocessing in this domain, covering advanced methods for multimodal perception, decision-making under uncertainty, flexible planning algorithms, and force-aware manipulation. The paper further reviews several emerging techniques, including large foundation models, human-in-the-loop integration, and digital twins that have the potential to support future research in this area. By integrating these topics, we aim to illustrate how next-generation remanufacturing systems can achieve robust, adaptable, and efficient operation in the face of complex real-world challenges.


WeB06 Regular Session, Grand Salon 7	Add to My Program
Game Theory II

Chair: Akyol, Emrah	SUNY Binghamton
Co-Chair: Nazari, Shima	UC Davis

13:30-13:45, Paper WeB06.1	Add to My Program
Bayesian Holonic Systems: Equilibrium, Uniqueness, and Computation

Pan, Yunian	New York University
Zhu, Quanyan	New York University
Keywords: Game theory, Networked control systems, Stochastic systems Abstract: This paper addresses the challenge of modeling and control in hierarchical, multi-agent systems, known as holonic systems, where local agent decisions are coupled with global systemic outcomes. We introduce the Bayesian Holonic Equilibrium (BHE), a concept that ensures consistency between agent-level rationality and system-wide emergent behavior. We establish the theoretical soundness of the BHE by showing its existence and, under stronger regularity conditions, its uniqueness. We propose a two-time scale learning algorithm to compute such an equilibrium. This algorithm mirrors the system's structure, with a fast timescale for intra-holon strategy coordination and a slow timescale for inter-holon belief adaptation about external risks. The convergence of the algorithm to the theoretical equilibrium is validated through a numerical experiment on a continuous public good game. This work provides a complete theoretical and algorithmic framework for the principled design and analysis of strategic risk in complex, coupled control systems.

13:45-14:00, Paper WeB06.2	Add to My Program
A Generalized Potential Game Approach of UAV Swarm Coordination for Hidden Target Localization

Guckert, Mathis	Inria Centre at the University of Lille
Le Cadre, Helene	Inria Centre at the University of Lille
Le Hénaff, Jean	École Nationale Des Ponts Et Chaussées
Keywords: Game theory, Optimization, Emerging control applications Abstract: Considering a swarm of Unmanned Aerial Vehicles (UAVs) carrying sensors with nondeterministic detection and noisy localization measurements, while sharing observations with neighboring UAVs, we address the problem of localization of a hidden target in continuous space and discrete time. The goal is to coordinate UAVs to maximize the information gathered while minimizing their individual energy costs. We formulate the problem as a time-varying non-cooperative game with coupling constraints. We show that the target is localized in finite time with probability one and that the game has a generalized potential structure. Further, we provide an exact best-response algorithm for UAVs to iteratively compute their trajectories. Finally, we numerically compare the potential game to the team-based approach, demonstrating comparable performance under different communication graph structures and assessing the impact of the swarm size on various metrics.

14:00-14:15, Paper WeB06.3	Add to My Program
Optimal Modified Feedback Strategies in LQ Games under Control Imperfections

Rabbani, Mahdis	University of California, Davis
Mojahed Baghbadorani, Navid	UC Davis
Nazari, Shima	UC Davis
Keywords: Game theory, Optimal control, Adaptive control Abstract: Game-theoretic approaches and Nash equilibrium have been widely applied across various engineering domains. However, practical challenges such as disturbances, delays, and actuator limitations can hinder the precise execution of Nash equilibrium strategies. This work investigates the impact of such implementation imperfections on game trajectories and players' costs in the context of a two-player finite-horizon linear quadratic (LQ) nonzero-sum game. Specifically, we analyze how small deviations by one player, measured or estimated at each stage affect the state trajectory and the other player’s cost. To mitigate these effects, we construct a compensation law for the influenced player by augmenting the nominal game with the measurable deviation dynamics. The resulting policy is shown to be optimal within a causal affine policy class, and, for sufficiently small deviations, it locally outperforms the uncompensated equilibrium-derived feedback. Rigorous analysis and proofs are provided, and the effectiveness of the proposed approach is demonstrated through a representative numerical example.

14:15-14:30, Paper WeB06.4	Add to My Program
Game-Theory-Assisted Reinforcement Learning for Border Defense: Early Termination Based on Analytical Solutions

Das, Goutam	Purdue University
Dorothy, Michael	US Army Research Laboratory
Volle, Kyle	University of Florida
Shishika, Daigo	George Mason University
Keywords: Game theory, Reinforcement learning, Cooperative control Abstract: Game theory provides the gold standard for analyzing adversarial engagements, offering strong optimality guarantees. However, these guarantees often become brittle when assumptions such as perfect information are violated. Reinforcement learning (RL), by contrast, is adaptive but can be sample-inefficient in large, complex domains. This paper introduces a hybrid approach that leverages game-theoretic insights to improve RL training efficiency. We study a border defense game with limited perceptual range, where defender performance depends on both search and pursuit strategies, making classical differential game solutions inapplicable. Our method employs the Apollonius Circle (AC) to compute equilibrium in the post-detection phase, enabling early termination of RL episodes without learning pursuit dynamics. This allows RL to concentrate on learning search strategies while guaranteeing optimal continuation after detection. Across single- and multi-defender settings, this early termination method yields 10–20% higher rewards, faster convergence, and more efficient search trajectories.

14:30-14:45, Paper WeB06.5	Add to My Program
Multi-Agent Guided Policy Search for Non-Cooperative Dynamic Games

Li, Jingqi	University of Texas at Austin
Qu, Gechen	University of California, Berkeley
Choi, Jason J.	UCLA
Sojoudi, Somayeh	UC Berkeley
Tomlin, Claire J.	UC Berkeley
Keywords: Game theory, Reinforcement learning, Iterative learning control Abstract: Multi-agent reinforcement learning (MARL) optimizes strategic interactions in non-cooperative dynamic games, where agents have misaligned objectives. However, data-driven methods such as multi-agent policy gradients (MA-PG) often suffer from instability and limit-cycle behaviors. Prior stabilization techniques typically rely on entropy-based exploration, which slows learning and increases variance. We propose a model-based approach that incorporates approximate priors into the reward function as regularization. In linear quadratic (LQ) games, we prove that such priors stabilize policy gradients and guarantee local exponential convergence to an approximate Nash equilibrium. We then extend this idea to infinite-horizon nonlinear games by introducing Multi-agent Guided Policy Search (MA-GPS), which constructs short-horizon local LQ approximations from trajectories of current policies to guide training. Experiments on nonlinear vehicle platooning and a six-player strategic basketball formation show that MA-GPS achieves faster convergence and more stable learning than existing MARL methods.

14:45-15:00, Paper WeB06.6	Add to My Program
Revisiting Dynamic Pricing Games: Nash Equilibrium under Fairness

Anand, Anju	Binghamton University
Roberts, Cassidy	Binghamton University
Akyol, Emrah	SUNY Binghamton
Keywords: Game theory, Smart grid, Optimization Abstract: This paper studies the Nash equilibrium of a timeof- use electricity pricing game. Existing formulations in this literature are typically solved as Stackelberg problems, with the company posting prices and the user responding by adjusting load. In our prior work, we studied both Stackelberg orders of play and introduced a fairness penalty to prevent opportunistic price increases when the company acts as the follower. Here we formulate the simultaneous-move game directly, establish existence and uniqueness of the Nash equilibrium under the adopted model, derive the players’ best responses, and compare the resulting equilibrium numerically with the two Stackelberg benchmarks.


WeB07 Invited Session, Grand Salon 9	Add to My Program
Safety-Critical Control of Cyber-Physical Systems

Chair: Pare, Philip E.	Purdue University
Co-Chair: Butler, Brooks A.	University of California, Irvine
Organizer: Samadi, Saba	Purdue University
Organizer: Butler, Brooks A.	University of California, Irvine
Organizer: Pare, Philip E.	Purdue University

13:30-13:45, Paper WeB07.1	Add to My Program
Safe Event-Triggered Learning for Sampled-Data Systems (I)

Begzadić, Azra	University of California, San Diego
Lederer, Armin	National University of Singapore
Hirche, Sandra	Technische Universität München
Herbert, Sylvia	UC San Diego (UCSD)
Cortes, Jorge	UC San Diego
Keywords: Uncertain systems, Sampled-data control, Machine learning Abstract: Control Barrier Functions provide a principled framework for ensuring safety, yet their reliance on accurate system models is a critical limitation for practical deployment. Moreover, the majority of theory relies on continuous-time guarantees that may not hold for discrete controllers. To address these challenges, we employ Gaussian processes to learn the components of the CBF safety condition, which depend on the unknown system dynamics, directly from data. Since controllers typically need to be implemented in a sampled-data fashion, we develop robust CBF conditions that account for inter-sampling effects to ensure the safety of unknown control-affine systems. The Gaussian Process model is updated online under a bimodal control framework that switches between control-focused and learning-focused phases, and its uncertainty is leveraged to guarantee satisfaction of the CBF conditions with high probability. The effectiveness of the proposed framework is demonstrated in simulations.

13:45-14:00, Paper WeB07.2	Add to My Program
Data-Driven Robust Control under Input-Output Stealthy Attacks (I)

Bhowmik, Arijit	Michigan State University
Bopardikar, Shaunak D.	Michigan State University
Hespanha, Joao P.	Univ. of California, Santa Barbara
Keywords: Data driven control, Robust control, Game theory Abstract: We consider the problem of robust control of an unknown but minimal linear time-invariant system under input and output disturbances and adversarial manipulation. An attacker can (i) corrupt sensor measurements (deception attacks) and (ii) perturb the control channel (actuation attacks). To address the lack of model knowledge, we adapt the Data-enabled Predictive Control (DeePC) framework, which constructs predictors directly from input–output data with bounded disturbance. We formulate a finite-horizon open-loop control problem as a two-player zero-sum game with asymmetric information: the defender selects control inputs based on measured data and only knows an upper bound on disturbances, whereas the attacker has access to the true disturbance realization and can remain stealthy by hiding within this uncertainty set. The main contributions are (i) sufficient conditions for the existence of a Nash equilibrium corresponding to saddle-point policies for this game, and (ii) an analysis of the defender’s security strategy against deception and actuation attacks. Simulation studies on finite-horizon control demonstrate the effectiveness of the proposed approach.

14:00-14:15, Paper WeB07.3	Add to My Program
Partial Resilient Leader-Follower Consensus in Time-Varying Graphs (I)

Lee, Haejoon	University of Michigan
Panagou, Dimitra	University of Michigan
Keywords: Network analysis and control, Control of networks, Fault tolerant systems Abstract: This work studies resilient leader-follower consensus with a bounded number of adversaries. Existing approaches typically require robustness conditions of the entire network to guarantee resilient consensus. However, the behavior of such systems when these conditions are not fully met remains unexplored. To address this gap, we introduce the notion of partial leader-follower consensus, in which a subset of non-adversarial followers successfully tracks the leader’s reference state despite insufficient robustness. We propose a novel distributed algorithm - the Bootstrap Percolation and Mean Subsequence Reduced (BP-MSR) algorithm - and establish sufficient conditions for individual followers to achieve consensus via the BP-MSR algorithm in arbitrary time-varying graphs. We validate our findings through simulations, demonstrating that our method guarantees partial leader-follower consensus, even when standard resilient consensus algorithms fail.

14:15-14:30, Paper WeB07.4	Add to My Program
Collaborative Altruistic Safety in Coupled Multi-Agent Systems (I)

Butler, Brooks A.	University of California, Irvine
Tan, Xiao	Beihang University
Ames, Aaron D.	California Institute of Technology
Egerstedt, Magnus	University of North Carolina, Chapel Hill
Keywords: Cooperative control, Networked control systems, Agents-based systems Abstract: This paper presents a novel framework for ensuring safety in dynamically coupled multi-agent systems through collaborative control. Drawing inspiration from ecological models of altruism, we develop collaborative control barrier functions that allow agents to cooperatively enforce individual safety constraints under coupling dynamics. We introduce an altruistic safety condition based on the so-called Hamilton’s rule, enabling agents to trade off their own safety to support higher-priority neighbors. By incorporating these conditions into a distributed optimization framework, we demonstrate increased feasibility and robustness in maintaining system-wide safety. The effectiveness of the proposed approach is illustrated through simulation in a simplified formation control scenario.

14:30-14:45, Paper WeB07.5	Add to My Program
Robust Safety-Critical Control of Networked SIR Dynamics (I)

Samadi, Saba	Purdue University
Butler, Brooks A.	University of California, Irvine
Pare, Philip E.	Purdue University
Keywords: Networked control systems, Network analysis and control, Control applications Abstract: We develop a robust safety-critical control framework for networked susceptible–infected–recovered (SIR) dynamics using control barrier functions (CBFs). Each node applies a local intervention to keep infections below a prescribed threshold despite coupling with neighbors and uncertainty in epidemic parameters and measurements. We derive a closed-form nominal CBF controller and extend it to robust CBFs via compensation terms. We study two uncertainty models: a constant bounded disturbance and a low-prevalence–amplified model that increases conservatism when infections are small. Simulations on a small network show nominal safety under low uncertainty and formal robustness under higher uncertainty, with a clear safety–effort trade-off.

14:45-15:00, Paper WeB07.6	Add to My Program
Safe Bayesian Optimization across Noise Models Via Scenario Programming (I)

Tokmak, Abdullah	Aalto University
Schön, Thomas (Bo)	Uppsala University
Baumann, Dominik	Aalto University
Keywords: Statistical learning, Optimization algorithms, Reinforcement learning Abstract: Safe Bayesian optimization (BO) with Gaussian processes is an effective tool for tuning control policies in safety-critical real-world systems, specifically due to its sample efficiency and safety guarantees. However, most safe BO algorithms assume homoscedastic sub-Gaussian measurement noise, an assumption that does not hold in many relevant applications. In this article, we propose a straightforward yet rigorous approach for safe BO across noise models, including homoscedastic sub-Gaussian and heteroscedastic heavy-tailed distributions. We provide a high-probability bound on the measurement noise via the scenario approach, integrate these bounds into high probability confidence intervals, and prove safety and optimality for our proposed safe BO algorithm. We deploy our algorithm in synthetic examples and in tuning a controller for the Franka Emika manipulator in simulation.


WeB08 Invited Session, Grand Salon 10-13	Add to My Program
Multi-Agent Control and Coordination I

Chair: Von Moll, Alexander	Air Force Research Laboratory
Co-Chair: Weintraub, Isaac	Air Force Research Laboratory
Organizer: Weintraub, Isaac	Air Force Research Laboratory
Organizer: Von Moll, Alexander	Air Force Research Laboratory
Organizer: Sinha, Abhinav	The University of Cincinnati

13:30-13:45, Paper WeB08.1	Add to My Program
Trajectory Encryption Cooperative Salvo Guidance (I)

Gopikannan, Lohitvel	Indian Institute of Technology Bombay
Kumar, Shashi Ranjan	Indian Institute of Technology Bombay
Sinha, Abhinav	The University of Cincinnati
Keywords: Aerospace, Multivehicle systems, Cooperative control Abstract: This paper introduces the concept of trajectory encryption in cooperative simultaneous target interception, wherein heterogeneity in guidance principles across a team of unmanned autonomous systems is leveraged as a strategic design feature. By employing a mix of heterogeneous time-to-go formulations leading to a cooperative guidance strategy, the swarm of vehicles is able to generate diverse trajectory families. This diversity expands the feasible solution space for simultaneous target interception, enhances robustness under disturbances, and enables flexible time-to-go adjustments without predictable detouring. From an adversarial perspective, heterogeneity obscures the collective interception intent by preventing straightforward prediction of swarm dynamics, effectively acting as an encryption layer in the trajectory domain. Simulations demonstrate that the swarm of heterogeneous vehicles is able to intercept a moving target simultaneously from a diverse set of initial engagement configurations.

13:45-14:00, Paper WeB08.2	Add to My Program
On Robustness of Consensus Over Pseudo-Undirected Path Graphs (I)

Sinha, Abhinav	The University of Cincinnati
Mukherjee, Dwaipayan	Indian Institute of Technology Bombay
Kumar, Shashi Ranjan	Indian Institute of Technology Bombay
Keywords: Aerospace, Cooperative control Abstract: Consensus over networked agents is typically studied using undirected or directed communication graphs. Undirected graphs enforce symmetry in information exchange, leading to convergence to the average of initial states, while directed graphs permit asymmetry but make consensus dependent on root nodes and their influence. Both paradigms impose inherent restrictions on achievable consensus values and network robustness. This paper introduces a theoretical framework for achieving consensus over a class of network topologies, termed pseudo-undirected graphs, which retains bidirectional connectivity between node pairs but allows the corresponding edge weights to differ, including the possibility of negative values under bounded conditions. The resulting Laplacian is generally non-symmetric, yet it guarantees consensus under connectivity assumptions, to expand the solution space, which enables the system to achieve a stable consensus value that can lie outside the convex hull of the initial state set. We derive admissibility bounds for negative weights for a pseudo-undirected path graph, and show an application in the simultaneous interception of a moving target.

14:00-14:15, Paper WeB08.3	Add to My Program
Nonlinear Guidance for a Pursuer-Turret Team for Maneuvering Target Engagement (I)

Bajpai, Shivam	University of Cincinnati
Sinha, Abhinav	The University of Cincinnati
Weintraub, Isaac	Air Force Research Laboratory
Sharma, Rajnikant	IS4S
Casbeer, David W.	Air Force Research Laboratory
Keywords: Aerospace, Control applications Abstract: This paper addresses the problem of engaging a maneuvering target using a mobile pursuer equipped with a rotating turret. Unlike conventional point-capture formulations, the objective is to maintain a prescribed standoff distance between the pursuer and the target, while ensuring that the turret’s heading aligns with the instantaneous line of sight (LOS) with respect to the mobile target. The pursuer is modeled as a non-holonomic vehicle with lateral acceleration capability, whereas the turret can have free angular maneuverability. We develop a nonlinear guidance strategy that coordinates the pursuer’s motion with the turret’s rotation to neutralize the target. The proposed method explicitly accounts for the nonlinear coupled kinematics of the pursuer-turret team, ensuring stability and convergence without linearization. Simulation results demonstrate the effectiveness of the guidance strategy against a general maneuvering target.

14:15-14:30, Paper WeB08.4	Add to My Program
Adaptive Event-Triggered Policy Gradient for Multi-Agent Reinforcement Learning (I)

Umer, Muhammad	The University of Texas at San Antonio
Sinha, Abhinav	The University of Cincinnati
Cao, Yongcan	University of Texas, San Antonio
Keywords: Reinforcement learning, Networked control systems Abstract: Conventional multi-agent reinforcement learning (MARL) methods rely on time-triggered execution, where agents sample and communicate actions at fixed intervals. This approach is often computationally expensive and communication-intensive. To address this limitation, we propose ET-MAPG (Event-Triggered Multi-Agent Policy Gradient reinforcement learning)-- a framework that jointly learns an agent's control policy and its event-triggering policy. Unlike prior work that decouples these mechanisms, ET-MAPG integrates them into a unified learning process, enabling agents to learn what action to take and when to execute it. For scenarios with inter-agent communication, we introduce AET-MAPG, an attention-based variant that leverages a self-attention mechanism to learn selective communication patterns. AET-MAPG empowers agents to determine when to trigger an action, and also with whom to communicate and what information to exchange, thereby optimizing coordination. Both methods can be integrated with any policy gradient MARL algorithm. Extensive experiments across diverse MARL benchmarks demonstrate that our approaches achieve performance comparable to state-of-the-art, time-triggered baselines while significantly reducing both computational load and communication overhead.

14:30-14:45, Paper WeB08.5	Add to My Program
Cooperative Relay-Assisted Traveling Salesman Problem Using Graphs of Convex Sets (I)

Manyam, Satyanarayana Gupta	DCS Corporation (Air Force Research Labs Cont.)
Casbeer, David W.	Air Force Research Laboratory
Weintraub, Isaac	Air Force Research Laboratory
Keywords: Optimization, Optimization algorithms, Autonomous systems Abstract: A cooperative routing problem involving two vehicles, a sensing agent and a relay agent, is considered. The sensing agent must visit a given set of targets to collect data and transmit it to a ground station. Hardware limitations restrict the transmission range, requiring the second agent to act as a relay between the sensing agent and the ground station. The collected data is transmitted through a two-hop network: from the sensing agent to the relay agent, and then to the ground station. Given a set of target locations, the routing problem aims to determine the optimal sequence of targets to be visited by the sensing agent, along with the corresponding positions of the relay agent, to minimize the combined travel cost of both vehicles. This problem is formulated as a mixed-integer convex program utilizing the framework of graphs of convex sets. The formulation avoids ``big-M" constraints, resulting in a tighter convex relaxation. Furthermore, this approach is shown to be applicable to the neighborhood traveling salesman problem and can be extended to a three-agent scenario with two relay agents. Computational experiments are conducted to evaluate the proposed formulations and validate the tightness of the lower bounds obtained via convex relaxation.

14:45-15:00, Paper WeB08.6	Add to My Program
Tactical Asset Allocation Game (TAAG) with Sensor Performance, Deception, and Differential Subgames

Judy, Rachael L.	University of Cincinnati
Fuchs, Zachariah E.	University of Cincinnati
Keywords: Game theory, Optimal control, Information theory and control Abstract: We examine a Tactical Asset Allocation Game (TAAG) in which two opposing players attempt to place offensive and defensive assets to optimize their respective expected value upon initiation of an engagement. The Defender player selects an initial Target position to maximize the expected distance between the Target and a Penetrator agent. The Defender also positions an Interceptor to intercept the Penetrator agent and, incurring a corresponding cost, selects an obfuscation tactic for the sensor network. Based on the sensor data, the Attacker then selects an incursion point for the Penetrator with the objective of minimizing that same expected distance between Target and Penetrator. The values of the possible engagements emerging from the placement strategies are determined using a variation of the Target-Penetrator-Interceptor (TPI) game. The TAAG is then resolved by utilizing a linear program based on the matrix game established by combinations of different allocation and deception opportunities and further demonstrates the potential of differential games to be utilized in solving more complex problems in control and game theory.


WeB09 Invited Session, Grand Salon 12	Add to My Program
Control and Design for Energy Storage Systems

Chair: Ouedraogo, Asmaou Sougri-Nooma	Texas Tech University
Co-Chair: Zhang, Dong	University of Oklahoma
Organizer: Ouedraogo, Asmaou Sougri-Nooma	Texas Tech University
Organizer: Docimo, Donald	Texas Tech University
Organizer: Soudbakhsh, Damoon	Temple University
Organizer: Zhang, Dong	University of Oklahoma
Organizer: Song, Ziyou	University of Michigan, Ann Arbor
Organizer: Araujo Xavier, Marcelo	Amazon Leo
Organizer: Moura, Scott	University of California, Berkeley
Organizer: Lin, Xinfan	University of California, Davis
Organizer: Cui, Xiaofan	University of California, Los Angeles
Organizer: Filgueira da Silva, Samuel	The Ohio State University
Organizer: Tang, Shuxia	Texas Tech University
Organizer: Dey, Satadru	The Pennsylvania State University
Organizer: Jahan, Tania Rifat	Texas Tech University
Organizer: Hasankhani, Arezoo	University of New Hampshire

13:30-13:45, Paper WeB09.1	Add to My Program
Optimal and Heuristic Dispatch of PV-BESS for Multi-Load Peak Shaving (I)

McDougal, Caleb	Texas A&M University
Rasmussen, Bryan	Texas A&M University
Keywords: Optimization, Energy systems, Control applications Abstract: Simultaneous peak shaving for multi-load microgrids is an important problem with significant financial impact on industrial and commercial energy users. Cost savings can be realized by strategically reducing peak demand across multiple loads through the use of solar photovoltaic (PV) and battery energy storage systems (BESS). A mixed integer linear programming (MILP) optimization problem formulation for PV-BESS multi-load dispatch is outlined. It serves as a standard against which live controller performance can be compared. A heuristic dispatch control strategy is developed leveraging the nature of the 15 minute average demand billing structure. The practical controller has no need of complex live weather and demand forecasting and is developed for a multi-load application. Industrial electric utility data from a manufacturing facility in Houston, Texas is used for testing of the heuristic control strategy. Results show net savings within as near as 3.3% when comparing the heuristic controller and MILP on the industrial data.

13:45-14:00, Paper WeB09.2	Add to My Program
A Control Co-Design Framework to Achieve Solution Feasibility in Energy System Optimization Problems (I)

Jahan, Tania Rifat	Texas Tech University
Docimo, Donald	Texas Tech University
Keywords: Control applications, Energy systems Abstract: This work explores methods to identify energy system designs for infeasible control co-design optimization problems. Control co-design, or CCD, has been recognized as a powerful tool to maximize energy system capabilities through simultaneous determination of plant and controller parameters. However, due to the inherent nonlinearities, complexity, and conflicting criteria of energy systems, CCD optimization problems are susceptible to infeasibility and can lack potential solutions. While transforming the optimization problem by relaxing constraints has been developed for optimal control infeasibility challenges, solution feasibility for CCD is relatively unexplored. This paper proposes a framework to convert infeasible optimization problems into solvable forms for a class of CCD problems. The framework introduces a procedure to rank metric bounds from least likely to most likely to cause infeasibility. This provides guidance to algorithmically relax a limited number of constraints, leaving others intact. The proposed framework is applied to a CCD problem for designing a battery within a microgrid. Comparison against a baseline approach for relaxing optimization problems shows the framework requires only a reduced number of iterations to determine a solution.

14:00-14:15, Paper WeB09.3	Add to My Program
Safety-Aware Fast Charging Control for Lithium-Ion Batteries Via Physics-Enhanced Gaussian Process (I)

Choo, Wonoo	University of Oklahoma
Kajiura, Yuichi	University of Oklahoma
Espin, Jorge	University of Oklahoma
Xu, Zhicheng	University of Oklahoma
Zhang, Dong	University of Oklahoma
Keywords: Energy systems, Machine learning, Optimal control Abstract: Fast charging accelerates lithium-ion battery degradation and can trigger lithium plating, posing serious safety risks. While existing charging strategies such as Constant-Current Constant-Voltage (CCCV) or equivalent circuit model (ECM)-based charging methods are computationally efficient, they lack access to internal electrochemical states and cannot enforce degradation-aware safety limits. This work proposes a hybrid Model Predictive Control (MPC) framework that augments an ECM with a Gaussian Process (GP) surrogate trained on high-fidelity Doyle-Fuller-Newman (DFN) simulations. The key novelty lies in a GP surrogate that estimates lithium plating overpotential at the anode-separator interface, enabling real-time enforcement of physically meaningful safety constraints without the computational burden of electrochemical models. Simulation results on an LG-M50 cell show that the proposed GP-ECM MPC reduces plating violations by 85-95% compared to CCCV, ECM-MPC, and SPMe-MPC methods, with less than a 10% increase in charging time and only 7 ms of additional computation per optimization step. These results demonstrate a scalable, safety-conscious fast-charging strategy suitable for practical deployment in future battery management systems.

14:15-14:30, Paper WeB09.4	Add to My Program
Optimization of Phase Change Material Integration for Active Cooling Control (I)

Ouedraogo, Asmaou Sougri-Nooma	Texas Tech University
Docimo, Donald	Texas Tech University
Keywords: Energy systems, Control applications Abstract: This paper presents a unified optimization framework for phase change material (PCM) based cooling systems. Thermal management is critical in applications such as photovoltaic (PV) modules, battery packs, and power electronics, where excessive heat reduces performance and lifespan. Designing such systems is challenging because energy dynamics, capacity, heat rejection, and structural constraints must all be considered. Although prior studies have investigated PCM applications and heat transfer enhancement, there are limited efforts that unify such diverse performance objectives through formalized design methods. This paper develops a framework that formulates the PCM design problem using critical energy-based terms, with static and dynamic objectives capturing the PCM physical design and control aspects. Two case studies are used to validate the approach: the first explores passive cooling, and the second implements an active cooling configuration. The results compare the design and control of these systems, showing improvement in individual performance metrics between the two options.

14:30-14:45, Paper WeB09.5	Add to My Program
Threshold Policy-Enhanced Reinforcement Learning for Multi-Path Wireless Charging Scheduling in Coupled Power and Transportation Systems

Hao, Liangliang	Shandong University
Cai, Yuanyu	Shandong University
Jin, Jiangliang	Donghua University
Chen, Cong	Cornell University
Keywords: Power systems, Reinforcement learning, Stochastic optimal control Abstract: We study the optimal dynamic wireless charging (DWC) for electric vehicles in coupled power and transportation systems with stochastic renewable energy supply. We formulate the problem as a Markov decision process (MDP) that captures the interdependencies between traffic flows, charging demand, and grid operations. Our key contribution is a path-dependent threshold mechanism that reduces the high-dimensional charging decision space by exploiting the structural properties of optimal policies along vehicles’ paths. To approximate optimal thresholds, we develop a customized deep reinforcement learning framework that achieves efficient learning through intelligent action space reduction. The proposed method dynamically adjusts charging decisions by jointly considering traffic conditions, renewable availability, and grid capacity constraints. Experiments on realistic traffic-grid scenarios demonstrate up to 44% operational cost reduction compared to state-of-the-art benchmarks while reducing training time by 22%.

14:45-15:00, Paper WeB09.6	Add to My Program
Strategic Congestion Manipulation by Monopolistic Storage Aggregator in Power Networks

Davoudi, Mehdi	Purdue University
Qin, Junjie	Purdue University
Keywords: Power systems, Smart grid, Optimization Abstract: This paper studies the strategic behavior of a monopolistic storage aggregator in electricity markets, focusing on how coordinated operation of geographically dispersed grid-scale storage units can deliberately alter power flows to create artificial transmission congestion, i.e., congestion driven by strategic actions rather than fundamental supply-demand patterns. We model the interaction between the aggregator, who earns from both energy arbitrage and financial transmission rights (FTRs), and the system operator as a Stackelberg game. In a two-bus setting, we analytically characterize equilibrium outcomes and identify necessary and sufficient conditions for artificial congestion, expressed as a threshold on the total energy that the aggregator can strategically mobilize. We further show that FTR ownership strongly shapes incentives: Congestion is more likely when the aggregator owns/longs FTRs, but can be avoided if the aggregator is required to short them. For general networks, we present methods to solve for the equilibrium of the Stackelberg game and screen for artificial congestion.


WeB10 Regular Session, Grand Salon 15	Add to My Program
Linear Systems II

Chair: Cichella, Venanzio	University of Iowa
Co-Chair: Miksits, Adam	Ericsson Research

13:30-13:45, Paper WeB10.1	Add to My Program
Enabling Steep Slope Walking in Husky Using Reduced Order Modeling and Quadratic Programming

Venkatesh Krishnamurthy, Kaushik	Northeastern University
Sihite, Eric	California Institute of Technology
Wang, Chenghao	Northeastern University
Pitroda, Shreyansh	Northeastern University
Salagame, Adarsh	Northeastern University
Ramezani, Alireza	Northeastern University
Gharib, Morteza	Caltech
Keywords: Linear systems, Predictive control for linear systems, Biologically-inspired methods Abstract: Wing-assisted inclined running (WAIR) observed in some young birds, is an attractive maneuver that can be extended to legged aerial systems. This study proposes a control method using a modified Variable Length Inverted Pendulum (VLIP) by assuming a fixed zero moment point and thruster forces collocated at the center of mass of the pendulum. A QP MPC is used to find the optimal ground reaction forces and thruster forces to track a reference position and velocity trajectory. Simulation result of this VLIP model on a slope of 40 degrees is maintained and shows thruster forces that can be obtained through posture manipulation. The simulation also provides insight to how the combined efforts of the thrusters and the tractive forces from the legs make WAIR possible in thruster-assisted legged systems.

13:45-14:00, Paper WeB10.2	Add to My Program
Peak Bounds for the Estimation Error under Sensor Attacks (I)

Stafström, Axel	Uppsala University
Arnström, Daniel	Uppsala University
Miksits, Adam	Ericsson Research
Umsonst, David	Ericsson Research
Keywords: Linear systems, Robust control, Observers for Linear systems Abstract: This paper investigates bounds on the estimation error of a linear system affected by norm-bounded disturbances and full sensor attacks. The system is equipped with a detector that evaluates the norm of the innovation signal to detect faults, and the attacker wants to avoid detection. We utilize induced L∞ system norms, also called peak-to-peak norms, to compare the estimation error bounds under nominal operations and under attack. This leads to a sufficient condition for when the bound on the estimation error is smaller during an attack than during nominal operation. This condition is independent of the attack strategy and depends only on the attacker's desire to remain undetected and (indirectly) the observer gain. Therefore, we investigate both an observer design method, that seeks to reduce the error bound under attack while keeping the nominal error bound low, and detector threshold tuning. As a numerical illustration, we show how a sensor attack can deactivate a robust safety filter based on control barrier functions if the attacked error bound is larger than the nominal one. We also statistically evaluate our observer design method and the effect of the detector threshold.

14:00-14:15, Paper WeB10.3	Add to My Program
Direct Data-Driven Control to Output Tracking of Switched Linear Systems

Zhang, Zezhou	Rutgers University
Zou, Qingze	Rutgers, the State University of New Jersey
Keywords: Linear systems, Switched systems, Behavioural systems Abstract: In this paper, we consider the problem of directly synthesizing a control input from the past input-output data to achieve precision tracking for switched linear systems. To this end, we proposed a data-driven direct input synthesis controller that generates the control input from the past input-output data directly. An initialization process is designed to collect the information of transient response, and it's shown that when the past data is rich enough, i.e., satisfying the persistently excitation condition, the transient response can be precisely calculated. The control input is generated by the data-driven direct input synthesis control method, the tracking performance is guaranteed under the finite preview condition, and the tracking error is bounded by the summation of a monotonic increasing function, {alpha}_1(cdot), and a monotonic decreasing function, {alpha}_2(cdot). The proposed method is illustrated through a numerical simulation on a switched linear system.

14:15-14:30, Paper WeB10.4	Add to My Program
Generic Numbers of Finite Poles and Zeros: A Bond Graph-Based Determination

Laaribi, Amine	INSA Lyon
Eberard, Damien	Univ. LYON, INSA
Marquis-Favre, Wilfrid	Ampère Laboratory - INSA Of
Keywords: Linear systems Abstract: This paper presents a graph-based procedure for computing the generic number of finite poles and zeros. The two key ingredients are the path-cycle families and the separator-based decomposition. The former generically encodes degree and valuation of minors. The latter gives an equivalent system matrix that eases system zeros characterization within graphs. Using graph connectivity properties, the generic number of zeros located at s=0 are computed. The remaining finite structure is then derived.

14:30-14:45, Paper WeB10.5	Add to My Program
Design of Input-Output Observers for a Population of Systems with Bounded Frequency-Domain Variation Using DK-Iteration

Adams, Timothy Everett	McGill University
Forbes, James Richard	McGill University
Keywords: Observers for Linear systems, Robust control, Uncertain systems Abstract: This paper proposes a linear input-output observer design methodology for a population of systems in which each observer uses knowledge of the linear time-invariant dynamics of the particular device. Observers are typically composed of a known model of the system and a correction mechanism to produce an estimate of the state. The proposed design procedure characterizes the population variation in the frequency domain and synthesizes a single robust correction filter. The correction filter guarantees a level of estimation performance for all systems compatible with the uncertainty characterization. This is accomplished by posing a robust performance problem using the observer error dynamics and solving it using DK-iteration. The design procedure is experimentally demonstrated on a flexible joint robotic manipulator with varied joint stiffnesses. It is shown that the proposed robust correction filter achieves comparable estimation performance to a method using a correction gain tailored toward each joint stiffness configuration.

14:45-15:00, Paper WeB10.6	Add to My Program
Robust Linear Output Regulation under Piecewise Bernstein/Bézier Exogenous Signals

Cichella, Venanzio	University of Iowa
Mimmo, Nicola	University of Bologna
Marconi, Lorenzo	Univ. Di Bologna
Keywords: Output regulation, Linear systems, Constrained control Abstract: This paper addresses robust output regulation for linear systems driven by exogenous signals represented as composite Bernstein/Bézier polynomials. Building on prior work for single-segment signals, we develop a continuous internal-model-based controller that guarantees exponential convergence of the regulation error without hybrid switching laws. We rigorously characterize steady-state trajectories induced by composite polynomials and analyze transient errors at segment transitions. Based on this analysis, we propose a replanning strategy for future segments that mitigates switching effects while satisfying state and input constraints. Numerical simulations on navigation tasks demonstrate the effectiveness of the approach.


WeB11 Regular Session, Grand Salon 16	Add to My Program
Modeling

Chair: Wang, Jing	Illinois State University
Co-Chair: Xu, Jiawei	University of Michigan

13:30-13:45, Paper WeB11.1	Add to My Program
Data-Driven Balancing Formulation for Linear Systems with Quadratic Outputs

Padhi, Reetish	Virginia Tech
Gosea, Ion Victor	Max Planck Institute for Dynamics of Complex Technical Systems
Pontes Duff Pereira, Igor	Max Planck Institute for Dynamics of Complex Technical Systems
Gugercin, Serkan	Virginia Tech
Keywords: Model/Controller reduction, Large-scale systems, Computational methods Abstract: We develop the theoretical formulation for a non-intrusive, quadrature-based method for approximate balanced truncation (QuadBT) of linear systems with quadratic outputs, thus extending the applicability of QuadBT, which was originally designed for data-driven balanced truncation of standard linear systems with linear outputs only. The new approach makes use of the time-domain and frequency-domain quadrature-based representation of the system's infinite Gramians, only implicitly. We show that by sampling solely the extended impulse responses (kernels) of the original system and their derivatives (or the corresponding transfer functions), we construct a reduced-order model that mimics the approximation quality of the intrusive (projection-based) balanced truncation. Although the sampling of the required kernels via input/output simulations or physical experiments is still an open question, we demonstrate a proof of concept for the proposed framework on an example using numerically evaluated data.

13:45-14:00, Paper WeB11.2	Add to My Program
Modeling and Experimental Validations of Human Driver Controls under Advisory Speed Guidance in Connected Mixed Traffic

Wang, Rongyao	Clemson University
Han, Jihun	Argonne National Laboratory
Ard, Tyler	Argonne National Lab
Karbowski, Dominik	Argonne National Laboratory
Vahidi, Ardalan	Clemson University
Jia, Yunyi	Clemson Universtiy
Keywords: Modeling, Automotive control, Supervisory control Abstract: Human driver modeling is fundamental for analyzing connected mixed traffic, where human-driven and autonomous vehicles will coexist, and real-time advisory guidance is increasingly available. Traditional car-following models capture physical interactions with the lead vehicle but often fail to represent how drivers integrate advisory speed inputs into their decision-making. This paper proposes an optimization-based human driver model that imitates car-following behavior while accounting for advisory speed guidance. The model combines front-vehicle states with advisory information through a confidence mechanism that reflects driver trust, allowing it to reproduce both compliance with and rejection of advisories. Model parameters are optimized offline using naturalistic data collected in a mixed-reality digital twin driving simulator, enabling precise calibration to individual driving styles. Experimental results in congestive traffic scenarios show that the proposed model achieves sub–0.5 m/s trajectory error and outperforms the existing naive models.

14:00-14:15, Paper WeB11.3	Add to My Program
Regularized Online Adaptation of Parametric Interactive Multiple Model against Overfitting for Vehicle Motion Tracking

Wang, Zini	Central South University
Huang, Zhiwu	Central South University
Liu, Yongjie	Changsha University of Science & Technology
Li, Heng	Central South University
Liu, Weirong	Central South University
Wang, Jing	Illinois State University
Keywords: Modeling, Control applications, Estimation Abstract: Interactive Multiple Model (IMM) has been widely applied in motion tracking tasks due to its capability in handling multiple macroscopic behavior hypotheses. In the context of human-driven vehicle motion tracking, drivers often exhibit diverse and personalized driving preferences, which challenge conventional IMM algorithms that rely on predefined models. Nevertheless, the adaptation of multiple models to match observations may induce an overfitting issue, which compromises the ability to identify the real behavior. To address these challenges, this paper presents a regularized online adaptation approach against overfitting for IMM-based vehicle motion tracking. We introduce an advanced motion modeling framework in the Frenet coordinate frame, which naturally decouples longitudinal and lateral motions. Then, a differentiable parametric motion policy is incorporated for enhancing the model adaptability based on simultaneous state and parameter estimation using extended Kalman filtering (EKF). Furthermore, the overfitting problem is addressed by imposing regularization constraints on the model parameters in the filtering correction step based on the Alternating direction method of multipliers (ADMM) iteration. In simulation, the proposed method shows advantages in accurate motion tracking while remaining computationally efficient.

14:15-14:30, Paper WeB11.4	Add to My Program
From Ellipsoids to Midair Control of Dynamic Hitches

Xu, Jiawei	University of Michigan
Bhattacharya, Subhrajit	Lehigh University
Saldana, David	Lehigh University
Keywords: Modeling, Lyapunov methods, Optimal control Abstract: The ability to manipulate and interlace cables using aerial vehicles can greatly improve aerial transportation tasks. Such interlacing cables create hitches by winding two or more cables around each other, which can enclose payloads or can further develop into knots. Dynamic modeling and control of such hitches are key to mastering inter-cable interactions in the context of cable-suspended aerial manipulation. This paper introduces an ellipsoid-based kinematic model to connect the geometric nature of a hitch created by two cables and the dynamics of the hitch driven by four aerial vehicles, which reveals the control-affine form of the system. As the constraint for maintaining tension of a cable is also control-affine, we design a quadratic programming-based controller that combines Control Lyapunov and High-Order Control Barrier Functions (CLF-HOCBF-QP) to precisely track a desired hitch position and system shape while enforcing safety constraints like cable tautness. We convert desired geometric reference configurations into target robot positions and introduce a composite error into the Lyapunov function to ensure a relative degree of one to the input. Numerical simulations validate our approach, demonstrating stable, high-speed tracking of dynamic references.

14:30-14:45, Paper WeB11.5	Add to My Program
On Improved Statistical Accuracy of Low-Order Polynomial Chaos Approximations

Deshpande, Vedang M.	Mitsubishi Electric Research Laboratories
Bhattacharya, Raktim	Texas A&M
Keywords: Reduced order modeling, Stochastic systems, Uncertain systems Abstract: Polynomial chaos expansions provide surrogate models for stochastic systems, with coefficients typically derived using Galerkin projection, stochastic collocation, or least squares approximation. These traditional approaches often fail to accurately capture statistical moments without resorting to high-order approximations. We propose a constrained optimization framework that modifies standard techniques to determine polynomial chaos coefficients that precisely recover the first two statistical moments. The effectiveness of our approach is demonstrated on several candidate algebraic functions of random variables, showing significant improvements in statistical accuracy even with low-order approximations.


WeB12 Invited Session, Grand Salon 18	Add to My Program
Design, Control, and Optimization of Marine Energy Systems

Chair: Vermillion, Christopher	University of Michigan
Co-Chair: Shabara, Mohamed	National Renewable Energy Labs (US)
Organizer: Hasankhani, Arezoo	University of New Hampshire
Organizer: Vermillion, Christopher	University of Michigan
Organizer: Zuo, Lei	University of Michigan
Organizer: Tang, Yufei	Florida Atlantic University
Organizer: Shabara, Mohamed	National Renewable Energy Labs (US)

13:30-13:45, Paper WeB12.1	Add to My Program
Model Predictive Control of a Motion Rectifier Power Take-Off with Unidirectional Power Flow (I)

Yang, Lisheng	University of Michigan
Huang, Jianuo	University of Michigan
Li, Xiaofan	University of Michigan
Mi, Jia	Stevens Institute of Technology
Lambert, Scott	National Renewable Energy Laboratory
Sindler, Petr	National Renewable Energy Laboratory
Hajj, Muhammad	Stevens Institute of Technology
Zuo, Lei	University of Michigan
Keywords: Mechatronics, Mechanical systems/robotics, Switched systems Abstract: This paper describes a control algorithm and its implementation within a model predictive framework for a special motion rectifier power take-off (PTO). Commonly used for wave energy converters, the motion rectifier PTO has advantages in preventing unnecessary generator velocity reversals. However, control of this type of PTO is hard to design, especially when direct control optimization is needed in model predictive control (MPC). A modified dynamic programming (DP) algorithm is then developed to perform open-loop optimization of both the generator torque and the PTO switching. This DP optimizer is embedded inside a rolling horizon MPC style controller to enable real-time control of the PTO. Due to the sequential nature of the DP algorithm, unidirectional power constraint can be easily enforced in the optimization, allowing power evaluation of a passive power conversion system, which can use simpler components but is traditionally challenging for optimal control. Simulation results demonstrate the effectiveness of the proposed control algorithm. It is found that the rectifier PTO increases power by 120% for the selected large generator inertia, and unidirectional power flow constraint only decreases power by 8%. Finally, a hardware-in-loop experiment is conducted to demonstrate the control algorithm in real-time. Experiment results show a 50% power increase by enabling active switching control while keeping the generator damping fixed.

13:45-14:00, Paper WeB12.2	Add to My Program
Biconjugate Impedance Matching Control of the HERO WEC (I)

Riley, Allison	Colorado School of Mines
Shabara, Mohamed	National Laboratory of the Rockies
Aslangil, Denis	Colorado School of Mines
Keywords: Energy systems Abstract: Effective control strategies are essential for maximizing the power output of wave energy converters. This study applies port-based modeling for biconjugate impedance matching to the HERO WEC (hydraulic and electric reverse osmosis wave energy converter), a small-scale, modular point absorber for water desalination, exploring the benefits and limitations for this type of reactive control with a hydraulic system. Despite its foundational role in electrical engineering, biconjugate impedance matching has had limited use in wave energy and has not yet been demonstrated in wave-powered desalination systems. Representing this complex multidisciplinary system as an equivalent circuit enables analysis of frequency-based relations. This paper discusses important considerations of port-based modeling for hydraulic systems and identifies optimal parameters to reduce impedance mismatch by 99.93%, providing a clear framework for biconjugate impedance matching in wave-powered water desalination systems.

14:00-14:15, Paper WeB12.3	Add to My Program
From Cut-In to Rated: Multi-Region Floating Offshore Wind Farm Control for Secondary Frequency Regulation

Ampleman, Stephen	Johns Hopkins University
Gayme, Dennice	Johns Hopkins University
Keywords: Energy systems, Predictive control for nonlinear systems, Reduced order modeling Abstract: This paper describes a multi-region control framework for floating offshore wind farms. Specifically, we propose a novel generator torque controller that regulates rotor speed in Region 2, corresponding to wind speeds between the cut-in and rated values. In Region 3 (wind speeds at or above rated but below cut-out speed) we employ a PI-LQR for collective blade pitch. Control blending across the transitional wind speeds (Region 2.5) employs a sigmoid weighting function applied to the control variables. Two modeling paradigms are proposed for farm-level power tracking with rotor speed regularization: a nonlinear model predictive controller (NL-MPC) with a dynamic wake model, and a reduced order model predictive controller based on linear parameter varying turbine models with a time delay representation of wake advection (LPVTD-MPC). These approaches are evaluated over three wind inlet conditions using the PJM ancillary service certification criteria for participation in a secondary frequency regulation market. Results show that both approaches achieve scores of at least 89.9% for the three different testing scenarios, which are well above the qualification threshold of 75%. However, the LPVTD-MPC approach solves the problem in under half the time versus NL-MPC but with slightly larger fluctuations in farm-level power output, highlighting the trade-off between performance and computational tractability. The control framework is among the first to address multi-region wind turbine dynamics together with market driven power tracking objectives for floating offshore wind farms. Such multi-region control becomes increasingly necessary in the floating turbine setting where large (region spanning) wind speed variations are common due to wave induced platform pitching.

14:15-14:30, Paper WeB12.4	Add to My Program
A Hierarchical Optimization Method for Mission Planning of a Mobile, Anchorless Wave-Powered Desalination Device (I)

Alam, Minhazul	University of Michigan, Ann Arbor
McGuire, Carson	North Carolina State University
Liu, Limeng	University of MIchigan
Bryant, Matthew	North Carolina State University
Vermillion, Christopher	University of Michigan
Keywords: Energy systems, Maritime control, Emerging control applications Abstract: This paper presents a hierarchical approach for optimal mission planning of a mobile, anchorless wave-powered mobile desalination device in a spatiotemporally varying environment. The goal is to fill an on-board freshwater tank in minimum time by strategically targeting areas of maximum wave energy resource. In such an environment, the complexity of mission planning lies in the varying nature of the wave resource. In this paper, we first present the use of indirect methods to derive a continuous heading trajectory for the system. Owing to the fact that this optimization problem is ill-posed under many circumstances, we separate the problem into the optimization of a discrete set of waypoints and an effective speed at which to proceed between waypoints (while the system has very limited control over its absolute speed, it can "zig-zag" in order to achieve reduced speeds when advantageous). Here, an upper-level optimizer iteratively uses a genetic algorithm (GA) to generate populations of waypoints, and a lower-level optimizer fully optimizes the effective speed trajectories for each member of the population. We present a series of simulation results for a specified device design, based on wave energy data from the Hawaiian islands.

14:30-14:45, Paper WeB12.5	Add to My Program
Q-Learning Based Adaptive Linear Extended State Observer for Core Power Control of Pressurized Water Reactor

Haider, Faraz	Indian Institute of Technology Patna
Kumar, Sumit Ranjan	Indian Institute of Technology Patna
Ali, Ahmad	Indian Institute of Technology Patna
Keywords: Control applications, Reinforcement learning, Observers for nonlinear systems Abstract: Core power control of pressurized water reactor (PWR) is challenging as it involves highly nonlinear dynamics characterized by time-varying system parameters, modeling uncertainties, internal and external disturbances. Linear extended state observer (LESO) in the framework of active disturbance rejection control (ADRC) offers a simple yet robust paradigm for control of uncertain systems under diverse disturbances. This work proposes a maximum sensitivity constrained region based graphical approach, termed graphical LESO (GLESO), to tune the controller parameters of linear ADRC. Furthermore, a Q-learning based strategy, called Q-GLESO, is developed to adaptively tune the controller parameters between scaled multiples of those obtained from GLESO. Effectiveness of proposed schemes is demonstrated through comprehensive simulations on the nonlinear mathematical model of PWR which illustrate the superiority of Q-GLESO over GLESO and LESO for various tracking and disturbance rejection test scenarios.


WeB13 Regular Session, Grand Salon 19	Add to My Program
Autonomous Systems II

Chair: Smith, Stephen L.	University of Waterloo
Co-Chair: Beaver, Logan E.	Old Dominion University

13:30-13:45, Paper WeB13.1	Add to My Program
Trajectory Tracking with Reachability-Guided Quadratic Programming and Freeze-Resume

Gholampour, Hossein	Old Dominion University
Beaver, Logan E.	Old Dominion University
Keywords: Autonomous systems, Optimization, Control applications Abstract: Many robotic systems must follow planned paths yet pause safely and resume when people or objects intervene. We present an output–space method for systems whose tracked output can be feedback-linearized to a double integrator (e.g., manipulators). The approach has two parts. Offline, we perform a pre-run reachability check to verify that the motion plan respects speed and acceleration magnitude limits. Online, we apply a quadratic program to track the motion plan under the same limits. We use a one-step reachability test to bound the maximum disturbance the system is capable of rejecting. When the state coincides with the reference path we recover perfect tracking in the deterministic case, and we correct errors using a KKT-inspired weight. We demonstrate that safety stops and unplanned deviations are handled efficiently, and the system returns to the motion plan without replanning. We demonstrate our system's improved performance over pure pursuit in simulation.

13:45-14:00, Paper WeB13.2	Add to My Program
Barometer-Aided Attitude Estimation

Nyoba Tchonkeu, Melone	University of Quebec in Outaouais (UQO)
Berkane, Soulaimane	University of Quebec in Outaouais
Hamel, Tarek	I3S-CNRS-UCA
Keywords: Autonomous vehicles, Observers for nonlinear systems, Sensor fusion Abstract: Accurate and robust attitude estimation is a central challenge for autonomous vehicles operating in GNSS-denied or highly dynamic environments. In such cases, Inertial Measurement Units (IMUs) alone are insufficient for reliable tilt estimation due to the ambiguity between gravitational and inertial accelerations. While auxiliary velocity sensors, such as GNSS, Pitot tubes, Doppler radar, or visual odometry, are often used, they can be unavailable, intermittent, or costly. This work introduces a barometer-aided attitude estimation architecture that leverages barometric altitude measurements to infer vertical velocity and attitude within a nonlinear observer on SO(3). The design cascades a deterministic Riccati observer with a complementary filter, ensuring Almost Global Asymptotic Stability (AGAS) under a uniform observability condition while maintaining geometric consistency. The analysis highlights barometer-aided estimation as a lightweight and effective complementary modality.

14:00-14:15, Paper WeB13.3	Add to My Program
A Multi-Layer Resilient Architecture for Constrained Autonomous Vehicles Subject to False Data Injections During Long Range Missions

Franze, Giuseppe	Universita' Della Calabria
Lucia, Walter	Concordia University
Tedesco, Francesco	Università Della Calabria
Venturino, Antonello	Università Della Calabria
Youssef, Amr	Concordia University
Keywords: Autonomous systems, Predictive control for linear systems, Networked control systems Abstract: In this paper, the problem of safely driving an autonomous vehicle for long range missions despite malicious actions on the data shared with the remote side is considered. This operating scenario poses two key questions: the capability of the underlying control algorithm to be feasible until the mission is accomplished; the effectiveness of adequate countermeasures to contrast the anomalous dynamical system behavior resulting from unpredictable attack phenomena. These requirements are here addressed by designing a layered control architecture that takes advantage of three methodologies: model predictive control, data encryption and perturbation analysis.

14:15-14:30, Paper WeB13.4	Add to My Program
Application of Actively Coupled Sensor Configuration and Planning to an Unknown Forest Fire Environment

DesRoches, Jeffrey	Worcester Polytechnic Institute
Poudel, Prakash	Worcester Polytechnic Institute
Cowlagi, Raghvendra V.	Worcester Polytechnic Institute
Keywords: Autonomous systems, Information theory and control, Sensor fusion Abstract: We study autonomous path planning in an unknown, time-varying forest fire threat field, where an ego vehicle must minimize temperature exposure using partial observations from small unmanned aerial vehicle (SUAV) sensors. Threat estimates are obtained indirectly from overhead imagery processed by a U-Net segmentation model, producing noisy, biased measurements of the evolving field. To address this problem, we apply an Actively Coupled Sensor Configuration and Path-planning (A-CSCP) framework that jointly optimizes sensor placement and ego trajectory. Four sensor reconfiguration strategies are compared: random placement, lawnmower sweep, standard mutual information (SMI), and context-relevant mutual information (CRMI). To remain tractable for the large state dimension, SMI and CRMI are implemented in approximate diagonal form. Simulation results show that CRMI provides the best performance, eliminating exposure and reducing mission duration by aligning sensing actions with the ego vehicle’s future path. These results demonstrate the practical value of approximate information-driven sensing and planning in dynamic hazardous environments.

14:30-14:45, Paper WeB13.5	Add to My Program
Output-Feedback for UAS Using Bearing and Angular Size in an Anti-Pronav Collision Avoidance Scheme

Liu, Jen-Jui	Brigham Young University
Evans, Curtis P.	Brigham Young University
Beard, Randal W.	Brigham Young Univ
Keywords: Autonomous systems, Vision-based control, Nonlinear output feedback Abstract: Small unmanned aircraft must avoid mid-air collisions under strict size, weight, and power limits that constrain sensing and estimation. We propose a sensor-minimal collision-avoidance controller that uses only monocular bearing, apparent angular diameter, and the sign of bearing rate, without estimating range, time-to-collision, or relative velocity. A constructive geometric proof establishes forward invariance of the safety set and finite-time recovery under realistic kinematic and control bounds for planar encounters. Monte Carlo trials (10,000 per scenario) across intruder sizes from small UAS to Boeing 737, including non-spherical silhouettes, achieved 100% success for constant-velocity encounters and remained robust to pixel-level noise and modest intruder maneuvers. The results indicate that camera-level cues alone can support provably safe and computationally lightweight avoidance for resource-constrained aircraft.

14:45-15:00, Paper WeB13.6	Add to My Program
Hierarchical Informative Path Planning Via Graph Guidance and Trajectory Optimization

Iskandar, Avraiem	University of Waterloo
Dutta, Shamak	University of Waterloo
Murrant, Kevin	Memorial University
Pant, Yash Vardhan	University of Waterloo
Smith, Stephen L.	University of Waterloo
Keywords: Autonomous systems, Optimization, Estimation Abstract: We study informative path planning (IPP) with travel budgets in cluttered environments, where an agent collects measurements of a latent field modeled as a Gaussian process (GP) to reduce uncertainty at target locations. Graph-based solvers provide global guarantees but assume pre-selected measurement locations, while continuous trajectory optimization supports path-based sensing but is computationally intensive and sensitive to initialization in obstacle-dense settings. We propose a hierarchical framework with three stages: (i) graph-based global planning, (ii) segment-wise budget allocation using geometric and kernel bounds, and (iii) spline-based refinement of each segment with hard constraints and obstacle pruning. By combining global guidance with local refinement, our method achieves lower posterior uncertainty than graph-only and continuous baselines, while running faster than continuous-space solvers (up to 9x faster than gradient-based methods and 20x faster than black-box optimizers) across synthetic cluttered environments and Arctic datasets.


WeB14 Regular Session, Grand Salon 21	Add to My Program
Koopman

Chair: Sabelhaus, Andrew	Boston University
Co-Chair: Liu, Hugh Hong-Tao	Univ. of Toronto

13:30-13:45, Paper WeB14.1	Add to My Program
Efficient Optimal Path Planning in Dynamic Environments Using Koopman MPC

Abtahi, Mohammad	University of California Davis
Mojahed Baghbadorani, Navid	University of California Davis
Nazari, Shima	University of California Davis
Keywords: Robotics, Nonlinear systems identification, Optimal control Abstract: This paper presents a data-driven model predictive control framework for mobile robots navigating in dynamic environments, leveraging Koopman operator theory. Unlike the conventional Koopman-based approaches that focus on the linearization of system dynamics only, our work focuses on finding a global linear representation for the optimal path planning problem that includes both the nonlinear robot dynamics and collision-avoidance constraints. We deploy extended dynamic mode decomposition to identify linear and bilinear Koopman realizations from input–state data. Our open-loop analysis demonstrates that only the bilinear Koopman model can accurately capture nonlinear state–input couplings and quadratic terms essential for collision avoidance, whereas linear realizations fail to do so. We formulate a quadratic program (QP) for the robot path planning in the presence of moving obstacles in the lifted space and determine the optimal robot action in an MPC framework. Our approach is capable of finding the safe optimal action 320 times faster than a nonlinear MPC counterpart that solves the path planning problem in the original state space. Our work highlights the potential of bilinear Koopman realizations for linearization of highly nonlinear optimal control problems subject to nonlinear state and input constraints to achieve computational efficiency similar to linear problems.

13:45-14:00, Paper WeB14.2	Add to My Program
Koopman Spectral Analysis and System Identification for Stochastic Dynamical Systems Via Yosida Approximation of Generators

Zhou, Jun	Huazhong University of Science and Technology
Meng, Yiming	University of Illinois Urbana-Champaign
Liu, Jun	University of Waterloo
Keywords: Identification, Stochastic systems, Estimation Abstract: System identification and Koopman spectral analysis are crucial for uncovering physical laws and understanding the long-term behaviour of stochastic dynamical systems governed by stochastic differential equations (SDEs). In this work, we propose a novel method for estimating the Koopman generator of systems of SDEs, based on the theory of resolvent operators and the Yosida approximation. This enables both spectral analysis and accurate estimation and reconstruction of system parameters. The proposed approach relies on only mild assumptions about the system and effectively avoids the error amplification typically associated with direct numerical differentiation. It remains robust even under low sampling rates or with only a single observed trajectory, reliably extracting dominant spectral modes and dynamic features. We validate our method on two systems and compare it with existing techniques as benchmarks. The experimental results demonstrate the effectiveness and improved performance of our approach in system parameter estimation, spectral mode extraction, and overall robustness.

14:00-14:15, Paper WeB14.3	Add to My Program
Koopman Control Parametrization: Data-Driven Convex Controller Design for a Class of Nonlinear Systems

Ondogan, Taha	Boston University
Jing, Ran	Boston University
Sabelhaus, Andrew	Boston University
Tron, Roberto	Boston University
Keywords: Identification for control, Lyapunov methods, Optimization Abstract: Although Koopman operators provide a way to represent autonomous nonlinear systems as linear operators in a high-dimensional space, they do not immediately apply to non-autonomous systems with inputs. State (or output) feedback controller design therefore remains nonconvex in typical formulations, even after the use of existing approximations via bilinear control-affine terms. We address this gap by introducing the Koopman Control Parametrization, a novel parameterization of control-affine dynamical systems that directly includes a feedback controller defined as a linear combination of nonlinear measurements. With this choice, the Koopman operator of the closed-loop system is a bilinear combination of two matrices: one representing the system, and the other the controller. We propose a set of sufficient conditions such that the parametrization holds. Then, we present an algorithm that calculates the feedback matrix via semi-definite programming, producing a Lyapunov-stable closed-loop system with convex optimization. We evaluate the proposed controllers on two canonical examples of control-affine nonlinear systems (inverted pendulums), showing that our approach successfully stabilize both with a proper choice of basis functions. In summary, this manuscript introduces a control synthesis method for stabilization of nonlinear systems that is broadly applicable, can be computed efficiently, is verifiably stable and data-driven.

14:15-14:30, Paper WeB14.4	Add to My Program
Data-Driven Observer Synthesis for Autonomous Limit Cycle Systems through Estimation of Koopman Eigenfunctions

Ni, Angela	North Carolina State University
Tang, Wentao	NC State University
Keywords: Observers for nonlinear systems, Machine learning, Nonlinear systems identification Abstract: The signal of system states needed for feedback controllers is estimated by state observers. A generic observer design is the Kazantzis--Kravaris/Luenberger (KKL) observer, an extension of the Luenberger observer for linear to nonlinear systems. The main challenge in the KKL design is the construction of an injective mapping of states, which requires solving PDEs based on a first-principles model. This paper proposes a data-driven, Koopman operator-based method for the construction of KKL observers for planar limit cycle systems. Specifically, for such systems, the KKL injection is guaranteed to be a linear combination of Koopman eigenfunctions. Hence, the determination of such an injection is reduced to a least-squares regression problem, and the inverse of the injective mapping is then approximated via kernel ridge regression. The entire synthesis procedure uses solely convex optimization. We apply the proposed approach to the Brusselator system, demonstrating accurate estimations of the system states.

14:30-14:45, Paper WeB14.5	Add to My Program
Trajectory Tracking of Quadrotors Using Deep Koopman Operator-Based Data-Driven Model Predictive Control

Zhang, Kunwu	University of Toronto
Liu, Hugh Hong-Tao	Univ. of Toronto
Keywords: Predictive control for nonlinear systems, Flight control, Learning Abstract: This work investigates the trajectory tracking of quadrotors using data-driven model predictive control (DDPC). We first propose a novel deep learning-enabled Koopman operator to establish the linear representation of the quadrotor systems. Rather than employing dynamic mode decomposition, this approach leverages the physics-informed neural network (PINN) to learn both observable functions and model parameters, thereby integrating the strengths of both model-based and model-free techniques. As a result, the bluetext{proposed PINN-based deep Koopman operator} can accurately capture both the dynamic behaviours and physical constraints of actual quadrotor systems. In addition, we develop a DDPC strategy based on the learned Koopman model to stabilize the quadrotor without relying on prior knowledge of the system dynamics. The proposed deep Koopman-based MPC strategy (DK-MPC) yields a quadratic programming formulation, enabling efficient computational solutions. Finally, two numerical examples and the comparison study are provided to verify the efficacy of the proposed method.

14:45-15:00, Paper WeB14.6	Add to My Program
On the Parametric Interpolation of the Koopman Operator for Dynamical Systems Prediction

Pavel, Radu	Ohio State University
Stockar, Stephanie	The Ohio State University
Goswami, Debdipta	The Ohio State University
Keywords: Nonlinear systems identification, Modeling, Identification Abstract: This paper proposes the parameter-interpolated extended dynamic mode decomposition (piEDMD) method that explicitly incorporates parameter dependence into the Koopman operator formulation by exploiting parameter-affine system structure. The approach enables interpolation across parameter values without retraining, providing a pathway toward generalizable surrogate models. The proposed method is evaluated on two nonlinear systems, namely a damped pendulum and a Van der Pol oscillator, demonstrating accuracy comparable to standard extended dynamic mode decomposition (EDMD) while eliminating the need for repeated retraining. In testing, the standard EDMD achieved an average RMSE of 8.9% for the pendulum and 11.11% for the Van der Pol oscillator, while the piEDMD achieved 7.8% and 12.69%, respectively.


WeB15 Regular Session, Grand Salon 22	Add to My Program
Nonholonomic Systems

Chair: Leung, Jordan	Mitsubishi Electric Research Laboratories
Co-Chair: Krstic, Miroslav	University of California, San Diego

13:30-13:45, Paper WeB15.1	Add to My Program
Invariant-Set Motion-Planner for Unicycle Dynamics under Closed-Loop Feedback Linearization

Leung, Jordan	Mitsubishi Electric Research Laboratories
Di Cairano, Stefano	Mitsubishi Electric Research Labs
Keywords: Nonholonomic systems, Autonomous robots, Feedback linearization Abstract: This paper develops an invariant-set motion-planner (ISMP) for vehicles with unicycle-like dynamics. ISMPs operate by determining a sequence of obstacle-free positively invariant (PI) sets for the closed-loop system that connect the initial and target state. In this paper, we derive PI sets under a dynamic feedback linearization law. We demonstrate that the PI sets are simple in geometry and we provide an efficient algorithm for obstacle-free scaling. The PI sets are then used to develop a graph-based search for finding an obstacle-free path between two states. The approach is demonstrated in an obstacle-rich simulated environment.

13:45-14:00, Paper WeB15.2	Add to My Program
Inverse Optimal Feedback and Gain Margins for Unicycle Stabilization

Kim, Kwang Hak	University of California San Diego
Todorovski, Velimir	University of California San Diego
Krstic, Miroslav	University of California, San Diego
Keywords: Nonholonomic systems, Lyapunov methods, Adaptive control Abstract: The recent development of globally strict control Lyapunov functions (CLFs) for the challenging unicycle parking problem provides a foundation for pursuing optimality. We address this in the inverse optimal framework, thereby avoiding the need to solve the Hamilton–Jacobi–Bellman (HJB) equations, and establish a general result that is optimal with respect to a meaningful cost. We present several design examples that impose varying levels of penalty on the control effort, including arbitrarily bounded control. Furthermore, we show that the inverse optimal controller possesses an infinite gain margin thanks to the system being driftless, and by leveraging this property, we extend the design to an adaptive controller that handles model uncertainty. Finally, we compare the performance of the non-adaptive inverse optimal controller with its adaptive counterpart.

14:00-14:15, Paper WeB15.3	Add to My Program
Half-Global Deadbeat Parking for Dubins Vehicle

Krstic, Miroslav	University of California, San Diego
Kim, Kwang Hak	University of California San Diego
Todorovski, Velimir	University of California San Diego
Keywords: Nonholonomic systems, Lyapunov methods, Constrained control Abstract: This paper presents a framework for stabilizing the Dubins vehicle model to zero in finite time (deadbeat parking) by interpreting distance as a time-like variable. We develop control laws that bring the system to a desired position and orientation even when the forward velocity cannot be directly actuated. While the controllers employ inverse-distance gains, we show that the control input remains bounded for all time. In addition to basic deadbeat parking, we incorporate safety considerations by proposing algorithms that prevent the vehicle from crossing in front of the target, enforce deceleration as it approaches the target, and guarantee parking without curb violations. The resulting methods are well-suited for missile guidance and fixed-wing pursuit, however they are broadly applicable to physical systems that are represented by the Dubins vehicle model.

14:15-14:30, Paper WeB15.4	Add to My Program
Modular Design of Strict Control Lyapunov Functions for Global Stabilization of the Unicycle in Polar Coordinates

Todorovski, Velimir	University of California San Diego
Kim, Kwang Hak	University of California San Diego
Krstic, Miroslav	University of California, San Diego
Keywords: Nonholonomic systems, Lyapunov methods Abstract: Since the mid-1990s, it has been known that, unlike in Cartesian form where Brockett’s condition rules out static feedback stabilization, the unicycle is globally asymptotically stabilizable by smooth feedback in polar coordinates. In this note, we introduce a modular framework for designing smooth feedback laws that achieve global asymptotic stabilization in polar coordinates. These laws are bidirectional, enabling efficient parking maneuvers, and are paired with families of strict control Lyapunov functions (CLFs) constructed in a modular fashion. The resulting CLFs guarantee global asymptotic stability with explicit convergence rates and include barrier variants that yield “almost global” stabilization, excluding only zero-measure subsets of the rotation manifolds. The strictness of the CLFs is further leveraged in our companion paper, where we develop inverse-optimal redesigns with meaningful cost functions and infinite gain margins.

14:30-14:45, Paper WeB15.5	Add to My Program
Integrator Forwarding Design for Unicycles with Constant and Actuated Velocity in Polar Coordinates

Krstic, Miroslav	University of California, San Diego
Todorovski, Velimir	University of California San Diego
Kim, Kwang Hak	University of California San Diego
Astolfi, Alessandro	KAUST
Keywords: Nonholonomic systems, Lyapunov methods Abstract: In a companion paper, we present a modular framework for unicycle stabilization in polar coordinates that provides smooth steering laws through backstepping. Surprisingly, the same problem also allows application of integrator forwarding. In this work, we leverage this feature and construct new smooth steering laws together with control Lyapunov functions (CLFs), expanding the set of CLFs available for inverse optimal control design. In the case of constant forward velocity (Dubins car), backstepping produces finite-time (deadbeat) parking, and we show that integrator forwarding yields the very same class of solutions. This reveals a fundamental connection between backstepping and forwarding in addressing both the unicycle and, the Dubins car parking problems.

14:45-15:00, Paper WeB15.6	Add to My Program
Extremum Seeking in 3-D Environments with Obstacle Avoidance

Jiang, Lai	Zhejiang University
Pan, Gaofeng	Institute of Cyber-Systems and Control, Zhejiang University
Zhu, Yang	Zhejiang University
Keywords: Optimal control, Adaptive control, Nonholonomic systems Abstract: This paper presents a three-dimensional extremum seeking (ES) control framework for autonomous source localization in environments containing obstacles. Unlike conventional path planning methods that rely on prior maps or GPS, our approach enables a nonholonomic vehicle to seek an unknown source using only local measurements, without explicit position information. We adopt a Rimon–Koditschek navigation function to simultaneously encode the objective of source attraction and obstacle repulsion, allowing the ES controller to operate safely in complex 3-D environments. Theoretical analysis based on averaging theory establishes exponential convergence to an ellipsoidal region around the extremum point. Two simulation scenarios are conducted to validate the effectiveness of the proposed method: one with a single ellipsoidal obstacle and another involving multiple convex obstacles. The results demonstrate that the vehicle can successfully reach the source while avoiding collisions. This framework offers a feasible solution for GPS-denied applications such as rescue robotics and environmental monitoring.


WeB16 Regular Session, Grand Salon 24	Add to My Program
Reinforcement Learning II

Chair: Fierro, Rafael	University of New Mexico
Co-Chair: Lian, Bosen	Auburn University

13:30-13:45, Paper WeB16.1	Add to My Program
Convex Optimization-Based Inverse Reinforcement Learning for Linear Quadratic Control

Lian, Bosen	Auburn University
Jia-cheng, Wu	Nanyang Technological University
Joy, Md Hasanuzzaman Chowdhury	Auburn University
Xue, Wenqian	University of Florida
Nguyen, Nhan	NASA Ames Research Center
Keywords: Reinforcement learning, Optimal control, Identification for control Abstract: This paper develops a convex-optimization-based inverse reinforcement learning (CO-IRL) framework that infers all cost-functional weights and the optimal policy for linear-quadratic regulator (LQR) control systems in both discrete and continuous time. The framework incorporates adaptive dynamic programming (ADP) to eliminate the need for explicit knowledge of the system dynamics. It solves the inference problem as an iterative CO problem under appropriate convexity constraints, enabling low-cost weight recovery subject to constraints and jointly learning cost weights (for both states and controls) and value functions for policy improvement, resulting in faster convergence. We first present a model-based variant, then extend it to a model-free, data-driven version by constructing a Q-function. Convergence is proven, and simulations show substantially faster convergence than non-optimization-based IRL.

13:45-14:00, Paper WeB16.2	Add to My Program
Defending against Poisoning Attacks in LQR Controllers Using Reinforcement Unlearning

John, Varkey Medayil	Georgia Institute of Technology
Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Keywords: Reinforcement learning, Optimal control, Optimization Abstract: In this paper, we introduce ``unlearning''-based, data-driven defenses against poisoning attacks on the learning of linear quadratic controllers. We first establish conditions for detecting whether a learned controller has been compromised. We then develop defense strategies that either reconstruct the system matrix to recover the unattacked Riccati solution or solve optimization problems that adjust output and cost matrices to neutralize the attack’s influence. Simulation results confirm the effectiveness of the proposed methods in restoring optimal control performance.

14:00-14:15, Paper WeB16.3	Add to My Program
Necessary and Sufficient Conditions for the Optimization-Based Concurrent Execution of Learned Robotic Tasks

Tahmid, Sheikh	University of Waterloo
Notomista, Gennaro	University of Waterloo
Keywords: Reinforcement learning, Robotics, Lyapunov methods Abstract: In this work, we consider the problem of executing multiple tasks encoded by value functions, each learned through Reinforcement Learning, using an optimization-based framework. Prior works develop this framework but did not address when learned value functions can be concurrently executed. This work’s main contributions consist of theorems which provide necessary and sufficient conditions to concurrently execute sets of learned tasks within subsets of the state space using the previously proposed min-norm controller. These theorems provide insight into when learned control tasks can be made concurrently executable, when they may already be so, and when concurrent execution is not possible under the proposed framework. We also extend the proposed framework to account for value functions trained with a discount factor, making it more compatible with standard RL practices.

14:15-14:30, Paper WeB16.4	Add to My Program
Adversarial Reinforcement Learning for Robust Control of Fixed-Wing Aircraft under Model Uncertainty

Marquis, Dennis	Virginia Tech
Wilhelm, Blake	Virginia Tech
Muniraj, Devaprakash	Indian Institute of Technology Madras
Farhood, Mazen	Virginia Tech
Keywords: Reinforcement learning, Robust control, Autonomous systems Abstract: This paper presents a reinforcement learning-based path following controller for a fixed-wing small uncrewed aircraft system (sUAS) that is robust to uncertainties in the aerodynamic model of the sUAS. The controller is trained using the Robust Adversarial Reinforcement Learning framework, where an adversary perturbs the environment (aerodynamic model) to expose the agent (sUAS) to demanding scenarios. In our formulation, the adversary introduces rate-bounded perturbations to the aerodynamic model coefficients. We demonstrate that adversarial training improves robustness compared to controllers trained using stochastic model uncertainty. The learned controller is also benchmarked against a switched uncertain initial condition controller. The effectiveness of the approach is validated through high-fidelity simulations using a realistic six-degree-of-freedom fixed-wing aircraft model, showing accurate and robust path following performance under a variety of uncertain aerodynamic conditions.

14:30-14:45, Paper WeB16.5	Add to My Program
Improved Robustness of Deep Reinforcement Learning for Control of Time-Varying Systems by Bounded Extremum Seeking

Saxena, Shaifalee	University of New Mexico
Williams, Alan	Los Alamos National Laboratory
Fierro, Rafael	University of New Mexico
Scheinker, Alexander	Los Alamos National Lab
Keywords: Reinforcement learning, Time-varying systems, Control applications Abstract: In this paper, we study the use of robust model-independent bounded extremum seeking (ES) feedback control to improve the robustness of deep reinforcement learning (DRL) controllers for a class of nonlinear time-varying systems. DRL has the potential to learn from large datasets to quickly control or optimize the outputs of many-parameter systems, but its performance degrades catastrophically when the system model changes rapidly over time. Bounded ES can handle time-varying systems with unknown control directions, but its convergence speed slows down as the number of tuned parameters increases and, like all local adaptive methods, it can get stuck in local minima. We demonstrate that together, DRL and bounded ES result in a hybrid controller whose performance exceeds the sum of its parts with DRL taking advantage of historical data to learn how to quickly control a many-parameter system to a desired setpoint while bounded ES ensures its robustness to time variations. We demonstrate the generality of our combined ES-DRL controller approach with numerical studies of three very different dynamic systems: 1) a general time-varying system, 2) automatic tuning of the Low Energy Beam Transport section at the Los Alamos Neutron Science Center linear particle accelerator, and 3) an intermittent-contact robotic block pushing task with a time-varying goal.

14:45-15:00, Paper WeB16.6	Add to My Program
Reinforcement Learning-Based Safe Optimal Control of Nonlinear Discrete-Time Affine Systems under Adversarial Inputs

Farzanegan, Behzad	Missouri University of Science and Technology
Jagannathan, Sarangapani	Missouri Univ of Science & Tech
Keywords: Neural networks, Reinforcement learning, Optimal control Abstract: This paper presents a resilient reinforcement learning (RL)-based safety-aware optimal adaptive tracking framework for nonlinear discrete-time affine systems under state constraints and adversarial sensor, actuator, and reward attacks. Sensor and actuator attacks are modeled through networked communication, while reward attacks affect the reward function or temporal difference error (TDE). A multilayer neural network (MNN)-based actor-critic architecture estimates the cost and optimal policy, using a clipped TDE and an adaptive Gaussian-based forgetting factor to balance resilience and optimality. Sensor and actuator attack resilience is achieved via a fault-aware cost-to-go function and a control barrier function (CBF) constraint embedded in a quadratic program (QP). The effectiveness is validated on a rear-wheel-drive vehicle.


WeB17 Invited Session, Churchill A1	Add to My Program
Healthcare and Medical Systems (II)

Chair: Hahn, Jin-Oh	University of Maryland
Co-Chair: Pereira, Emily	Texas Tech University
Organizer: Menezes, Amor A.	University of Florida
Organizer: Hahn, Jin-Oh	University of Maryland
Organizer: Medvedev, Alexander V.	Uppsala University
Organizer: Mesbah, Ali	University of California, Berkeley
Organizer: Pereira, Emily	Texas Tech University
Organizer: Zhang, Wenlong	Arizona State University

13:30-13:45, Paper WeB17.1	Add to My Program
Optimal Control Strategies for Cutaneous Inflammatory Disease Treatment Using a Predictive Multi-Scale Model (I)

Subedar, Gowri Yathishchandran	Stanford University
Lau, Saimai	Stanford University
Mayalu, Michaelle	Stanford
Keywords: Systems biology, Cellular dynamics, Biomedical Abstract: Mathematical models of chronic inflammatory diseases such as psoriasis often capture biological detail but overlook the regulatory feedback structures that sustain pathological persistence and shape therapeutic response. We develop a predictive multi-scale framework that links systemic immune–skin interactions with gut microbiome feedback, extending beyond conventional psoriasis models that focus narrowly on local cytokine pathways. This formulation reveals that disease trajectories may be governed by a small set of nonlinear motifs that generate multistability-consistent with observed remission, relapse, and residual inflammation. Sensitivity and bifurcation analyses identify keratinocyte decay and probiotic-associated microbial growth as critical parameters for steering the system away from "diseased" equilibria. Leveraging these insights, we design a hybrid intervention strategy that combines impulsive phototherapy with continuous probiotic supplementation. Numerical solutions show that this approach may reduce treatment burden, accelerate remission, and be adaptable to heterogeneous initial conditions, highlighting opportunities for tailoring therapy to patient-specific conditions. Beyond psoriasis, the framework establishes a scalable foundation for analyzing coupled microbial–immune systems and for designing therapeutic strategies that exploit underlying feedback structures.

13:45-14:00, Paper WeB17.2	Add to My Program
Angular Momentum Linear Inverted Pendulum Model for Bipedal Walkers under Foot Slip: Analysis and Experiments (I)

Huang, Xinyan	Rutgers University
Chen, Kuo	Motional AD, LLC
Chen, Xunjie	Rutgers, the State University of New Jersey
Yi, Jingang	Rutgers University
Keywords: Biomedical, Modeling, Robotics Abstract: Maintaining balance during unexpected foot slip is a challenging task for bipedal locomotion. Simplified motion models from linear inverted pendulum (LIP) have demonstrated as an effective tool for recovery gait planning. However, the existing used LIP models often neglect whole-body angular momentum for slip-and-fall recovery design. We propose to use an angular-momentum LIP with flywheel (ALIP-F) model to incorporate two control inputs, center of pressure (CoP) and whole-body angular momentum, into balance recovery design. It is shown that the angular momentum provides a comparable balance recoverable actuation as the CoP control. The model analysis and comparison explain and quantify the use of angular momentum in bipedal balance recovery under foot slip. We further conduct human slip-and-fall experiments to validate the model analysis and properties. This work provides modeling and analysis methods for understanding slip recovery and designing robust biped balance recovery control.

14:00-14:15, Paper WeB17.3	Add to My Program
A Model of Cell Size Control in Proliferating Human Cells (I)

Rezaee, Sayeh	University of Delaware
Nieto, Cesar	University of Delaware
Singh, Abhyudai	University of Delaware
Keywords: Biological systems, Stochastic systems, Modeling Abstract: Regulation of cell size requires coordination between growth and division to maintain stable size distributions despite intrinsic stochastic fluctuations. Experiments have established the adder principle as a robust mechanism of size homeostasis, where cells add a nearly constant amount of size each cycle, independent of birth size. Theoretical studies suggest that this behavior emerges when cell cycle progression is coupled with cell growth. However, human cells may exhibit growth dynamics that deviate from the classical exponential assumption. Motivated by this, a stochastic framework for size regulation in human cells has been developed by incorporating a Hill-type growth law into the adder-based model to capture growth saturation. To represent the sequential nature of division, a stochastic multi-step model is implemented where cells progress through internal regulatory stages before dividing. This framework yields exact analytical expressions for statistical moments of the steady-state cell size distribution. Results show that stronger growth saturation increases the mean cell size while slightly reducing fluctuations compared to exponential growth. Importantly, the adder property is preserved, indicating that the reduced variability is a consequence of the growth law rather than simple scaling with mean size. Finally, we show how growth saturation affects variability at the population level during stochastic clonal proliferation.

14:15-14:30, Paper WeB17.4	Add to My Program
Static Friction Modeling and Compensation for Improved Force Tracking in Pneumatic Cylinders Using Sliding Mode Control

Yacoub, Ammar	Southern Methodist University
Richer, Edmond	SMU
Hurmuzlu, Yildirim	Southern Methodist Univ
Keywords: Biomedical, Fluid power control, Variable-structure/sliding-mode control Abstract: Static friction, also known as stiction, is a primary source of tracking error in pneumatic cylinders, which significantly limits their use in high-precision force control tasks. This paper presents a detailed experimental study of the static friction behavior of a small pneumatic cylinder, examining how the friction force depends on piston position and supply pressure. Based on these findings, a new static friction model that considers pressure and position is developed to accurately represent the stiction phenomenon. Additionally, a Sliding Mode Controller (SMC) is designed and implemented to produce a desired output force. The SMC's performance is evaluated experimentally both with and without the proposed friction compensation model. Results show that the SMC with the integrated compensation reduces steady-state force tracking errors compared to the uncompensated controller.

14:30-14:45, Paper WeB17.5	Add to My Program
Adaptive Chemotherapy Control under Tumor Heterogeneity Via Reinforcement Learning

Kidane, Bereket Sitotaw	The University of Texas at Arlington
Motayed, Md Samiul Haque	The University of Texas at Arlington
Wang, Shuo	University of Texas at Arlington
Keywords: Reinforcement learning, Optimal control, Biological systems Abstract: Designing effective chemotherapy regimens is hindered by tumor heterogeneity and drug resistance, which complicate the deployment of patient-specific model-based optimal control across diverse populations. We develop and compare closed-loop deep reinforcement learning (DRL) dosing policies with continuous (TD3) and discrete (DQN) action spaces trained on a high-dimensional heterogeneous tumor model. The DRL policies are benchmarked against a Pontryagin's Maximum Principle (PMP)-derived open-loop benchmark. We assess generalization under parametric heterogeneity using a 100-patient virtual cohort with 10% uniform perturbations in growth and drug-sensitivity parameters. Across this cohort, TD3 achieves higher average tumor reduction, while DQN yields tighter inter-patient dosing consistency, revealing a clear efficacy-consistency trade-off in this study. Our simulations assume full observation of all tumor subpopulations; translation to sparse and noisy clinical measurements will require partial-observability formulations and/or state estimation. Overall, the results show that simulation-trained DRL can learn state-dependent feedback dosing policies that complement open-loop optimal control benchmarks.

14:45-15:00, Paper WeB17.6	Add to My Program
Fractional-Order Neural Networks for Data-Driven Model Predictive Control of an Artificial Pancreas

Baldisseri, Federico	Sapienza University of Rome
Wrona, Andrea	La Sapienza
Menegatti, Danilo	DIAG, Università Degli Studi Di Roma La Sapienz
Delli Priscoli, Francesco	Università Di Roma
Koledin, Nebojsa	University of Messina
Caponetto, Riccardo	University of Messina
Patanè, Luca	University of Messina
Keywords: Biomedical, Identification for control, Neural networks Abstract: Accurate prediction of blood glucose trajectories is essential for safe automated insulin delivery and clinical decision support in type 1 diabetes. Yet existing data-driven methods based on historical patient data perform poorly in predicting long-term outcomes. This work presents a fractional-order neural network (FONN) framework for the identification and control of blood glucose dynamics in patients affected by type~1 diabetes. The proposed approach integrates Grünwald–Letnikov operators into the neural network architecture, enabling the inclusion of long-term memory effects into the learning process. In open-loop system identification, the FONN is benchmarked against its integer-order ablation (IONN), and a Long Short-Term Memory (LSTM) model. On a widely used in-silico simulator, LSTM attains the best short-horizon forecasts, whereas FONN yields higher accuracy as the prediction horizon increases. Closed-loop evaluations in the context of data-driven model predictive control confirm the advantages of the FONN compared to a linear model-based formulation and the LSTM approach itself, with improved performance in terms of time in normoglycemia and euglycemia.


WeB18 Regular Session, Churchill A2	Add to My Program
Constrained Control II

Chair: Li, Huayi	University of Kentucky
Co-Chair: Hosseinzadeh, Mehdi	Washington State University

13:30-13:45, Paper WeB18.1	Add to My Program
Safety-Critical Control with Bounded Inputs: A Closed-Form Solution for Backup Control Barrier Functions

van Wijk, David E. J.	California Institute of Technology
Das, Ersin	Illinois Institute of Technology
Molnar, Tamas G.	Wichita State University
Ames, Aaron D.	California Institute of Technology
Burdick, Joel W.	California Inst. of Tech
Keywords: Constrained control, Lyapunov methods, Autonomous systems Abstract: Verifying the safety of controllers is critical for many applications, but is especially challenging for systems with bounded inputs. Backup control barrier functions (bCBFs) offer a structured approach to synthesizing safe controllers that are guaranteed to satisfy input bounds by leveraging the knowledge of a backup controller. While powerful, bCBFs require solving a high-dimensional quadratic program at run time, which may be too costly for computationally-constrained systems such as aerospace vehicles. We propose an approach that optimally interpolates between a nominal controller and the backup controller, and we derive the solution to this optimization problem in closed form. We prove that this closed-form controller is guaranteed to be safe while obeying input bounds. We demonstrate the effectiveness of the approach on a double integrator and a nonlinear fixed-wing aircraft example.

13:45-14:00, Paper WeB18.2	Add to My Program
From Vertices to Convex Hulls: Certifying Set-Wise Compatibility for CBF Constraints

Mousavi, Shima Sadat	California Institute of Technology (Caltech)
Tan, Xiao	Beihang University
Ames, Aaron D.	California Institute of Technology
Keywords: Constrained control, Optimization Abstract: This paper develops certificates that propagate compatibility of multiple control barrier function (CBF) constraints from sampled vertices to their convex hull. Under mild concavity and affinity assumptions, we present three sufficient feasibility conditions under which feasible inputs over the convex hull can be obtained per coordinate, with a common input, or via convex blending. We also describe the associated computational methods, based on interval intersections or an offline linear program (LP). Beyond certifying compatibility, we give conditions under which the quadratic-program (QP) safety filter is affine in the state. This enables explicit implementations via convex combinations of vertex-feasible inputs. Case studies illustrate the results.

14:00-14:15, Paper WeB18.3	Add to My Program
Neural Explicit Reference Governor for Safe Control of Resource-Constrained Autonomous Systems in Dynamic Environments

Momani, Mu'taz	Washington State University
Hosseinzadeh, Mehdi	Washington State University
Keywords: Constrained control, Autonomous systems, Machine learning Abstract: The explicit reference governor is a systematic approach for ensuring safety constraint satisfaction in pre-stabilized systems. Its core principle is to adjust the applied reference by solving an optimization problem in real time, a task that becomes computationally demanding when the safe set is non-convex and time-varying, particularly for autonomous systems with limited computational resources. To address this challenge, this paper proposes leveraging neural networks to approximate the solution of such complex optimization problems. A complementary mechanism is introduced to guarantee constraint satisfaction despite approximation errors in the network outputs. Under reasonable assumptions on the rate of change of the constraints and system dynamics, both constraint satisfaction and convergence properties are rigorously established. The effectiveness of the proposed scheme is demonstrated experimentally on a drone navigation task, where the admissible flight region is non-convex, time-varying, and contains a moving obstacle represented by a human.

14:15-14:30, Paper WeB18.4	Add to My Program
CPU and GPU-Based Parallelization of the Robust Reference Governor

Ossareh, Hamid	University of Vermont
Shayne, William	University of Vermont
Chevalier, Samuel	MIT
Keywords: Constrained control, Predictive control for nonlinear systems, Robust control Abstract: Constraint management is a central challenge in modern control systems. A solution is the Reference Governor (RG), which is an add-on strategy to pre-stabilized feedback control systems to enforce state and input constraints by shaping the reference command. While robust formulations of RG exist for linear systems, their extension to nonlinear systems is often computationally intractable. This paper develops a scenario-based robust RG formulation for nonlinear systems and investigates its parallel implementation on multi-core CPUs and CUDA-enabled GPUs. We analyze the computational structure of the algorithm, identify parallelization opportunities, and implement the resulting schemes on modern parallel hardware. Benchmarking on a nonlinear hydrogen fuel cell model demonstrates up to three orders of magnitude speed-up as compared to sequential implementations.

14:30-14:45, Paper WeB18.5	Add to My Program
A Less-Conservative Reference Governor Strategy for Safe Command Tracking of Nonlinear Systems

Li, Huayi	University of Kentucky
Keywords: Constrained control, Predictive control for nonlinear systems Abstract: Disturbance rejection reference governors based on linear models can enforce constraints for command tracking of nonlinear systems by considering the mismatch between the linear prediction model and the plant dynamics as a fictitious disturbance. Strategies to implement a linear model-based reference governor on a nonlinear system often use a constant error set for the mismatch, which may lead to a large safety margin and thus conservative performance. Solutions proposed in the literature often trade off the advantages and proven properties of the conventional design, such as recursive feasibility and computational effectiveness. To provide a balanced treatment, this paper proposes exploiting error sets that converge to the singleton of the origin, achieved by enforcing limits to the change rate of the modified reference. We formulate the proposed reference governor as a quadratic programming command governor problem with linear inequality constraints to contain the computational footprint and prove that the problem is recursively feasible. Numerical simulation results are reported.

14:45-15:00, Paper WeB18.6	Add to My Program
Connectivity-Preserving Multi-Agent Area Coverage Via Density-Driven Optimal Control (D2OC)

Lee, Kooktae	New Mexico Tech
Brook, Ethan	New Mexico Tech
Keywords: Constrained control, Cooperative control, Decentralized control Abstract: Multi-agent systems are widely used for area coverage tasks in applications such as search-and-rescue, environmental monitoring, and precision agriculture. Achieving non-uniform coverage, where certain regions are prioritized, requires coordinating agents while accounting for dynamic and communication constraints. Existing density-driven methods effectively distribute agents according to a reference density but typically do not guarantee connectivity, which can lead to disconnected agents and degraded coverage in practical deployments. This letter presents a connectivity-preserving approach within the Density-Driven Optimal Control (D2OC) framework. The coverage problem, expressed via the Wasserstein distance between agent distributions and a reference density, is formulated as a quadratic program. Communication constraints are incorporated through a smooth penalty function, ensuring strict convexity and global optimality while naturally maintaining inter-agent connectivity without rigid formations. Simulation results demonstrate that the proposed method effectively keeps agents within communication range, improving coverage quality and convergence speed compared to methods without explicit connectivity enforcement.


WeB19 Regular Session, Churchill B1	Add to My Program
Optimal Control I

Chair: Stein, Adrian	Louisiana State University
Co-Chair: Ghorbanpour, Amin	Duquesne University

13:30-13:45, Paper WeB19.1	Add to My Program
A Geometric Optimal Control Algorithm for Navigation through Heterogeneous Regions

Oradiambalam Sachidanandam, Sarjana	University of Maryland
Diaz-Mercado, Yancy	University of Maryland
Keywords: Optimal control, Autonomous robots, Robotics Abstract: Autonomous robots often require the ability to autonomously navigate in heterogeneous regions. The most efficient and the safest path, i.e., the least cost path in these scenarios, may no longer be the same as the shortest path. Optimal control approaches can allow incorporating these considerations into the controller design. However, they can be computationally costly to solve. In this paper, we propose a navigation algorithm that integrates geometrical approaches with optimal control to enable efficient robot navigation across heterogeneous terrains. The work introduces a system representation that exploits the underlying geometrical properties of the environment and the robot dynamics to formulate the optimal control problem. Through simulations, we demonstrate that this representation effectively captures system behavior and yields control outcomes consistent with theoretical expectations. Numerical results validate the effectiveness of the proposed approach: the model captures key aspects of system behavior, and drives the agent, consistently, toward low-cost regions. When combined with a shooting method approach, the model facilitates solving the optimal control problem, effectively driving the agent to any desired goal location in the domain.

13:45-14:00, Paper WeB19.2	Add to My Program
Online Intention Prediction Via Control-Informed Learning

Zhou, Tianyu	Purdue University
Liang, Zihao	Purdue University
Lu, Zehui	Independent Researcher
Mou, Shaoshuai	Purdue University
Keywords: Optimal control, Autonomous systems, Estimation Abstract: This paper presents an online intention prediction framework for estimating the goal state of autonomous systems in real time, even when intention is time-varying, and system dynamics or objectives include unknown parameters. The problem is formulated as an inverse optimal control / inverse reinforcement learning task, with the intention treated as a parameter in the objective. A shifting horizon strategy discounts outdated information, while online control-informed learning enables efficient gradient computation and online parameter updates. Simulations under varying noise levels and hardware experiments on a quadrotor drone demonstrate that the proposed approach achieves accurate, adaptive intention prediction in complex environments

14:00-14:15, Paper WeB19.3	Add to My Program
Differential Flatness Based Path Planning Around Obstacles of a 2D Gantry Crane

Mountain, Eric	University at Buffalo
Stein, Adrian	Louisiana State University
Maxime, Jesse	University at Buffalo
Singh, Tarunraj	State Univ. of New York at Buffalo
Keywords: Optimal control, Control applications, Flexible structures Abstract: This paper presents a method of obstacle avoidance for a nonlinear gantry crane model using differential flatness. The crane motion is two dimensional, allowing for lateral motion of the gantry, and hoisting motion of the payload. A unique method of synthesizing the obstacle avoiding trajectory is presented by defining a safe ``tube" for the point mass to pass through. Bernstein polynomials are used to parameterize the flat outputs. The point mass trajectory is discretized, and linear programming is utilized to solve for the polynomial coefficients that ensure the trajectory is within the tube. Simulations are generated to show the point mass trajectory and states over time, and experimental data is captured to validate these simulations. Furthermore, experimental data of a gantry crane following a rectangular motion profile is captured. This is used as a reference to illustrate the profound improvement in the vibration minimization of the flat trajectory.

14:15-14:30, Paper WeB19.4	Add to My Program
Output-Feedback Optimal Control of Heterogeneous Quadrotor Swarms with Communication Delays

Soleimani, Ehsan	Missouri University of Science and Technology
Jagannathan, Sarangapani	Missouri Univ of Science & Tech
Keywords: Optimal control, Neural networks, Aerospace Abstract: This paper presents a unified framework for optimal output-feedback control of heterogeneous quadrotor unmanned aerial vehicles (QUAVs) operating in coordinated formations under communication delays, without requiring exact system dynamics. To address partial state observability, a multi-layer neural network (MNN) observer is developed to precisely estimate unmeasured states. Reinforcement learning (RL) is employed to achieve optimal control using an MNN critic that ensures adaptability. Together, the MNN observer and RL relax the need for explicit system dynamics, enabling the framework to be applied to heterogeneous QUAVs with differing masses and parameters. A Lyapunov–Krasovskii functional is incorporated into the learning process to compensate for communication delays, which arise either in transmitting commands from the ground station to the leader or in relaying leader information to the followers. Such delays, as a non-stationary source, can otherwise cause instability and degrade formation performance. A three-dimensional leader–follower formation strategy expressed in spherical coordinates enables efficient maneuvering. Theoretical analysis establishes closed-loop stability, and simulation studies validate the effectiveness of the proposed framework in ensuring optimal, stable, and adaptive formation control of heterogeneous QUAVs under communication delays.

14:30-14:45, Paper WeB19.5	Add to My Program
Model Predictive Control for Energy-Optimal Regenerative Braking in Autonomous Electric Vehicle

Ghorbanpour, Amin	Duquesne University
Baldwin, Stephen	Duquesne University
Keywords: Optimal control, Control applications, Differential-algebraic systems Abstract: This paper models and optimizes the braking phase of an autonomous electric vehicle (AEV) with the goal of maximizing regenerative energy while maintaining tire-road safety and passenger comfort. The longitudinal drivetrain dynamics are formulated as an index-1 differential-algebraic equation (DAE), and its ODE reduction is used to pose a constrained optimal control problem. Wheel Slip ratios are regularized for low speed and penalized with a hinge-squared (ReLU) term to discourage excessive slip. A nonlinear model predictive control (NMPC) scheme is then developed to compute optimal braking actions while respecting key physical and safety constraints. These include limits on the maximum practical braking force, bounds on passenger comfort expressed through limits on longitudinal acceleration, and a no-late-acceleration condition that prevents acceleration during the braking maneuver. The resulting optimal control problem is discretized using a direct multiple-shooting method that is solved using the CasADi optimization framework. We establish the existence of optimal solutions for the continuous DAE-based optimal control problem under standard index-1 regularity and compactness assumptions, and we note the per-step existence of minimizers for the discretized NMPC problems. Simulation studies illustrate that the controller harvests energy while keeping slip within prescribed bounds.

14:45-15:00, Paper WeB19.6	Add to My Program
Optimal Path Planning for Wheel Loader Automation Via Hybrid Physics-Informed Data-Driven Soil-Tool Interaction Modeling

Abdolmohammadi, Armin	University of California, Davis
Mojahed Baghbadorani, Navid	University of California, Davis
Ravani, Bahram	University of California, Davis
Nazari, Shima	University of California, Davis
Keywords: Optimal control, Predictive control for nonlinear systems, Nonlinear systems identification Abstract: Earthmoving operations with wheel loaders require substantial power and incur high operational costs. This letter presents an efficient automation framework based on a physics-informed, data-driven soil-tool interaction model. A reduced-order multi-step parameter estimation method, guided by the Fundamental Earthmoving Equation (FEE), is deployed for excavation force estimation. An optimal control problem is then formulated to compute energy-efficient bucket trajectories using soil parameters identified in the previous digging cycle. The results are validated using Algoryx Dynamics physics-based digital model of a wheel loader. It is shown that up to 40% energy saving is possible during the excavation phase compared to typical operator executed paths. Furthermore, the total computation time is comparable to a single digging cycle, highlighting the framework’s potential for real-time, energy-optimized wheel loader automation.


WeB20 Invited Session, Churchill B2	Add to My Program
Set-Based Methods in Dynamic Systems and Control II

Chair: Jain, Neera	Purdue University
Co-Chair: Ruths, Justin	University of Texas at Dallas
Organizer: Koeln, Justin	University of Texas at Dallas
Organizer: Pangborn, Herschel	The Pennsylvania State University
Organizer: Jain, Neera	Purdue University
Organizer: Ruths, Justin	University of Texas at Dallas

13:30-13:45, Paper WeB20.1	Add to My Program
Efficient Norm-Based Reachable Sets Via Iterative Dynamic Programming (I)

Harapanahalli, Akash	Georgia Institute of Technology
Coogan, Samuel	Georgia Institute of Technology
Keywords: Formal verification/synthesis, Optimal control, Optimization Abstract: In this work, we present a numerical optimal control framework for reachable set computation using normotopes, a new set representation as a norm ball with a shaping matrix. In reachable set computations, we expect to continuously vary the shape matrix along trajectories. Incorporating the shape dynamics as an input, we build a controlled embedding system using a linear differential inclusion bounding the dynamics of the system, where a single forward simulation of this embedding system always provides an overapproximating reachable set of the system, no matter the choice of hypercontrol. By iteratively solving a linear quadratic approximation of the nonlinear optimal hypercontrol problem, we synthesize less conservative final reachable sets, providing a natural tradeoff between runtime and accuracy. Terminating our algorithm at any point always returns a valid overapproximating reachable set.

13:45-14:00, Paper WeB20.2	Add to My Program
Exact Representation Complexity Reduction for Constrained Zonotopes with Applications to Dynamic Systems and Control (I)

Dulal, Aayush	University of Texas Dallas
Koeln, Justin	University of Texas at Dallas
Keywords: Computational methods, Numerical algorithms Abstract: Zonotopes and constrained zonotopes have become one of the most widely used set representations for the analysis and design of dynamic systems and their controllers. While a linear increase in set representation complexity of zonotopic sets can represent an exponential increase in set complexity in terms of vertices and facets, practical application of set-based methods is still limited by the effects of set representation complexity growth with repeated set operations. This paper presents the characterization of several sources of redundancy in zonotope and constrained zonotope set representations and algorithms for representation complexity reduction while preserving the set. First, formal characterizations of minimal, or irredundant, zonotopic representations and criteria for redundancy and representation complexity reduction are provided. Then, a proposed algorithm for redundancy detection and removal is presented and demonstrated to provide a practical balance between computational costs and the achieved reduction for several numerical examples, including robust controllable sets and sets produced by the domain partitioning of a ReLU neural network.

14:00-14:15, Paper WeB20.3	Add to My Program
Computationally Efficient State and Model Estimation Via Interval Observers for Partially Unknown Systems (I)

Khajenejad, Mohammad	The University of Tulsa
Jin, Zeyuan	Arizona State University
Keywords: Estimation, Learning, Observers for nonlinear systems Abstract: This paper addresses the synthesis of interval observers for partially unknown nonlinear systems subject to bounded noise, aiming to simultaneously estimate system states and learn a model of the unknown dynamics. Our approach leverages Jacobian sign-stable (JSS) decompositions, tight decomposition functions for nonlinear systems, and a data-driven over-approximation framework to construct interval estimates that provably enclose the true augmented states. By recursively computing tight and tractable bounds for the unknown dynamics based on current and past interval framers, we systematically integrate these bounds into the observer design. Additionally, we formulate semi-definite programs (SDP) for observer gain synthesis, ensuring input-to-state stability and optimality of the proposed framework. Finally, simulation results demonstrate the computational efficiency of our approach compared to a method previously proposed by the authors.

14:15-14:30, Paper WeB20.4	Add to My Program
Enhanced Set-Based State Estimation Via Dependency Preservation (I)

Wagner, Jonas	University of Texas at Dallas
Ruths, Justin	University of Texas at Dallas
Keywords: Uncertain systems, Observers for Linear systems, Estimation Abstract: This paper introduces a dependency-preserving set-based observer which can improve state estimation by accounting for correlation which is regularly ignored by traditional set-based observers. A proposed state estimation method is motivated using a DC motor example where the measurement noise depends on the system noise. Further applications where correlation arises are discussed, e.g., using autoregressive models or closed-loop feedback within reachability forecasting. Throughout the paper zonotopic sets are used to illustrate the dependency preservation within the parameterized factor space, as well as to produce the numerical results.

14:30-14:45, Paper WeB20.5	Add to My Program
Improved Systematic Interval Observers for Bounded-Error LTI Systems (I)

Gedefaw, Elisabeth	University of Tulsa
Mareddu, Siva Rohit	University of Tulsa
Khajenejad, Mohammad	The University of Tulsa
Keywords: Estimation, Observers for Linear systems, Uncertain systems Abstract: This paper presents an improved interval observer design for linear time-invariant (LTI) continuous-time (CT) and discrete-time (DT) systems with bounded uncertainties by integrating a systematic coordinate transformation approach with a multiple-gain structure. The proposed method relaxes potentially restrictive Metzler/positivity assumptions typically required in coordinate transformation-based designs. To further reduce conservatism, particularly in approaches that rely solely on solving the Sylvester equation for the transformation, the observer gains are optimized by minimizing the L1-norm of the error dynamics. The effectiveness of the proposed interval observer is demonstrated through several DT and CT examples, with comparisons against two benchmark designs.

14:45-15:00, Paper WeB20.6	Add to My Program
A Reachability-Oriented Approach to Linear System Identification under Unknown Bounded Noise (I)

Jeevanandam, Sibibalan	Purdue University
Jain, Neera	Purdue University
Keywords: Identification for control, Linear systems, Uncertain systems Abstract: This paper presents a reachability-oriented approach to system identification for linear time-invariant (LTI) systems subject to unknown‑but‑bounded noise. Unlike traditional methods that assume prior knowledge of the noise bounds or distribution, the proposed method jointly estimates the system matrices and the smallest convex noise set that guarantees containment of the observed trajectories by the predicted reachable sets. Our main result shows that, for any set-size measure that is monotone with respect to inclusion and is invariant for a tight convex over-approximating set, the noise set need not be a decision variable. Instead, an optimal noise set is analytically recoverable from the optimal system matrices alone, reducing identification to an optimization solely over the system matrices. We also establish asymptotic convergence of the parameter estimates under i.i.d., bounded noise with full support and under persistent excitation and show that all system matrices converge to their true values when the noise set-size is measured by the p-norm of its axis-aligned widths, for any finite p. Monte Carlo experiments under uniform, truncated Gaussian, and truncated exponential noise demonstrate tighter noise sets than those inferred from ordinary least-squares residuals, with comparable parameter bias but slightly lower coverage of test trajectories by the predicted reachable sets, thereby highlighting the tradeoff between set tightness and conservatism.


WeB21 Regular Session, Churchill C1	Add to My Program
Optimization II

Chair: Paternain, Santiago	Rensselaer Polytechnic Institute
Co-Chair: Nash, Austin	Rose-Hulman Institute of Technology

13:30-13:45, Paper WeB21.1	Add to My Program
Policy Optimization in Robust Control: Weak Convexity and Subgradient Methods (I)

Watanabe, Yuto	University of California, San Diego
Liao, Feng-Yi	University of California San Diego
Zheng, Yang	University of California San Diego
Keywords: H-infinity control, Optimization algorithms, Reinforcement learning Abstract: Robust control seeks stabilizing policies that perform reliably under adversarial disturbances, with H-infinity control as a classical formulation. It is known that policy optimization of robust H-infinity control naturally leads to nonsmooth and nonconvex problems. This paper builds on recent advances in nonsmooth optimization to analyze the discrete-time static output-feedback H-infinity control. We show that the H-infinity cost is weakly convex over any convex subset of a sublevel set. This structural property allows us to establish the first non-asymptotic deterministic convergence rate for the subgradient method under suitable assumptions. In addition, we prove a weak Polyak-Lojasiewicz (PL) inequality in the state-feedback case, implying that all stationary points are globally optimal. We finally present a few numerical examples to validate the theoretical results.

13:45-14:00, Paper WeB21.2	Add to My Program
Topology-Control-Based Moving Target Defense against False Data Injection in Distribution Networks

Ding, Lei	Shanghai Jiao Tong University
Long, Chengnian	Shanghai Jiao Tong University
Wu, Jing	Shanghai Jiao Tong University
Keywords: Optimization, Power systems Abstract: False Data Injection (FDI) attack is one of critical threats to the reliability of power systems, where moving target defense (MTD) is an effective proactive detection defense strategy. Existing MTD studies have primarily concentrated on transmission networks through Distributed Flexible AC Transmission System (D-FACTS) devices. However, those methods cannot be extended to distribution networks due to significantly limited D-FACTS deployment. To detect FDI attacks in distribution networks, a topology-control-based moving target defense (TC-MTD) approach is proposed. Unlike D-FACTSbased MTD that alters line impedances, TC-MTD perturbs the system topology through switch operations and the rank of the admittance matrix difference is employed as a performance metric which indicates the extent of perturbation. Furthermore, a heuristic algorithm is developed to reconfigure the system topology, aiming to maximize detection performance and minimize power loss with the fewest switch operations. Case studies conducted on both modified and standard distribution networks validate the effectiveness of TC-MTD.

14:00-14:15, Paper WeB21.3	Add to My Program
Generalized Multi-Constraint Extremum Seeking

Williams, Alan	Los Alamos National Laboratory
Cortes, Jorge	UC San Diego
Scheinker, Alexander	Los Alamos National Lab
Keywords: Optimization algorithms, Adaptive control Abstract: We generalize the Safe Extremum Seeking algorithm to address the minimization of an unknown objective function subject to multiple unknown inequality and equality constraints, relying on recent results of gradient flow systems. These constraints may represent safety or other critical conditions. The proposed ES algorithm functions as a general nonlinear programming tool, offering practical maintenance of all constraints and semiglobal practical asymptotic stability, utilizing a Lyapunov argument on the penalty function and the set-valued Lie derivative. The efficacy of the algorithm is demonstrated on a 2D problem.

14:15-14:30, Paper WeB21.4	Add to My Program
Robust Control Co-Design with Experimental Validation for an Electromechanical System

Nash, Austin	Rose-Hulman Institute of Technology
Morehouse, James	Rose-Hulman Institute of Technology
Keywords: Optimization, Robust control, Predictive control for linear systems Abstract: Control co-design (CCD) broadens the performance envelope for engineering systems by considering the notion of feedback control at the system (plant) design stage. Because CCD is a model-based design and control framework, quantifying and enabling robustness to uncertainty is critically important. While recent research has focused on developing CCD frameworks that incorporate notions of uncertainty, validation has largely been limited to model-based simulation. This work experimentally validates a robust CCD framework with a mass-spring cart system actuated by a controllable drive motor. Separate systems are co-designed to generate physical systems with maximum mass and minimum required control effort, with each system capable of meeting position tracking constraints in the presence of disturbance uncertainty introduced through a separate disturbance motor. The resulting systems are built and tested experimentally with results showing that (1) the experimental system responses to a baseline (planned) disturbance closely match the predictions from the robust CCD optimization process, and (2) the experimental system responses are robust to bounded disturbance uncertainties. This works represents an important step toward bringing robust CCD into engineering practice.

14:30-14:45, Paper WeB21.5	Add to My Program
PAD-TRO: Projection-Augmented Diffusion for Direct Trajectory Optimization

Chen, Jushan	Rensselaer Polytechnic Institute
Paternain, Santiago	Rensselaer Polytechnic Institute
Keywords: Optimization, Robotics, Stochastic systems Abstract: Recently, diffusion models have gained popularity and attention in trajectory optimization due to their capability of modeling multi-modal probability distributions. However, addressing nonlinear equality constraints, i.e, dynamic feasibility, remains a great challenge in diffusion-based trajectory optimization. Recent diffusion-based trajectory optimization frameworks rely on a single-shooting style approach where the denoised control sequence is applied to forward propagate the dynamical system, which cannot explicitly enforce constraints on the states and frequently leads to sub-optimal solutions. In this work, we propose a novel direct trajectory optimization approach via model-based diffusion, which directly generates a sequence of states. To ensure dynamic feasibility, we propose a gradient-free projection mechanism that is incorporated into the reverse diffusion process. Our results show that, compared to a recent state-of-the-art baseline, our approach leads to zero dynamic feasibility error and approximately 4x higher success rate in a quadrotor waypoint navigation scenario involving dense static obstacles.

14:45-15:00, Paper WeB21.6	Add to My Program
A Multi-Rate Feedback Control System to Model and Analyze Distributed Algorithms with Error Feedback

Liang, Liyuan	University of California, Berkeley
Zhang, Xinwei	Amazon.com
Elia, Nicola	University of Minnesota
Hong, Mingyi	University of Minnesota
Keywords: Optimization algorithms, Large-scale systems, Nonlinear output feedback Abstract: Error Feedback (EF)—also known as error compensation—has emerged as a fundamental technique for correcting biases that arise from quantization and compression operations, thereby enabling efficient communication in distributed systems. While numerous algorithms have recently been proposed that combine various implementations of EF with different base distributed algorithms, a unified theoretical framework for understanding these approaches has remained elusive. This paper addresses this gap by leveraging multi-rate feedback control theory to establish a comprehensive foundation for EF mechanisms in distributed optimization. We demonstrate that when both the EF procedure and the distributed optimization process satisfy a set of easily verifiable conditions, their integration yields a convergent algorithm with provable rates. Our theoretical framework not only simplifies the analysis of existing EF-based distributed algorithms but, more significantly, provides a systematic approach for designing new algorithms. This unified perspective bridges the gap between theoretical understanding and practical implementation, advancing the field of distributed optimization.


WeB22 Invited Session, Churchill C2	Add to My Program
Estimation and Control of Distributed Parameter Systems II

Chair: Demetriou, Michael A.	Worcester Polytechnic Institute
Co-Chair: Hu, Weiwei	University of Georgia
Organizer: Demetriou, Michael A.	Worcester Polytechnic Institute
Organizer: Hu, Weiwei	University of Georgia

13:30-13:45, Paper WeB22.1	Add to My Program
Design of Restricted Kalman Filters for Infinite Dimensional Systems with Moving Sensors (I)

Demetriou, Michael A.	Worcester Polytechnic Institute
Hu, Weiwei	University of Georgia
Keywords: Distributed parameter systems, Kalman filtering Abstract: This paper presents conditions under which a Kalman filter designed for a parabolic PDE over a larger spatial domain will have a filter kernel with spatial support over a prescribed subdomain contained in the spatial domain of the PDE. This comes from the implementation of a domain decomposition filter for PDEs and aims at minimizing filter design complexity and reducing computational costs. When an optimal filter designed over the larger spatial domain has a kernel that vanishes outside a specific subdomain, then the resulting domain decomposition filter becomes optimal since the solution to the Kalman filter over the entire spatial domain and that over the smaller subdomain will be identical.

13:45-14:00, Paper WeB22.2	Add to My Program
Boundary Control of Infinite Horizon Semilinear Parabolic Equations (I)

Casas Rentería, Eduardo	Universidad De Cantabria
Kunisch, Karl	University of Graz
Keywords: Distributed parameter systems Abstract: This work concentrates on a class of Neumann optimal control problems for semilinear parabolic equations on an infinite horizon domain Q = Omega times (0,infty) subject to a control constraint of the form alpha le u(x,t) le beta for (x,t) in Gamma times (0,infty), where Gamma is the boundary of Omega. Existence of a solution, first- and second-order optimality conditions are established. Finally, the approximation of the solution by finite horizon control problems is addressed and some error estimates are provided.

14:00-14:15, Paper WeB22.3	Add to My Program
Optimal Control in Poroelastic Systems Via E-Radiality Theory (I)

Bociu, Lorena	North Carolina State University
Alalabi, Ala'	University of Waterloo
Keywords: Optimal control, Distributed parameter systems, Differential-algebraic systems Abstract: Fluid flows through deformable, porous media arise in numerous applications, ranging from geomechanics to biomechanics. These processes are governed by coupled systems of partial differential equations (PDEs) that describe the interaction between fluid transport and mechanical deformation. A widely adopted framework for such systems is Biot's theory of poroelasticity, which leads, under quasi-static assumptions, to a system of coupled parabolic-elliptic PDEs, falling under the umbrella of implicit, degenerate evolution equations. While the existence and regularity theory for such systems is well established, optimal control problems constrained by poroelastic dynamics, especially relevant in biomedical contexts where internal pressure and displacement must be regulated, have only recently been considered. These control problems are challenging due to the degeneracy of the equations, the implicit nature of the coupling, and the structure of the control-to-state map. In this work, we reinterpret the coupled flow-deformation problem as an implicit dynamical system on Hilbert spaces and analyze it using E-radiality theory - a generalization of the classical Hille-Yosida framework for non-explicit evolution equations. This abstract formulation yields new insights into well-posedness theory and enables a unified treatment of linear control problems constrained by degenerate evolution dynamics.

14:15-14:30, Paper WeB22.4	Add to My Program
Discrete-Time Luenberger Observer and Kalman Filter Design for Non-Isothermal Pipeline Systems (I)

Khari, Safa	University of Alberta
Koch, Charles Robert	University of Alberta
Dubljevic, Stevan	University of Alberta
Keywords: Estimation, Kalman filtering, Distributed parameter systems Abstract: This work develops a non-isothermal pipeline flow model, with an emphasis on state estimation through the Kalman filter and Luenberger observer design. A nonlinear infinite-dimensional model is derived for flow velocity, pressure, density, and temperature based on nonlinear coupled hyperbolic partial differential equations (PDEs). The model is linearized around an operational steady state, and the Cayley–Tustin transformation is used for temporal discretization. This approach preserves essential system properties such as controllability, observability, and stabilizability without requiring spatial discretization or model reduction. The resolvent operator is then determined, enabling the formulation of a Kalman filter for the infinite-dimensional pipeline model in the presence of process and measurement noise, and a Luenberger observer for state reconstruction in the noise-free process. The performance of both estimation strategies is validated through numerical simulations, demonstrating their effectiveness for pipeline flow monitoring.

14:30-14:45, Paper WeB22.5	Add to My Program
Distributed Attraction-Repulsion Potential for Multi-Agents Formation Control (I)

Ban, Hemanta	University of Tennessee
Djouadi, Seddik, M.	University of Tennessee
Tomsovic, Kevin	Clemson University
Keywords: Distributed control, Distributed parameter systems, Networked control systems Abstract: In this paper, a distributed multi-agent formation control driven the gradient of the Lennard-Jones potential is analyzed. For a collision-free initial data we prove global well-posedness together with a uniform lower bound on all inter-agent distances, thereby excluding hard collisions. Taking the total energy as a Lyapunov function, LaSalle’s invariance principle shows that every positive limit point is an equilibrium. Since trajectories remain uniformly away from collisions, the energy is analytic along the flow and an argument yields convergence to a single equilibrium modulo translations. Local exponential rate of convergence is determined explicitly. Illustrative numerical examples are presented.

14:45-15:00, Paper WeB22.6	Add to My Program
Reachability Analysis for Design Optimization (I)

Nguyen, Steven	University of California, San Diego
Cortes, Jorge	UC San Diego
Kramer, Boris	University of California San Diego
Keywords: Aerospace, Optimization, Emerging control applications Abstract: We present an approach to approximate reachable sets for linear systems with bounded L-infinity controls in finite time. Our first approach investigates the boundaries of these sets and reveals an exact characterization for single-input, planar systems with real, distinct eigenvalues. The second approach leverages convergence of the Lp-norms to L-infinity and uses Lp-norm reachable sets as an approximation of the L-infinity-norm reachable sets. Our optimal control results yield insights that make computational approximations of the Lp-norm reachable sets more tractable, and yield exact characterizations for L-infinity with the previous assumptions on the system. As an example, we incorporate our reachability analysis into the design optimization of a highly-maneuverable aircraft. Introducing constraints based on reachability allow us to factor physical limitations to desired flight maneuvers into the design process.


WeC03 Tutorial Session, Grand Salon 3	Add to My Program
Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization

Chair: Bullo, Francesco	Univ of California at Santa Barbara
Co-Chair: Montanari, Arthur	Northwestern University
Organizer: Bullo, Francesco	Univ of California at Santa Barbara
Organizer: Krotov, Dmitry	MIT-IBM Watson AI Lab
Organizer: Montanari, Arthur	Northwestern University
Organizer: Motter, Adilson E.	Northwestern University

15:30-17:00, Paper WeC03.1	Add to My Program
Energy-Based Dynamical Models for Neurocomputation, Learning, and Optimization (I)

Montanari, Arthur N.	Northwestern University
Bullo, Francesco	Univ of California at Santa Barbara
Krotov, Dmitry	Independent Researcher
Motter, Adilson E.	Northwestern University
Keywords: Learning, Biologically-inspired methods, Neural networks Abstract: Recent advances at the intersection of control theory, neuroscience, and machine learning have revealed novel mechanisms by which dynamical systems perform computation. These advances encompass a wide range of conceptual, mathematical, and computational ideas, with applications for model learning and training, memory retrieval, data-driven control, and optimization. This tutorial focuses on neuro-inspired approaches to computation that aim to improve scalability, robustness, and energy efficiency across such tasks, bridging the gap between artificial and biological systems. Particular emphasis is placed on energy-based dynamical models that encode information through gradient flows and energy landscapes. We begin by reviewing classical formulations, such as continuous-time Hopfield networks and Boltzmann machines, and then extend the framework to modern developments. These include dense associative memory models for high-capacity storage, oscillator-based networks for large-scale optimization, and proximal-descent dynamics for composite and constrained reconstruction. The tutorial demonstrates how control-theoretic principles can guide the design of next-generation neurocomputing systems, steering the discussion beyond conventional feedforward and backpropagation-based approaches to artificial intelligence.


WeC04 Regular Session, Grand Salon 4	Add to My Program
Spacecraft Control

Chair: Taheri, Ehsan	Auburn University
Co-Chair: Soderlund, Alexander	The Ohio State University

15:30-15:45, Paper WeC04.1	Add to My Program
Comparison of Gain Matrix Parameterizations for Nonlinear Spacecraft Attitude Control

Nurre, Nicholas	Auburn University
Taheri, Ehsan	Auburn University
Keywords: Spacecraft control, Control applications, Optimization Abstract: An important step in designing closed-loop controllers is the tuning of one or more square matrices constrained by definiteness, e.g., penalty or gain matrices. This work considers four different parameterizations of such a matrix: one is a (standard) diagonal matrix and, using spectral decomposition, the other three consider the larger set of "full" ("dense" or "non-diagonal") matrices. Definiteness can be enforced via simple box constraints on the decision variables of each parameterization. We compare the benefits of each parameterization for optimizing the gain matrices of a closed-loop nonlinear spacecraft attitude control law to minimize control effort. Several gradient-free global optimization algorithms are used for tuning and are also compared. Results show that improved performance with respect to control effort can be found by considering the full gain matrix parameterizations. Furthermore, the choice of parameterization for the orthogonal eigenvector matrix, in the full matrix parameterization, can significantly impact the tuning outcome.

15:45-16:00, Paper WeC04.2	Add to My Program
Efficient Stabilization of Hybrid Coulomb Spacecraft Formations Using Control Lyapunov Functions

Tahir, Adam	University of Washington
Keywords: Spacecraft control, Lyapunov methods Abstract: A control allocation algorithm using control Lyapunov functions to determine stabilizing charges and thrusts of hybrid Coulomb spacecraft formations (HCSFs) is presented. The goal is to stabilize a desired configuration while minimizing the thruster actuation and maximizing Coulomb actuation to minimize propellant usage. A proportion of the decrease of the control Lyapunov function is designated for Coulomb actuation and the rest is performed by thrusters. Simulations show that an 85% reduction of propellant compared to using solely thrusters is attainable using the proposed algorithm. It is shown that the best role for thrusters in a HCSF is to provide small corrections that cannot be provided by Coulomb actuation.

16:00-16:15, Paper WeC04.3	Add to My Program
Real-Time Retargeting Using Controllability Boundary for Chandrayaan-3 Lunar Landing

Kumar, Suraj	U R Rao Satellite Center, Indian Space Research Organization
Chakrabarti, Debajyoti	University of California, Los Angeles
Rallapalli, Aditya	UR Rao Satellite Centre
Bharat Kumar, Gvp	U R Rao Satellite Centre , Isro
Ashok Kumar, Kakula	U R Rao Satellite Centre(ursc) , Isro
Keywords: Spacecraft control, Optimization, Machine learning Abstract: This paper presents the real-time retargeting guidance policy developed for the Chandrayaan-3 lunar landing mission. The baseline guidance generates approximate fuel-optimal descent trajectories, while a high-level policy enables safe retargeting to alternate sites when the nominal site becomes infeasible. The retargeting strategy leverages a convex representation of the controllability boundary, allowing rapid feasibility checks and real-time target updates. To the best of the authors’ knowledge, this represents the first application of a data-driven retargeting framework in an operational lunar landing mission. Pre-flight simulations and Chandrayaan-3 flight results validate the effectiveness of the proposed approach.

16:15-16:30, Paper WeC04.4	Add to My Program
Nonlinear MPC for Cislunar Trajectory Correction under Impulsive Maneuver Uncertainties: A Computational Performance Analysis

Yamamoto, Koya	Texas A&M Univeristy
Taheri, Ehsan	Auburn University
Junkins, John L.	Texas A&M University
Keywords: Predictive control for nonlinear systems, Optimization, Spacecraft control Abstract: This work studies nonlinear model predictive control (NMPC) for correcting cislunar transfer trajectories under impulsive maneuver uncertainties. By combining pre-planned impulsive changes with low-thrust NMPC corrections in the CR3BP, the method robustly compensates for execution errors. A maneuver between southern L2 Halo orbits in the Earth-Moon system is considered. A computational comparison of IPOPT and MATLAB's fmincon solvers shows IPOPT is up to 2 times faster. NMPC achieves sub-kilometer tracking errors and a total Delta v tracking cost around 315 m/s, demonstrating an effective and practical guidance for future cislunar missions.

16:30-16:45, Paper WeC04.5	Add to My Program
Model Predictive Control of Collinear Coulomb Spacecraft Formations

Tahir, Adam	University of Washington
Keywords: Spacecraft control, Predictive control for nonlinear systems, Optimal control Abstract: A model predictive control scheme to stabilize desired configurations of collinear Coulomb spacecraft formations is derived in this paper. The nonlinearities of the dynamics with respect to the input make this problem difficult to solve, computationally. It is shown that the nonlinearities in the input lead to a finite horizon optimization problem which is a nonconvex quadratically-constrained quadratic program (QCQP). A convex relaxation of the nonconvex QCQP is therefore derived which can be solved quickly using a convex optimization solver. A simulation of a four spacecraft formation is provided which demonstrates why optimizing over a prediction horizon is a prudent approach to Coulomb spacecraft formation control.

16:45-17:00, Paper WeC04.6	Add to My Program
Stochastic Optimal Control with Weighted Terminal Accuracy under Control-Linear Noise

Amarell, Kristen I.	The University of Colorado Boulder
Scheeres, Daniel J.	The University of Colorado Boulder
McMahon, Jay W.	The University of Colorado Boulder
Keywords: Spacecraft control, Stochastic optimal control, Linear systems Abstract: Space missions involving launch, rendezvous, or orbit insertion often require precise targeting of specific state components, such as position or velocity along preferred directions, while tolerating dispersion in others. This motivates control strategies that differentially penalize deviations across the state. We address this need by minimizing the weighted mean squared deviation from a target state for a stochastic linear system with control-linear noise and noiseless, full-state measurements at specified times. Unlike prior unweighted formulations, this approach enables mission-specific prioritization of state components via reachability theory, promoting feasibility in nonlinear launch-to-rendezvous scenarios. We use an approximation of the controllable set as the terminal weighting matrix, improving rendezvous feasibility and guiding neighboring trajectories about a nonlinear launch profile. A complete solution to this optimization problem is presented, in which the optimal feedback control law is linear in the initial state, the target state, and the sampled measurements. The feedback gain matrices can be computed offline, reducing the onboard computational cost. This paper presents the following innovations to existing methods: the addition of a terminal state weighting matrix to the cost function and the application as a reachability-informed neighboring guidance algorithm about a launch trajectory under nonlinear control.


WeC05 Tutorial Session, Grand Salon 6	Add to My Program
Decision and Control in Sports Analytics

Chair: Archibald, Christopher	Brigham Young University
Co-Chair: Grimsman, David	Brigham Young University
Organizer: Grimsman, David	Brigham Young University
Organizer: Melville, William	Texas Rangers
Organizer: Archibald, Christopher	Brigham Young University
Organizer: Nestler, Scott	University of Florida

15:30-17:00, Paper WeC05.1	Add to My Program
Decision and Control in Sports Analytics: A Tutorial (I)

Grimsman, David	Brigham Young University
Nestler, Scott	University of Florida
Archibald, Christopher	Brigham Young University
Melville, William	Texas Rangers
Keywords: Game theory, Markov processes, Emerging control applications Abstract: This tutorial discusses the intersection between control theory, decision-making, and sports analytics. Its purpose is to motivate researchers to engage with open problems in sports, demonstrate how foundational tools from controls and optimization can be applied to these domains, and provide practical guidance for entering the field. The paper covers: (1) the formulation of sports decision problems as sequential decision-making problems; (2) the evolving landscape of sports analytics data and its implications for research and education; (3) modeling athlete and team decision processes through execution error and uncertainty quantification; and (4) applications of game theory to baseball strategy and managerial decision-making. Together, these sections highlight how well-known methods can be adapted to model, evaluate, and improve strategic behavior in sports. The paper concludes with several open research questions in the area.


WeC06 Regular Session, Grand Salon 7	Add to My Program
Game Theory III

Chair: Anderson, Brendon G.	California Polytechnic State University
Co-Chair: Dayanikli, Gokce	University of Illinois Urbana-Champaign

15:30-15:45, Paper WeC06.1	Add to My Program
Linear-Quadratic Mean Field Games with Multiple Heterogeneous Major Agents

Sharifi, Arman	University of Nevada, Reno
Xu, Hao	University of Nevada, Reno
Keywords: Mean field games, Game theory, Optimal control Abstract: This paper presents a framework for linear-quadratic (LQ) mean-field games with multiple heterogeneous major agents, deriving the coupled Riccati equations that define the Nash equilibrium. A key finding is that system complexity scales polynomially for heterogeneous agents but remains bounded for the special case of homogeneous agents, isolating heterogeneity as the primary driver of computational cost. We also introduce a Complexity-Adjusted Cost (CAC) framework to create a quantitative decision boundary between the tractable MFG equilibrium and the computationally intensive social welfare solution. Simulations validate our complexity analysis and demonstrate the CAC framework's utility for practical system design.

15:45-16:00, Paper WeC06.2	Add to My Program
Choice Paralysis in Evolutionary Games

Anderson, Brendon G.	California Polytechnic State University
Keywords: Game theory, Mean field games Abstract: In this paper, we consider finite-strategy approximations of infinite-strategy evolutionary games. We prove that such approximations converge to the true dynamics over finite-time intervals, under mild regularity conditions which are satisfied by classical examples, e.g., the replicator dynamics. We identify and formalize novel characteristics in evolutionary games: choice mobility, and its complement choice paralysis. Choice mobility is shown to be a key sufficient condition for the long-time limiting behavior of finite-strategy approximations to coincide with that of the true infinite-strategy game. An illustrative example is constructed to showcase how choice paralysis may lead to the infinite-strategy game getting "stuck," even though every finite approximation converges to equilibrium.

16:00-16:15, Paper WeC06.3	Add to My Program
Approximately Solving Continuous-Time Mean Field Games with Finite State Spaces

Eich, Yannick	Technische Universität Darmstadt
Fabian, Christian	Technische Universität Darmstadt
Cui, Kai	Technische Universität Darmstadt
Koeppl, Heinz	Technische Universitat Darmstadt
Keywords: Mean field games, Game theory Abstract: Mean field games (MFGs) offer a powerful framework for modeling large-scale multi-agent systems. This paper addresses MFGs formulated in continuous time with discrete state spaces, where agents' dynamics are governed by continuous-time Markov chains - relevant to applications like population dynamics and queueing networks. While prior research has largely focused on theoretical aspects of continuous-time discrete-state MFGs, efficient computational methods for determining equilibria remain underdeveloped. Inspired by discrete-time approaches, we approximate the classical Nash equilibria by regularization methods, enabling more computationally tractable solution algorithms. Specifically, we define regularized equilibria for continuous-time MFGs and extend the classical fixed-point iteration and fictitious play algorithm to these equilibria. We validate the effectiveness and practicality of our approach via illustrative numerical examples.

16:15-16:30, Paper WeC06.4	Add to My Program
Modeling Global Financial Stability through Policy Design: A Hierarchical Stackelberg Mean Field Game Perspective

Rathod, Prathmesh	University of Illinois at Urbana-Champaign
Dayanikli, Gokce	University of Illinois Urbana-Champaign
Keywords: Mean field games, Optimal control, Optimization Abstract: In this paper, we propose a hierarchical game theoretical model of systemic risk in interconnected banking networks. The model consists of three layers: local banks, central banks, and global regulators such as the International Monetary Fund (IMF). Local banks' model is represented through multi-population MFGs, whose Nash equilibria are characterized by coupled forward–backward stochastic differential equations and reduced to Riccati-type ODE systems. At the intermediate layer, we model the central banks that are interacting with local banks, possibly with each other, and also with the IMF through their policy guidelines. We characterize and analyze the multi-population Nash equilibrium between central banks and local banks. At the top level, we model the problem of IMF that acts as a Stackelberg leader interacting with both central banks and the local banks and optimizing global policies. After we present the theoretical results that include equilibrium characterization and existence and uniqueness results, we give a numerical algorithm and analyze the effects of different IMF objectives and also model parameters such as interaction levels between populations and central banks.

16:30-16:45, Paper WeC06.5	Add to My Program
Convexifying Mean-Field Control: An Occupation-Measure and Frank–Wolfe Approach

Yu, Di	Purdue University
You, Sixiong	Purdue University
Pei, Chaoying	Missouri University of Science and Technology
Keywords: Mean field games, Optimization algorithms, Robotics Abstract: Abstract— Large-scale robotic swarms motivate the use of mean-field control (MFC). Classical partial differential equation (PDE)-based formulations provide a principled framework but can become computationally challenging in higher dimensions, whereas machine learning achieves scalability at the cost of approximation and guarantees. In this work, we establish an optimization-based framework that lifts the MFC problem into the space of occupation measures, resulting in a convex relaxation formulated as an optimization over measures. The resulting problem is solved using a Frank–Wolfe (FW) algo- rithm in the measure space, with each iteration reduced to a tractable optimal control problem. This approach retains the O(1/k) convergence rate of FW, avoids discretization of the state space, and naturally incorporates interaction and safety constraints. Numerical experiments demonstrate agreement with analytic and PDE-based baselines in two dimensions and show that the method scales to three-dimensional environments with multiple obstacles, where standard grid-based PDE solvers become impractical. A full 3D instance with ten obstacles is solved in minutes on a standard workstation, underscoring the practicality and scalability of the proposed framework.


WeC07 Regular Session, Grand Salon 9	Add to My Program
Control Applications II

Chair: Ghassani, Rashad	Schneider Electric
Co-Chair: Allen, Brendon C.	Auburn University

15:30-15:45, Paper WeC07.1	Add to My Program
DC-Link Stability and Rectification Current Observer for AC Drives

Ghousein, Mohammad	ArianeGroup
Ghassani, Rashad	Schneider Electric
Keywords: Electrical machine control Abstract: The article addresses two problems: 1) modeling and stability of the DC-Link in motor drives and 2) observer design of the DC-Link rectification current. In the first part, we propose an original model of the DC-Link dynamics, including a virtual diode modeling the direction of the rectification current. We also study the impact of the equivalent series resistance (ESR) of the DC-Link capacitor on the stability of the DC-Link voltage. In the second part, we propose a time-varying observer to estimate the rectifier current. The observer is of Luenberger type with time-varying gains. We obtain sufficient conditions for the exponential decay of the estimation error towards a bounded interval. We then validate the theoretical results through numerical simulations.

15:45-16:00, Paper WeC07.2	Add to My Program
A Comparison of Triangulation and Multilateration Localization Techniques for Crane Cable Detection

Harkonen, Eemil	Georgia Institute of Technology
Zonnenberg, Alexander	Georgia Institute of Technology
Graham, Monica	Georgia Institute of Technology
Adams, Christopher	Georgia Institute of Technology
Singhose, William	Georgia Institute of Technology
Keywords: Embedded systems, Mechatronics, Mechanical systems/robotics Abstract: Crane payload liftoff operations require precise alignment of a crane's overhead trolley with the payload being lifted to ensure the hoisting cable is vertical. In cases where alignment is not achieved, an off-centered lift occurs, resulting in uncontrolled swinging of the payload. This has the potential to damage the crane's cable and/or payload, the crane itself, or cause serious injury or even the death of a nearby worker. This paper introduces a LiDAR-based small-scale localization solution for cable angle sensing. Cable angle sensing solutions exist, but they are expensive and are designed to work in well controlled environments. This paper discusses the design of an inexpensive cable angle sensing solution that can be deployed in varying environments. The efficacy of the employed multilateration and triangulation localization techniques are shown to achieve a maximum position error of 12mm.

16:00-16:15, Paper WeC07.3	Add to My Program
Feedback-Linearization Control for Psychrometric Testing Chambers

Kakani, Raghav	Purdue University
Shaikh, Adil Manzoor	Purdue University
Rhee, Seungho	Purdue University
Kim, Dohyeon	Purdue University
Kircher, Kevin	Purdue University
Keywords: Feedback linearization, Building and facility automation, PID control Abstract: Psychrometric test facilities – insulated chambers with air-handling units that provide heating, cooling, humidification, and dehumidification – are used to evaluate the performance of heating and cooling equipment. Accurate emulation of indoor and outdoor environments during equipment testing relies on effective psychrometric control. However, the psychrometric system is nonlinear, time-varying, and exhibits strong input–output coupling, limiting the effectiveness of conventional controllers across wide operating ranges. To address these challenges, this study proposes a MIMO nonlinear control strategy using feedback linearization. A simplified physics-based state-space model is employed to derive the feedback-linearization control law, which decouples interactions and compensates for nonlinearities. The control strategy is evaluated using a high-fidelity model calibrated to experimental data from a physical psychrometric chamber. In temperature and relative humidity control simulations, the feedback-linearization controller maintains overshoot within the specified constraint and achieves faster settling times than a well-tuned baseline controller, reducing the duration of a standard dynamic test by 80.2 minutes (22.3%).

16:15-16:30, Paper WeC07.4	Add to My Program
Saturated ARISE Control for Uncertain Lower Limb Hybrid-Exoskeleton Dynamics

Crapet, Joseph	Auburn University
Basyal, Sujata	Auburn University
Mishra, Kislaya	Auburn University
Ting, Jonathan	Auburn University
Allen, Brendon C.	Auburn University
Keywords: Nonlinear output feedback, Robust adaptive control, Biotechnology Abstract: Abstract--- Millions of Americans are affected by neurological conditions (NCs), which can be caused by strokes, Parkinson's disease, and more. NCs can negatively impact individuals by decreasing their muscle mass, motor control, and bone density, often as a result of physical inactivity. This deterioration leads to secondary health conditions such as obesity, cardiovascular disease, among others. There is an ongoing need to develop methods that help those afflicted by NCs to effectively recover. One rehabilitation method is to use a hybrid-exoskeleton, which combines the benefits of robot-assisted therapy with functional electrical stimulation (FES). Due to this combination, hybrid-exoskeletons use motors to assist or resist the motion of the user, ensuring proper performance of the rehabilitative exercises, while also implementing FES to ensure active participation from the user during the exercise, enhancing the overall rehabilitative effectiveness of the device. Although this rehabilitation approach has many benefits, the dynamics of a hybrid-exoskeleton are nonlinear and uncertain in nature. As a result, an ineffective controller design will generate a shaky, uncomfortable, or unstable motion, yielding ineffective rehabilitation. To effectively control a hybrid-exoskeleton for rehabilitation, this paper presents the development of a robust control method that is tailored for this unified and complex rehabilitation system. Specifically, a saturated auxiliary robust integral of the sliding mode (ARISE) control law is developed for the first time for a hybrid-exoskeleton. To validate the performance of the designed control law, a Lyapunov-like stability analysis and a preliminary simulation are performed.

16:30-16:45, Paper WeC07.5	Add to My Program
Integrating Target-Distance Feedback with Bayesian Optimization for Cost-Effective Process Optimization

Singhal, Bharat	Tokyo Electron America, INC
Matsumoto, Masashi	Tokyo Electron America, INC
Shinagawa, Jun	Tokyo Electron America, INC
Keywords: Manufacturing systems, Materials processing, Optimization Abstract: Abstract—In semiconductor manufacturing, as device sizes continue to shrink, each process step must meet increasingly stringent requirements that are challenging to achieve through traditional trial-and-error approaches. This has resulted in a shift toward machine learning methods, such as Bayesian optimization, which has shown promising results in reducing the number of experiments needed to meet process specifications in critical manufacturing steps such as etching, deposition, and lithography. Nevertheless, the performance of Bayesian optimization is sensitive to design parameters—such as acquisition functions and their settings—and suboptimal choices can significantly degrade its effectiveness. To address this, in this paper, we incorporate the distance between current performance and target specifications as feedback to dynamically adjust acquisition function parameters, enabling a strategy that emphasizes exploration initially and transitions to exploitation automatically as current performance approaches the target. Our approach, dynamic Bayesian optimization, reduces the number of experiments required to meet the target specification by balancing exploration and exploitation. We first demonstrate the effectiveness of our methodology on standard benchmark functions commonly used in Bayesian optimization. Then, we develop virtual metrology models to accurately predict the etched amounts of SiO2 and Si3N4 layers and employ these models to simultaneously optimize SiO2 etched amount and its selectivity to Si3N4 for the gaseous-phase etching process using our approach.

16:45-17:00, Paper WeC07.6	Add to My Program
Adaptive DHFD-DSMC Based Control for Five-Phase PMSM

Igbokwe, Daniel	Ampère/Renault Group, Centrale Nantes
Ghanes, Malek	Centrale Nantes
Bodson, Marc	Univ. of Utah
Hamida, Mohamed Assaad	LS2N, Ecole Centrale De Nantes
Messali, Amir	Ampère/Renault Group
Keywords: Control applications, Electrical machine control Abstract: Multiphase machines, which employ more than three phases for electrical energy conversion, have attracted renewed interest due to their enhanced efficiency, fault tolerance, and power density compared to conventional three-phase systems—qualities. These advantages make them well-suited for electric vehicles, renewable energy systems, aerospace, and high-power industrial drives. This paper presents practical refinements to existing control strategies for multiphase motor drives. Specifically, we propose a simplified adaptive discrete hybrid filtering differentiator based on a cube-root approximation of the established hybrid bi-homogeneous differentiator, building on prior work by Levant, Livne, and Jbara. Additionally, we introduce a computationally efficient reaching law that approximates the behavior of the exponential reaching law while significantly reducing online calculation burden. The modified differentiator is integrated with the discrete sliding-mode controller to replace conventional filtered differentiators typically paired with linear controllers in Electric Vehicle (EV) applications, thereby improving robustness without increasing computational load. The combined approach is designed for realtime implementation on resource-constrained embedded processors commonly used in EV drive systems.


WeC08 Invited Session, Grand Salon 10-13	Add to My Program
Multi-Agent Control and Coordination II

Chair: Weintraub, Isaac	Air Force Research Laboratory
Co-Chair: Sinha, Abhinav	The University of Cincinnati

15:30-15:45, Paper WeC08.1	Add to My Program
Strategic Concealment of Environment Representations in Competitive Games (I)

Guan, Yue	Georgia Institute of Technology
Maity, Dipankar	University of North Carolina at Charlotte
Tsiotras, Panagiotis	Georgia Institute of Technology
Keywords: Game theory, Agents-based systems, Autonomous systems Abstract: This letter studies the strategic concealment of environment representations used by players in competitive games. We consider a defense scenario in which one player (the Defender) seeks to infer and exploit the representation used by the other player (the Attacker). The interaction between the two players is modeled as a Bayesian game: the Defender infers the Attacker’s representation from its trajectory and places barriers to counteract, while the Attacker obfuscates its representation type to mislead the Defender. We solve for the Perfect Bayesian Nash Equilibrium via a bilinear program that integrates Bayesian inference, strategic planning, and belief manipulation. Simulations show that purposeful concealment naturally emerges: the Attacker randomizes its trajectory to manipulate the Defender’s belief, inducing suboptimal barrier selections and thereby gaining a strategic advantage.

15:45-16:00, Paper WeC08.2	Add to My Program
Reach-Avoid Games in a Grid World (I)

Berneburg, James	George Mason University
Dorothy, Michael	US Army Research Laboratory
Shishika, Daigo	George Mason University
Keywords: Game theory Abstract: Reach-avoid games model scenarios where an attacker seeks to reach a target region that a defender aims to protect. The challenge compared to pursuit-evasion games is that a defender cannot simply capture an attacker, but is concerned with what points the attacker can reach before it is captured. We consider for the first time, to the best of our knowledge, a reach-avoid game played in a graph environment with simultaneous actions. We solve the game of kind played on an obstacle-free 2-dimensional grid world between a single Attacker and a single Defender. We find Attacker has a winning strategy when it can reach any point in the target before Defender, and Defender has a winning strategy in the complementary case, provided Defender's tagging radius is positive, despite players having equal speeds so Defender cannot guarantee capture. We additionally provide minor results and discussion to connect this to more general problems, such as with multiple defenders and attackers.

16:00-16:15, Paper WeC08.3	Add to My Program
Safe Navigation in the Presence of Range-Limited Pursuers (I)

Chapman, Thomas	Air Force Research Laboratory
Von Moll, Alexander	Air Force Research Laboratory
Weintraub, Isaac	Air Force Research Laboratory
Keywords: Autonomous systems, Aerospace, Intelligent systems Abstract: This paper examines the degree to which an evader seeking a safe and efficient path to a target location can benefit from increasing levels of knowledge regarding one or more range-limited pursuers seeking to intercept it. Unlike previous work, this research considers the time of flight of the pursuers actively attempting interception. It is shown that additional knowledge allows the evader to safely steer closer to the threats, shortening paths without accepting additional risk of capture. A control heuristic is presented, suitable for real-time implementation, which capitalizes on all knowledge available to the evader.

16:15-16:30, Paper WeC08.4	Add to My Program
Target Defense Using a Turret and Mobile Defender Team (I)

Von Moll, Alexander	Air Force Research Laboratory
Maity, Dipankar	University of North Carolina at Charlotte
Pachter, Meir	AFIT/ENG
Shishika, Daigo	George Mason University
Dorothy, Michael	US Army Research Laboratory
Keywords: Game theory, Cooperative control, Autonomous vehicles Abstract: A scenario is considered wherein a stationary, turn constrained agent (Turret) and a mobile agent (Defender) cooperate to protect the former from an adversarial mobile agent (Attacker). The Attacker wishes to reach the Turret prior to getting captured by either the Defender or Turret, if possible. Meanwhile, the Defender and Turret seek to capture the Attacker as far from the Turret as possible. This scenario is formulated as a differential game and solved using a geometric approach. Necessary and sufficient conditions for the Turret-Defender team winning and the Attacker winning are given. In the case of the Turret-Defender team winning equilibrium strategies for the min max terminal distance of the Attacker to the Turret are given. Three cases arise corresponding to solo capture by the Defender, solo capture by the Turret, and capture simultaneously by both Turret and Defender.

16:30-16:45, Paper WeC08.5	Add to My Program
Heterogeneous Pursuit of an Active Target under Sensing Constraints (I)

Surve, Prajakta	Michigan State University
Bopardikar, Shaunak D.	Michigan State University
Von Moll, Alexander	Air Force Research Laboratory
Weintraub, Isaac	Air Force Research Laboratory
Casbeer, David W.	Air Force Research Laboratory
Keywords: Agents-based systems, Autonomous systems, Game theory Abstract: This paper studies a heterogeneous three-agent pursuit-evasion scenario in which a sensor–attacker team attempts to capture an active target capable of changing its heading at fixed time intervals. The sensor has a limited sensing range, and the attacker must intercept the target before it escapes sensing. We formulate this problem as a emph{game of kind} and extend the optimal sensor and attacker strategies from prior work on passive targets to the active target setting. The sensor updates its heading in each interval by assuming that the target will keep its heading fixed for the rest of the engagement, while the attacker uses an Apollonius circle-based approach for minimum-time interception and updating its heading corresponding to the target heading in every interval. We show that the conditions for capture or escape of a passive target also extend to the case of an active target. In particular, if the speed of the active target is less than a critical value identified for passive targets in our prior work, then capture is guaranteed.

16:45-17:00, Paper WeC08.6	Add to My Program
Decentralized CBF-Based Safety Filters for Collision Avoidance of Cooperative Missile Systems with Input Constraints

Autenrieb, Johannes	German Aerospace Center (DLR)
Spiller, Mark	German Aerospace Center (DLR)
Keywords: Aerospace, Agents-based systems, Flight control Abstract: This paper presents a decentralized safety filter for collision avoidance in multi-agent aerospace interception scenarios. The approach leverages robust control barrier functions (RCBFs) to guarantee forward invariance of safe sets under bounded inputs and high-relative-degree dynamics. Each effector executes its nominal cooperative guidance command, while a local quadratic program (QP) modifies the input only when necessary. Event-triggered activation based on range and zero-effort miss (ZEM) criteria ensures scalability by restricting active constraints to relevant neighbors. To ensure feasibility under multiple simultaneously active constraints, a slack-variable relaxation scheme is introduced that prioritizes critical agents in a Pareto-optimal manner. Simulation results in many-on-many interception scenarios demonstrate that the proposed framework maintains collision-free operation with minimal deviation from nominal guidance, providing a computationally efficient and scalable solution for safety-critical multi-agent aerospace systems.


WeC09 Invited Session, Grand Salon 12	Add to My Program
Dynamics and Behavior of Energy Storage Systems

Chair: Filgueira da Silva, Samuel	The Ohio State University
Co-Chair: Moura, Scott	University of California, Berkeley
Organizer: Docimo, Donald	Texas Tech University
Organizer: Soudbakhsh, Damoon	Temple University
Organizer: Zhang, Dong	University of Oklahoma
Organizer: Song, Ziyou	University of Michigan, Ann Arbor
Organizer: Araujo Xavier, Marcelo	Amazon Leo
Organizer: Moura, Scott	University of California, Berkeley
Organizer: Lin, Xinfan	University of California, Davis
Organizer: Cui, Xiaofan	University of California, Los Angeles
Organizer: Filgueira da Silva, Samuel	The Ohio State University
Organizer: Tang, Shuxia	Texas Tech University
Organizer: Dey, Satadru	The Pennsylvania State University

15:30-15:45, Paper WeC09.1	Add to My Program
Design and Modeling of a Heat-Exchange Sleeve for Enhanced Thermal Safety of Lithium-Ion Batteries (I)

Ferreira, Patryck	Texas Tech University
Tang, Shuxia	Texas Tech University
Keywords: Modeling, Energy systems, Simulation Abstract: This paper presents the design and modeling of a phase change material (PCM) sleeve enclosed in a Polyethylene Terephthalate Glycol (PETG) case for the thermal management of lithium-ion batteries. The model introduces five states: PETG inner wall, PCM solid, PCM liquid, phase fraction, and PETG outer surface, ensuring accurate representation of conduction, latent heat absorption, and convective heat loss. The formulation is validated experimentally using a 26650 Li-ion cell cycled at 11 A under controlled laboratory conditions. Results show satisfactory agreement between model predictions and experimental measurements, with root-mean-square errors of 0.56 ◦ C at the PETG inner wall and 1.44 ◦ C in the PCM domain. The PETG+PCM sleeve reduces case temperature rise by approximately 5 ◦ C and attenuates oscillation amplitude, indicating the buffering role of PCM.

15:45-16:00, Paper WeC09.2	Add to My Program
Unbalancing Second-Life Battery Pack with Bidirectional DC/DC Converter for Extending Life (I)

Xu, Zhicheng	University of Oklahoma
Kajiura, Yuichi	University of Oklahoma
Choo, Wonoo	University of Oklahoma
Espin, Jorge	University of Oklahoma
Zhang, Dong	University of Oklahoma
Keywords: Energy systems, Power electronics, Modeling Abstract: This paper proposes a novel battery pack state of charge (SoC) unbalancing strategy to extend the lifespan of series connected heterogeneous second-life batteries with parallel connected, isolated bidirectional boost converters. By utilizing a balancing bus via converters, the system is capable of regulating the current of individual battery. This feature deliberately unbalances the SoC of each battery module based on its estimated state of health (SoH) and an ampere-hour throughput degradation model. This strategy ensures that all battery modules reach the end of life (EoL) simultaneously, thereby maximizing the overall lifetime of the pack system. Simulation results demonstrate that the proposed approach can significantly increase the cycle life by approximately 50%, especially in scenarios with substantial SoH heterogeneity, a common condition for second-life battery applications.

16:00-16:15, Paper WeC09.3	Add to My Program
Conformalized Transfer Learning for Li-Ion Battery State of Health Forecasting under Manufacturing and Usage Variability (I)

Filgueira da Silva, Samuel	The Ohio State University
Ozkan, Mehmet	The Ohio State University
El Idrissi, Faissal	The Ohio State University
Canova, Marcello	The Ohio State University
Keywords: Energy systems, Estimation, Machine learning Abstract: Accurate forecasting of state-of-health (SOH) is essential for ensuring safe and reliable operation of lithium-ion cells. However, existing models calibrated on laboratory tests at specific conditions often fail to generalize to new cells that differ due to small manufacturing variations or operate under different conditions. To address this challenge, an uncertainty-aware transfer learning framework is proposed, combining a Long Short-Term Memory (LSTM) model with domain adaptation via Maximum Mean Discrepancy (MMD) and uncertainty quantification through Conformal Prediction (CP). The LSTM model is trained on a virtual battery dataset designed to capture real-world variability in electrode manufacturing and operating conditions. MMD aligns latent feature distributions between simulated and target domains to mitigate domain shift, while CP provides calibrated, distribution-free prediction intervals. This framework improves both the generalization and trustworthiness of SOH forecasts across heterogeneous cells.

16:15-16:30, Paper WeC09.4	Add to My Program
Analysis of the Single Particle Model with Electrolyte for Lithium Iron Phosphate Batteries (I)

Lee, Jaewoong	University of California, Berkeley
Jiang, Shida	Univeristy of California, Berkeley
Tao, Shengyu	Chalmers University of Technology
Zeng, Wente	TotalEnergies S.E
Lamare, Pierre-Olivier	SAFT
Bertin, Clement	SAFT
Monier-Reyes, Daniel	SAFT
Benjamin, Sebastien	Saft S.A
Moura, Scott	University of California, Berkeley
Keywords: Energy systems, Modeling Abstract: This paper examines the challenges in accurately modeling lithium iron phosphate (LFP) batteries using the single particle model with electrolyte (SPMe). The SPMe offers a practical balance between the high-fidelity Doyle-Fuller-Newman (DFN) model and computationally efficient equivalent circuit models (ECM) for battery management systems (BMS). While the SPMe has demonstrated success across many Li-ion chemistries, it encounters structural limitations when applied to LFP batteries. These limitations arise from the unique two-phase kinetics of LFP cathodes, which result in a flat cathode open-circuit potential (OCP). The two-phase transformation mechanisms in the LFP cathodes are not explicitly captured by the single-phase diffusion assumption of the SPMe, and the flat cathode OCP can degrade voltage prediction accuracy. We quantify these challenges and identify root causes through voltage component and parameter sensitivity analyses. Based on these findings, we highlight the need for additional voltage correction terms in the SPMe voltage equation and propose an empirical correction approach for LFP batteries.

16:30-16:45, Paper WeC09.5	Add to My Program
System Identification of Lithium-Ion Battery Equivalent Circuit Models Using Ensemble Kalman Inversion (I)

Barat, Farzaneh	University of Kansas
Wilson, Sara	University of Kansas
Kim, Huijeong	University of Kansas
Fang, Huazhen	Michigan State University
Keywords: Identification, Energy systems, Optimization Abstract: System identification remains an intriguing challenge for lithium-ion batteries, as many models are nonlinear, exhibit multi-physics coupling, and involve a large number of parameters. In this paper, we address this challenge using the ensemble Kalman inversion (EnKI) method for battery system identification. EnKI performs maximum a posteriori parameter estimation through successive local Gaussian approximations, enabling an iterative and incremental search for unknown parameters. The search combines Monte Carlo sampling with Kalman-type updates to evolve an ensemble of samples, thereby offering empirical stability and the ability to handle strongly nonlinear models. We validate the proposed approach on two equivalent circuit models with coupled electro-thermal dynamics, through both simulation and experiments. The results demonstrate that the proposed approach achieves accurate parameter estimation with rapid iterative convergence, and it shows strong potential for application to other battery models.

16:45-17:00, Paper WeC09.6	Add to My Program
Concurrent Optimization of Lithium-Ion Battery Charging and Balancing Via Safe and Data-Efficient Deep Reinforcement Learning

Hailemichael, Habtamu	Clemson University
Ayalew, Beshah	Clemson University
Figueroa-Santos, Miriam	Ground Vehicle System Center
Barron, Morgan	Ground Vehicle System Center
Keywords: Modeling, Reinforcement learning, Energy systems Abstract: This paper presents a novel deep reinforcement learning (DRL)-based approach for the joint optimization of charging and balancing of lithium-ion battery packs. Traditional methods often rely on conservative, rule-based strategies that limit efficiency and adaptability, or on model-based approaches that are computationally intensive for real-time implementation and degrade in optimality as the battery ages. In contrast, our framework employs DRL to develop adaptive, optimal control policies that dynamically respond to the battery’s changing state. DRL training is conducted through interaction with continually learned, high-fidelity hybrid virtual environment that combines physics-based battery models with Transformer-based neural residuals. This continually updated environment enables safe, efficient, and adaptive learning that reflects the battery’s evolving behavior over time. Compared to conventional constant current–constant voltage (CCCV) charging with rule-based balancing, our method achieves up to 2× faster charging and maintains SOC imbalance below 0.25%, demonstrating substantial improvements in performance and safety.


WeC10 Regular Session, Grand Salon 15	Add to My Program
Linear Matrix Inequalities

Chair: Seiler, Peter	University of Michigan, Ann Arbor
Co-Chair: Peet, Yulia	Arizona State University

15:30-15:45, Paper WeC10.1	Add to My Program
Replay Attack Detection with Chaotic Masking in Cyber-Physical Systems

Lu, Hongnian	Zhejiang University
Liu, Zhitao	Zhejiang University
Chen, Tao	Zhejiang University
Su, Hongye	Zhejiang Univ
Keywords: Chaotic systems, Neural networks, LMIs Abstract: Replay attacks are a common threat in cyber-physical systems, potentially causing damage to the physical plant while avoiding detection by replaying historical sensor measurements. In this paper, inspired by the fact that systems with periodic steady states, e.g., constant or sinusoid, are susceptible to the replay attacks, we propose a chaotic masking scheme to break this periodicity, thereby facilitating replay attack detection. Specifically, chaotic signals are superimposed on the sensor measurements, intentionally rendering the transmitted data non-periodic. To de-mask the transmitted data, an extended observer is designed, which estimates the states of both the physical plant and the chaotic system. Finally, the effectiveness of the proposed scheme for replay attack detection is verified by simulation.

15:45-16:00, Paper WeC10.2	Add to My Program
Hinf Analysis and Synthesis for Vector-Valued Edge Consensus Networks

Mihaly, Vlad Mihai	Technical University of Cluj-Napoca
Susca, Mircea	Technical University of Cluj-Napoca
Abou Jaoude, Dany	American University of Beirut
Keywords: Network analysis and control, LMIs, Robust control Abstract: This paper focuses on networks of agents described using vector-valued and time-scaled multi-rate integrators interconnected through matrix-weighted edges. Both the agents/nodes and their edges/interconnections are affected by zero-mean Gaussian noise. The finite-energy noise impact on network consensus is measured using the Hinf-norm of the resulting edge representation of the dynamical system. For this setup, we provide three technical innovations. First, we propose a method to convexify the computation of the aforementioned norm in terms of the weights and time scales of the graph. Second, we provide an iterative convex synthesis method to minimize the said norm based on the tunable weights and time scales. Third, we cover a general output performance index based on arbitrary linear configurations of the system's states, whereas previous literature only considers the spanning tree and co-tree edge states for the network output. A numerical example illustrates the proposed novel algorithm.

16:00-16:15, Paper WeC10.3	Add to My Program
A PIE Representation of Cylindrical PDEs and Stability Analysis Using LPIs

Purra, Varshitha	Arizona State University
Peet, Yulia	Arizona State University
Keywords: Linear systems, Distributed parameter systems, LMIs Abstract: This paper extends the Partial Integral Equation (PIE) framework to cylindrical PDEs. Singular radially-dependent terms, such as 1/r and 1/r2, preclude a direct application of a PDE-to-PIE reformulation, developed for Cartesian coordinates, and the subsequent PIE-based stability analysis, which requires polynomial coefficients. We develop two exact reformulation strategies, (i) a variable substitution s = r^2 and (ii) a weighted formulation multiplying the PDE by an appropriate power of r, and prove the equivalence between the developed reformulations and the original PDE. Using these strategies, we construct PIE representations of a canonical axisymmetric reaction-diffusion equation. We then use Linear Partial Inequalities (LPIs) in PIETOOLS 2024 to test asymptotic stability and compare the results with the analytically-derived stability bounds. Finally, we apply two reformulation strategies to a tokamak flux-gradient PDE model and show that the weighted formulation eliminates all fractional powers and results in an admissible PIE representation, whereas the substitution method fails to do so for this more complicated PDE. This work establishes a foundation for Lyapunov-based stability analysis of cylindrical infinite-dimensional systems.

16:15-16:30, Paper WeC10.4	Add to My Program
Echo State Network Controller Design for a Class of Leaky-Integrator Nonlinear Systems

Deng, Hao	University Paris-Saclay
Stoica, Cristina	CentraleSupélec/L2S, Univ. Paris-Saclay
Ossmann, Daniel	Munich University of Applied Sciences HM
Chadli, M.	University of Paris-Saclay - UEVE
Keywords: LMIs Abstract: This paper proposes a novel sufficient condition for the leaky-integrator echo state network controller design based on Incremental Input-to-State Stability criterion for discrete-time systems. The controller design conditions are derived via Linear Matrix Inequalities. A novel leaky-integrator echo state network-based controller structure is presented, demonstrating enhanced system performance. The simulation results highlight the performance improvements achieved by adopting the proposed controller structure.

16:30-16:45, Paper WeC10.5	Add to My Program
Robust Data-Driven Control for Nonlinear Systems Using Their Digital Twins and Quadratic Funnels

Shakeri, Shiva	University of Washington
Mesbahi, Mehran	University of Washington
Keywords: Lyapunov methods, Iterative learning control, LMIs Abstract: This paper examines a robust data-driven approach for the safe deployment of systems with nonlinear dynamics using their imperfect digital twins. Our contribution involves proposing a method that fuses the digital twin's nominal trajectory with online, data-driven uncertainty quantification to synthesize robust tracking controllers. Specifically, we derive data-driven bounds to capture the deviations of the actual system from its prescribed nominal trajectory informed via its digital twin. Subsequently, the dataset is used in the synthesis of quadratic funnels—robust positive invariant tubes around the nominal trajectory—via linear matrix inequalities built on the time-series data. The resulting controller guarantees constraint satisfaction while adapting to the true system behavior through a segmented learning strategy, where each segment's controller is synthesized using uncertainty information from the previous segment. This work establishes a systematic framework for obtaining safety certificates in learning-based control of nonlinear systems with imperfect models.

16:45-17:00, Paper WeC10.6	Add to My Program
Discrete-Time Stability Analysis of ReLU Feedback Systems Via Integral Quadratic Constraints

Vahedi Noori, Sahel	University of Michigan
Hu, Bin	University of Illinois at Urbana-Champaign
Dullerud, Geir E.	University of Minnesota
Seiler, Peter	University of Michigan, Ann Arbor
Keywords: Robust control, LMIs, Neural networks Abstract: This paper analyzes internal stability of a discrete-time feedback system with a ReLU nonlinearity. This feedback system is motivated by recurrent neural networks. We first review existing static quadratic constraints (QCs) for slope-restricted nonlinearities. Next, we derive hard integral quadratic constraints (IQCs) for scalar ReLU by using finite impulse filters and structured matrices. These IQCs are combined with a dissipation inequality leading to an LMI condition that certifies internal stability. We show that our new dynamic IQCs for ReLU are a superset of the well-known Zames-Falb IQCs specified for slope-restricted nonlinearities. Numerical results show that the proposed hard IQCs give less conservative stability margins than Zames-Falb multipliers and prior static QC methods, sometimes dramatically so.


WeC11 Invited Session, Grand Salon 16	Add to My Program
Mechatronics

Chair: Xia, Fangzhou	The University of Texas at Austin
Co-Chair: Vikas, Vishesh	University of Alabama
Organizer: Barton, Kira	University of Michigan, Ann Arbor
Organizer: Su, Hao	North Carolina State University
Organizer: Mazumdar, Yi	Georgia Institute of Technology
Organizer: Vikas, Vishesh	University of Alabama
Organizer: Xia, Fangzhou	The University of Texas at Austin
Organizer: Zhang, Jun	University of Nevada Reno
Organizer: He, Binghan	The University of Texas at San Antonio
Organizer: Zhang, Qiang	The University of Alabama
Organizer: Han, Feng	New York Institute of Technology
Organizer: Zuo, Shan	University of Connecticut

15:30-15:45, Paper WeC11.1	Add to My Program
Magnetic Position Estimation in Ferrous Actuators under Hysteresis: A Switched-Gain Non-Linear Observer Approach (I)

Daroudi, Sajjad	University of Minnesota
Rajamani, Rajesh	Univ. of Minnesota
Keywords: Mechatronics, Control applications, Nonlinear systems identification Abstract: Estimating actuator position in construction and earthmoving vehicles is critical for enabling precise control, operator assistance, and automation of repetitive tasks. Conventional approaches based on string potentiometer or LVDTs can be costly, require maintenance, and lack robustness in harsh environments. This paper develops a magnetic-field-based position estimation method that uses a low-cost anisotropic magnetoresistive (AMR) sensor mounted on the actuator. A strong magnet attached to the piston provides magnetic field variations measured in longitudinal and vertical directions. A major challenge is that movement of the magnet magnetizes and demagnetizes the actuator’s body causing strong hysteresis in the magnetic field–position relationship, along with non-monotonic sensor characteristics. To address this, a Preisach model is identified from both major and minor loop data, and a polynomial function is used to represent the hysteresis-free baseline. The hysteresis model is then applied in real time to estimate and subtract the hysteresis contribution from the measurements, yielding corrected signals. A switched-gain nonlinear observer is subsequently designed to account for the non-monotonicity of the measurement functions. Experimental results on a hydraulic actuator show that the proposed approach provides accurate position tracking: the observer converges rapidly from unknown initial conditions and maintains estimation errors within 6 mm, with a root-mean-square error (RMSE) of about 1.9 mm.

15:45-16:00, Paper WeC11.2	Add to My Program
Sample Tilt Control for Atomic Force Microscopes with Multiple Parallel Active Cantilevers (I)

Lin, Tingyi	University of Texas, Austin
Hao, Yizhou	University of Texas at Austin
Li, Wenyue	University of Michigan
Sung, Joshua	The University of Texas at Austin
Xia, Fangzhou	The University of Texas at Austin
Keywords: Mechatronics, MEMs and Nano systems, Control applications Abstract: Parallel operation of multiple atomic force microscope (AFM) probes has emerged as a promising approach for high-throughput nanoscale metrology, particularly in semiconductor inspection where rapid and reliable surface characterization is critical. However, when two or more cantilevers operate simultaneously, unavoidable tilt misalignment between the probes and the sample surface can cause asynchronous engagement, unstable imaging, and measurement errors. To address this challenge, this paper presents the design and implementation of a dual-probe AFM (DP-AFM) integrated with an active tilt compensation control strategy. The system employs a hierarchical dual closed-loop framework, where coarse alignment is achieved using a goniometer motor and fine compensation is realized through a z-axis piezoelectric positioner in coordination with a custom tilt stage. Engagement thresholds derived from probe amplitude signals, measured via lock-in amplifiers, serve as feedback inputs to ensure stable and simultaneous probe--sample contact. A proportional-integral (PI) controller regulates the oscillation amplitude during scanning, thereby mitigating disturbances from tilt, environmental noise, and stage dynamics. Experimental results demonstrate that the proposed control scheme reduces engagement error, improves stability, and enables parallel scanning with dual probes. These results highlight the potential of control-based strategies to advance AFM toward scalable multi-probe configurations applicable to semiconductor metrology, biomedical research, and other nanoscale applications.

16:00-16:15, Paper WeC11.3	Add to My Program
Machine Learning-Based Stability Lobe Diagram Estimation and Chatter Suppression Control in Milling Process (I)

Huang, Yi	Rutgers University
Han, Feng	New York Institute of Technology
Liu, Wenyi	Rutgers University
Yi, Jingang	Rutgers University
Guo, Yuebin	Rutgers University
Keywords: Mechatronics, Machine learning, Manufacturing systems Abstract: Chatter is a self-excited vibration in milling that degrades surface quality and accelerates tool wear. This paper presents an adaptive process controller that suppresses chatter by leveraging machine learning-based online estimation of the Stability Lobe Diagram (SLD) in realtime. Stability analysis is conducted using the semi-discretization method for milling dynamics modeled by delay differential equations. An integrated machine learning framework estimates the SLD from sensor data and predicts surface roughness for chatter detection in real time. These estimates are integrated into an optimal controller that adaptively adjusts spindle speed to maintain process stability and improve smoothness of surface finish. Simulations and experiments are performed to demonstrate the superior performance compared to the existing approaches.

16:15-16:30, Paper WeC11.4	Add to My Program
Frequency-Domain Fault Analysis and Diagnostic-Enhanced Design of Hydraulic Systems (I)

Chen, Weichen	Oakland University
Yoon, Yongsoon	Oakland University
Keywords: Fault diagnosis, Fluid power control, Mechatronics Abstract: This paper presents a frequency-domain framework for fault analysis and its application to diagnostic-enhanced design of hydraulic systems. To this end, physics-based inverse generalized frequency response functions are first derived. On this basis, fault sensitivity functions are defined as a scaled gradient vector, enabling a quantitative measure of fault effects in the frequency domain. Fault sensitivity analysis is conducted across light-, medium-, and heavy-duty applications. The analysis reveals that diagnostic performance depends strongly on nominal parameters, and faults can be categorized into two groups according to their effects on sensitivity trajectories in the complex plane: energy-transmission faults and dissipative faults. Building upon these insights, a diagnostic-enhanced design methodology is proposed and applied to a medium-duty hydraulic system, originally exhibiting poor fault isolation. The redesigned system achieves a substantially improved separation metric, demonstrating enhanced diagnostic capability.

16:30-16:45, Paper WeC11.5	Add to My Program
Compliance Control of an Electro-Hydraulic Large-Scale Manipulator for Robotic On-Site Construction

Wasserloos, Philipp	University of Stuttgart
Gienger, Andreas	University of Stuttgart
Sawodny, Oliver	University of Stuttgart
Keywords: Control applications, Mechatronics Abstract: Robotics is an emerging field in the construction sector. While it is a promising approach to increase productivity and sustainability, it also comes with some challenges such as uncertain environments paired with the necessity to ensure safe interaction between the end-effector and its surroundings. To overcome these challenges, this paper presents a compliance control scheme for large-scale electro-hydraulic manipulators with high structural flexibility using joint velocities as control variables, and analyzes its performance. First, a trajectory in workspace coordinates is generated. Then, based on the force measured near the tool center point (TCP), a desired deviation from the trajectory is computed. The new trajectory is then transferred to joint position trajectories, which constitute the reference signal of joint position controllers. As a result, the TCP follows a trajectory in space, but yields to external forces when contact with the environment is established. To evaluate the approach, two experiments are carried out. First, a constant force is applied and the reaction of the system recorded. Second, an assembly situation at the construction site with interaction between the end-effector and the environment is imitated and a building element is inserted by the manipulator. The experiments not only validate the quantitative expectations, but also demonstrate the applicability to construction processes.

16:45-17:00, Paper WeC11.6	Add to My Program
Dominant Pole Placement in Linear Quadratic Regulators: Application in Automated Design of Gain-Scheduled Control

Ekanayake, Lahiru	Southern Illinois University
Komaee, Arash	Southern Illinois University
Keywords: Linear parameter-varying systems, Mechatronics, Identification Abstract: A tuning procedure for linear quadratic regulators (LQR) is presented to stabilize linear systems with a prescribed exponential decay rate. More specifically, this procedure allows an LQR to place the dominant poles of a closed-loop linear system at an arbitrary distance from the imaginary axis on the left half of the complex plane. This procedure is in particular a useful tool for automated control design within the framework of gain scheduling, in which a family of linear controllers must be designed for a set of linear systems that locally approximate a nonlinear system at a large number of operating points. By placing the dominant closed-loop poles of all linear systems at the same distance from the imaginary axis, uniform bandwidth and perhaps nearly identical frequency response can be attained at all operating points. Application of uniform pole placement in gain scheduling control is demonstrated by a practical example in feedback control of magnetic manipulators.


WeC12 Regular Session, Grand Salon 18	Add to My Program
Uncertain Systems

Chair: Coogan, Samuel	Georgia Institute of Technology
Co-Chair: Givigi, Sidney	Queen's University

15:30-15:45, Paper WeC12.1	Add to My Program
Risk Mitigation for Interval Signal Temporal Logic Monitoring and Synthesis

Baird, Luke	Georgia Institute of Technology
Schoer, Andrew	MIT Lincoln Laboratory
Cleaveland, Matthew	MIT Lincoln Laboratory
Coogan, Samuel	Georgia Institute of Technology
Leahy, Kevin	Worcester Polytechnic Institute
Keywords: Uncertain systems, Computational methods, Fault detection Abstract: This letter presents a method to mitigate the risk of violation of a temporal logic specification. Given a time-varying signal with interval-valued uncertainty at each time step, we propose an optimization approach to identify time instances for which a tighter uncertainty bound is required to satisfy the specification, knowledge which can be used to, e.g., focus sensing resources to strategically reduce uncertainty. We demonstrate our method on a simulated unmanned underwater vehicle where GPS calibration is informed by the identified time instances. In contrast to existing methods that solve an uncertainty-aware optimal control program with temporal logic mixed-integer constraints, our proposed interval-tightening approach is several orders of magnitude faster to compute. Additionally, we survey methods to produce interval-valued uncertainty, specifically how probabilistic bounds may translate to confidence intervals about an entire signal.

15:45-16:00, Paper WeC12.2	Add to My Program
Learning-Based Shrinking Disturbance-Invariant Tubes for State and Input-Dependent Uncertainty

Ramadan, Abdelrahman	Queen's University
Givigi, Sidney	Queen's University
Keywords: Uncertain systems, Data driven control, Robust adaptive control Abstract: We develop a learning-based framework for constructing shrinking disturbance-invariant tubes under state- and input-dependent uncertainty, intended as a building block for tube Model Predictive Control (MPC), and certify safety via a lifted, isotone (order-preserving) fixed-point map. Gaussian Process (GP) posteriors become (1 − α) credible ellipsoids, then polytopic outer sets for deterministic set operations. A two-time-scale scheme separates learning epochs, where these polytopes are frozen, from an inner, outside-in iteration that converges to a compact fixed point Z* ⊆ 𝒢; its state projection is RPI for the plant. As data accumulate, disturbance polytopes tighten and the associated tubes nest monotonically, resolving the circular dependence between the set to be verified and the disturbance model while preserving hard constraints. A double-integrator study illustrates shrinking tube cross-sections in data-rich regions while maintaining invariance.

16:00-16:15, Paper WeC12.3	Add to My Program
Learning Feedforward Planners from Frequency-Domain Data

Jiang, Yu	ClearMotion, Inc
Keywords: Uncertain systems, Direct adaptive control, Linear systems Abstract: This paper develops a frequency-domain framework for learning finite-horizon feedforward planners directly from data. By embedding Toeplitz dynamics into circulants, the lifted planning problem is diagonalized under the discrete Fourier transform, reducing control design to per-frequency gain estimation. Trajectory windows are tapered, transformed, and used in convex least-squares regressions with Laplacian or Tikhonov regularization, allowing selective use of informative frequency bins and mitigating noise effects. The resulting planner is obtained without explicit plant identification and yields interpretable frequency responses. On a preview steering problem with a kinematic bicycle model, combining the learned feedforward with a baseline feedback controller improves tracking accuracy relative to feedback alone.

16:15-16:30, Paper WeC12.4	Add to My Program
Safe Navigation in Dynamic Environments Using Data-Driven Koopman Operators and Conformal Prediction

Liang, Kaier	Lehigh University
Yang, Guang	Boston University
Cai, Mingyu	University of California Riverside
Vasile, Cristian Ioan	Lehigh University
Keywords: Uncertain systems, Nonlinear systems identification, Predictive control for linear systems Abstract: We propose a novel framework for safe navigation in dynamic environments by integrating Koopman operator theory with conformal prediction. Our approach leverages data-driven Koopman approximation to learn nonlinear dynamics and employs conformal prediction to quantify uncertainty in the Koopman approximation, providing statistical guarantees on approximation errors. This uncertainty is effectively incorporated into a Model Predictive Controller (MPC) formulation through constraint tightening, ensuring robust safety guarantees. We implement a layered control architecture with a reference generator providing waypoints for safe navigation. The effectiveness of our methods is validated in simulation.

16:30-16:45, Paper WeC12.5	Add to My Program
On Globally Optimal Stochastic Policy Gradient Methods for Domain Randomized LQR Synthesis

Nguyen-Le, Alexian	University of Pennsylvania
Matni, Nikolai	University of Pennsylvania
Keywords: Uncertain systems, Robust control, Randomized algorithms Abstract: Domain randomization is a simple, effective, and flexible scheme for obtaining robust feedback policies aimed at reducing the sim-to-real gap due to model mismatch. While domain randomization methods have yielded impressive demonstrations in the robotics-learning literature, general and theoretically motivated principles for designing optimization schemes that effectively leverage the randomization are largely unexplored. We address this gap by considering a stochastic policy gradient descent method for the domain randomized linear-quadratic regulator synthesis problem, a situation simple enough to provide theoretical guarantees. In particular, we demonstrate that stochastic gradients obtained by repeatedly sampling new systems at each gradient step converge to global optima with appropriate hyperparameters choices, and yield better controllers with lower variability in the final controllers when compared to approaches that do not resample. Sampling is often a quick and cheap operation, so computing policy gradients with newly sampled systems at each iteration is preferable to evaluating gradients on a fixed set of systems.

16:45-17:00, Paper WeC12.6	Add to My Program
Indirect Herding Control through a Chain-Of-Influence by Uncertain Intermediaries

Philor, Jhyv	University of Florida
Nino, Cristian	Florida Institute for Human and Machine Cognition
Amy, Patrick	University of Florida
Dixon, Warren E.	University of Florida
Keywords: Uncertain systems, Stability of nonlinear systems, Lyapunov methods Abstract: Herding is an established class of problems where a herding agent seeks to influence a target agent to move to a desired location. Motivated by scenarios where the herder cannot directly influence a target or the herder desires to conceal its influence, this paper examines a new generalization of the herding problem where the herding agent influences a target agent through a set of intermediate agents (i.e., multi-hop chain of influence). The interconnected influences result in challenging coupled dynamics within the group of agents. Motivated by this challenge, robust control techniques are used to compensate for the complex chain of influences among the group, and a Lyapunov-based analysis is used to prove the target is exponentially regulated to a desired location by the herding agent.


WeC13 Regular Session, Grand Salon 19	Add to My Program
Autonomous Systems III

Chair: Rogers, Jonathan	Naval Surface Warfare Center, Philadelphia Division
Co-Chair: Sargolzaei, Arman	University of South Florida

15:30-15:45, Paper WeC13.1	Add to My Program
Trusted Reinforcement Learning Control of a Shipboard Stewart Platform Via Control Lyapunov Functions and Control Barrier Functions

Rogers, Jonathan	Naval Surface Warfare Center, Philadelphia Division
Ripple, John T.	Naval Surface Warfare Center, Philadelphia Division
Keywords: Control applications, Reinforcement learning, Stability of nonlinear systems Abstract: Certain shipboard payloads–such as cranes, cameras, or landing decks–require ship stability in order to operate correctly. Unfortunately, wave induced ship motion causes these systems to perform in a suboptimal manner. A solution is to mount the payloads onto a parallel manipulator–like a Stewart platform–instead of directly to the ship. However, Stewart platforms are difficult to control because they are highly nonlinear and, in the case of shipboard applications, the bottom plate will move due to the ship motion. Motivated by the need to stabilize shipboard payloads, in this paper, we present a novel reinforcement learning (RL) based control strategy for a Stewart platform that contains controller stability and system safety guarantees through the use of a control Lyapunov function (CLF) and control barrier function (CBF) based quadratic program (QP) filter. We demonstrate robustness of our control strategy through numerical validation that includes real-world extreme wave induced ship motion.

15:45-16:00, Paper WeC13.2	Add to My Program
Quadrotor Aerobatics: A Geometric Algebra–Based Tracking Approach

Garcia Alcantara, Omar Alejandro	New Mexico State University
Sandoval, Steven	New Mexico State University
Rubio Scola, Ignacio	INTI - Conicet
Garcia Carrillo, Luis Rodolfo	Air Force Research Laboratory (AFRL)
Espinoza Quesada, Eduardo Steed	CINVESTAV
Keywords: Modeling, Algebraic/geometric methods, Autonomous vehicles Abstract: Geometric Algebra (GA) tools are employed to enable trajectory tracking of aerobatic maneuvers for a quadrotor while explicitly accounting for its fully coupled 6-DoF dynamics. The proposed method introduces a GA-based prioritization framework for reference generation that unifies translational objectives, preferred orientations, and aerobatic maneuver commands within a single control structure. Rotational references are derived analytically from translational trajectories, thereby avoiding the need for numerical differentiation. The effectiveness of the trajectory tracking controller is demonstrated through numerical simulations of a mission profile comprising three aerobatic maneuvers: barrel roll, 720° corkscrew, and a loop, executed in the presence of external disturbances (drag and wind). The results demonstrate the controller's robustness and effectiveness for handling underactuation and achieving trajectory tracking of aerobatic maneuvers beyond π radians.

16:00-16:15, Paper WeC13.3	Add to My Program
Resilient UAV Mission Management in Threat-Prone Environments Using Markov Decision Processes

Quamar, Md Muzakkir	King Fahd University of Petroleum and Minerals
Nasir, Ali	King Fahd University of Petroleum and Minerals
Keywords: Markov processes, Autonomous systems, Stochastic optimal control Abstract: This paper proposes a unified Markov Decision Process (MDP)-based framework for resilient mission management of a single unmanned aerial vehicle (UAV) operating in threat-prone and resource-constrained environments. UAV missions are inherently challenged by limited battery capacity, stochastic threats, dynamic goal reprioritization, and potential actuator, sensor, or payload faults. Existing approaches typically address these factors in isolation, resulting in suboptimal decision-making under complex operational conditions. The proposed framework formulates mission execution as a finite-state MDP that integrates post-fault system capability assessment, state-of-charge-based range feasibility, dynamic goal updates, and adaptive navigation mode switching for threat evasion. A structured cost function captures trade-offs among mission commitment, fault mitigation, energy preservation, and risk-aware maneuvering. The optimal policy is computed offline using value iteration and implemented online via a precomputed lookup structure to enable low-latency execution. Simulation results demonstrate adaptive goal commitment, safe recovery under major faults, and recharge-aware planning under energy constraints, thereby enhancing mission reliability and operational resilience.

16:15-16:30, Paper WeC13.4	Add to My Program
Nonlinear Time-Optimal Trajectory Optimization for Heterogeneous Microrobots in Constrained Environments

TalebiRostami, Hossein	Old Dominion University
Beaver, Logan E.	Old Dominion University
Keywords: Optimization, Optimization algorithms, Robotics Abstract: This paper presents a nonlinear optimization framework for offline, time-optimal trajectory planning of heterogeneous microrobots driven by a shared global magnetic field. Navigation is posed as a constrained nonlinear program with workspace limits, obstacle avoidance, and inter-robot separation. To reduce computational burden, three enhancements are introduced: micro-step constraint aggregation, vectorized trajectory evaluation, and precomputed velocity–frequency interpolation. These updates reduce computation time by nearly 90% relative to the original formulation without compromising accuracy. Simulations with up to three microrobots in cluttered environments, using experimentally calibrated speed–frequency data, demonstrate feasibility of coordinated shared-input control. Compared with Rapidly-exploring Random Trees (RRT) and path-length–based strategies, the proposed formulation converges reliably while explicitly minimizing total traversal time.

16:30-16:45, Paper WeC13.5	Add to My Program
Data-Driven Predictive Control for Autonomous Vehicle Tracking with Obstacle Avoidance Using Gaussian Processes

Xie, Mengxu	Northeastern University
Ma, Tong	Northeastern University
Keywords: Stochastic optimal control, Identification for control, Autonomous systems Abstract: This work tackles the challenge of vehicle tracking under obstacle avoidance by embedding path planning within a model- and data-driven predictive control framework that leverages Gaussian processes (GP-MDPC) in conjunction with backoffs. Autonomous driving is inherently affected by stochasticity and uncertainty, making vehicle tracking with obstacle avoidance a stochastic constrained control problem, where stochastic nonlinear model predictive control (SNMPC) emerges as the primary approach. The proposed approach addresses two key challenges in SNMPC. First, GPs are employed to capture unknown vehicle dynamics, with sparse GPs adopted to reduce computational complexity and enable efficient online evaluation. Second, a backoff-based approximation is introduced to reformulate chance constraints into tractable expressions by tuning backoffs offline from generated closed-loop Monte Carlo samples, which balances the risk of constraint violation and robustness as well as guarantees online recursive feasibility. The backoff method improves vehicle tracking performance by mitigating the conservatism inherent in Chebyshev's inequality method. The resulting formulation yields a finite-horizon stochastic optimal control problem (FH-SOCP) that integrates vehicle and obstacle chance constraints with state mean and covariance. Simulation results confirm that sparse GP-MDPC using backoffs achieves superior computational efficiency while maintaining satisfactory tracking performance as well as ensuring safety compared to full GP-MDPC either using backoffs or Chebyshev’s inequality.

16:45-17:00, Paper WeC13.6	Add to My Program
An Actor-Critic-Identifier Control Design for Increasing Energy Efficiency of Automated Electric Vehicles

Faghihian, Hamed	University of South Florida
Sargolzaei, Arman	University of South Florida
Keywords: Automotive control, Reinforcement learning, Autonomous systems Abstract: Electric vehicles (EVs) can achieve greater operational effectiveness when range limitations are mitigated. Enhancing energy efficiency through advanced control strategies is therefore of significant importance, and recent developments in vehicle automation provide a promising opportunity to address this challenge. However, many existing approaches rely on indirect methods, primarily because establishing a direct relationship between control inputs and power consumption is inherently difficult. To address this limitation, a neural network (NN)-based identifier is proposed to learn this mapping online and integrate it with an actor-critic reinforcement learning (RL) framework to synthesize an optimal control policy. The resulting actor-critic-identifier architecture eliminates the reliance on explicit system models while maintaining accurate speed tracking and improving energy efficiency. In the proposed framework, the update laws of the neural networks are derived using Lyapunov-based stability analysis to ensure closed-loop stability. The effectiveness of the method is evaluated through simulation studies. Comparative results with a baseline controller demonstrate that the proposed approach improves overall energy efficiency by 42%, indicating substantial potential for enhancing the energy efficiency of electric vehicles.


WeC14 Regular Session, Grand Salon 21	Add to My Program
Machine Learning I

Chair: Pare, Philip E.	Purdue University
Co-Chair: Farhood, Mazen	Virginia Tech

15:30-15:45, Paper WeC14.1	Add to My Program
Learning Passive Continuous-Time Dynamics with Multistep Port-Hamiltonian Gaussian Processes

Leung, Humphrey	Purdue University
Pare, Philip E.	Purdue University
Keywords: Learning, Identification for control, Nonlinear systems identification Abstract: We propose the multistep port-Hamiltonian Gaussian process (MS-PHS GP) to learn physically consistent continuous-time dynamics and a posterior over the Hamiltonian from noisy, irregularly-sampled trajectories. By placing a GP prior on the Hamiltonian surface H and encoding variable-step multistep integrator constraints as finite linear functionals, MS-PHS GP enables closed-form conditioning of both the vector field and the Hamiltonian surface without latent states, while enforcing energy balance and passivity by design. We state a finite-sample vector-field bound that separates the estimation and variable-step discretization terms. Lastly, we demonstrate improved vector-field recovery and well-calibrated Hamiltonian uncertainty on mass-spring, Van der Pol, and Duffing benchmarks.

15:45-16:00, Paper WeC14.2	Add to My Program
Data-Driven Discrepancy Modeling in Higher-Dimensional State Space Via Coprime Factorization

Sinha, Sourav	Virginia Tech
Farhood, Mazen	Virginia Tech
Keywords: Learning, Identification for control, Robust control Abstract: This work provides a data-driven framework that combines coprime factorization with a lifting linearization technique to model the discrepancy between a nonlinear system and its nominal linear approximation using a linear time-invariant (LTI) state-space model in a higher-dimensional state space. In the proposed framework, the nonlinear system is represented in terms of the left coprime factors of the nominal linear system, along with perturbations modeled as stable, norm-bounded LTI systems in a higher-dimensional state space using a deep learning approach. Our method builds on a recently proposed parametrization for norm-bounded systems, enabling the simultaneous minimization of the H-infinity norm of the learned perturbations. We also provide a coprime factorization-based approach as an alternative to direct methods for learning lifted LTI approximations of nonlinear systems. In this approach, the LTI approximations are obtained by learning their left coprime factors, which remain stable even when the original system is unstable. The effectiveness of the proposed discrepancy modeling approach is demonstrated through multiple examples.

16:00-16:15, Paper WeC14.3	Add to My Program
Entropy-Regularized Two-Stage State-Only Imitation Learning under Unknown Dynamics

Wang, Dejin	Northeastern University
Ghoreishi, Seyede Fatemeh	Northeastern University
Keywords: Learning, Machine learning Abstract: Imitation learning (IL) enables policy recovery from demonstrations without requiring reward specification; however, most approaches assume access to expert actions and known dynamics---assumptions that are rarely satisfied in realistic domains. We propose a two-stage state-only IL framework that first learns a dynamics surrogate from auxiliary action-labeled data and then recovers an expert-like policy from purely state-only demonstrations. This decoupling is a key distinction from prior state-only IL methods, providing a principled path to stable policy recovery. To address the policy identifiability challenge inherent to state-only settings, we incorporate entropy regularization, which encourages deterministic solutions and improves robustness rather than acting as an ad hoc penalty. Our framework is supported by a theoretical analysis clarifying how modeling and policy errors impact imitation accuracy. Experiments conducted in autonomous driving scenarios within the CARLA simulator show that our method achieves higher action accuracy, lower variance, and improved control rewards compared to state-of-the-art baselines.

16:15-16:30, Paper WeC14.4	Add to My Program
Learning Transferable Friction Models and LuGre Identification Via Physics-Informed Neural Networks

Ozmen, Asutay	Univ. of California, Santa Barbara
Hespanha, Joao P.	Univ. of California, Santa Barbara
Byl, Katie	Univ. of California at Santa Barbara
Keywords: Learning, Modeling, Simulation Abstract: Accurately modeling friction in robotics remains a core challenge, as robotics simulators like MuJoCo and PyBullet use simplified friction models or heuristics to balance computational efficiency with accuracy, where these simplifications and approximations can lead to substantial differences between simulated and physical performance. In this paper, we present a physics-informed friction estimation framework that enables the integration of well-established friction models with learnable components, requiring only minimal, generic measurement data. Our approach enforces physical consistency yet retains the flexibility to capture complex friction phenomena. We demonstrate, on an underactuated and nonlinear system, that the learned friction models, trained solely on small and noisy datasets, accurately reproduce dynamic friction properties with significantly higher fidelity than the simplified models commonly used in robotics simulators. Crucially, we show that our approach enables the learned models to be transferable to systems they are not trained on. This ability to generalize across multiple systems streamlines friction modeling for complex, underactuated tasks, offering a scalable and interpretable path toward improving friction model accuracy in robotics and control.

16:30-16:45, Paper WeC14.5	Add to My Program
Adaptive Control with Integral Concurrent Learning for Systems with Uncertain Dynamical and Control Effectiveness Terms

Gul, Zeki	Ege University
Tatlicioglu, Enver	Ege University
Deniz, Meryem	Izmir Katip Celebi University
Zergeroglu, Erkan	Gebze Technical University
Keywords: Learning, Stability of linear systems, Estimation Abstract: This work presents a concurrent learning based adaptive controller formulation for a class of uncertain linear systems having unknown effectiveness matrix that represents potential degradations in actuator performance. Specifically, making use of a memory stack containing selected data from the recorded and current information gathered from the system's runs incorporated with a model reference adaptation algorithm, asymptotically convergent error tracking is guaranteed. When the memory stack contains sufficiently rich system information, the proposed methodology ensures both exponential convergence of the tracking error signal and exact estimation of the parameters.

16:45-17:00, Paper WeC14.6	Add to My Program
Data-Driven Practical Stabilization of Nonlinear Systems Via Chain Policies: Sample Complexity and Incremental Learning

Siegelmann, Roy	Massachusetts Institute of Technology
Mallada, Enrique	Johns Hopkins University
Keywords: Learning, Stability of nonlinear systems, Lyapunov methods Abstract: We propose a method for data-driven practical stabilization of nonlinear systems with provable guarantees, based on the novel concept of Nonparametric Chain Policies (NCPs). The approach employs a normalized nearest-neighbor rule to assign, at each state, a finite-duration control signal derived from stored data, after which the process repeats. Unlike recent works that model the system as linear, polynomial, or polynomial fraction, we only require the system to be locally Lipschitz. Our analysis builds on the framework of Recurrent Lyapunov Functions (RLFs), which enable data-driven certification of stability using standard norm functions instead of requiring the explicit construction of a classical Lyapunov function. To extend this framework, we introduce the concept of Recurrent Control Lyapunov Functions (R-CLFs), which can certify the existence of an NCP that practically stabilizes an arbitrarily small c-neighborhood of an equilibrium point. We also provide an explicit sample complexity guarantee of O((3/rho)^d log(R/c)) number of trajectories—where R is the domain radius, d the state dimension, and rho is a system-dependent constant. The proposed Chain Policies are nonparametric, thus allowing new verified data to be readily incorporated into the policy to either improve convergence rate or enlarge the certified region. Numerical experiments illustrate and validate these properties.


WeC15 Regular Session, Grand Salon 22	Add to My Program
Algebraic and Geometric Methods

Chair: Clark, William	Ohio University
Co-Chair: Verriest, Erik I.	Georgia Inst. of Tech

15:30-15:45, Paper WeC15.1	Add to My Program
Geometric Insight in Solving Optimal Control Problems and the Emergence of Generalized Functions

Verriest, Erik I.	Georgia Inst. of Tech
Keywords: Algebraic/geometric methods, Optimal control Abstract: Geometric insight may lead to a quick solution for a class of non-LQ optimal control problems. We illustrate this with a simple example that looks very inconspicuous. While necessary conditions for optimality are easily obtained, their analytic solution may not be easy. But some problems are reducible to an Euclidean distance problem. This insight then leads to the additional realization that in some cases, optimality may require impulsive inputs. However, Dirac deltas cannot be compatible with nonlinear operations, at least not in Schwartz's distribution theory. Thus, it seems that we may have a solution but not a theory. Since the solution is transparent in its geometric form, it suggests that another approach to generalized functions, as proposed by Colombeau, should be used. This is very valuable as it corroborates our earlier work (MTNS-24). Generalizations are then sought for other problems reducible to Euclidean minimum distance problems, and even more general Riemannian spaces. We make some connections with the notion of persistence of behavior, where these results apply.

15:45-16:00, Paper WeC15.2	Add to My Program
On the Equivalence of Koopman Eigenfunctions and Commuting Symmetries

Jiang, Xinyuan	Pennsylvania State University
Li, Yan	The Pennsylvania State University
Keywords: Algebraic/geometric methods, Reduced order modeling Abstract: The Koopman operator framework offers a way to represent a nonlinear system as a linear one. The key to this simplification lies in the identification of eigenfunctions. While various data-driven algorithms have been developed for this problem, a theoretical characterization of Koopman eigenfunctions from geometric properties of the flow is still missing. This paper provides such a characterization by establishing an equivalence between a set of Koopman eigenfunctions and a set of commuting symmetries -- both assumed to span the tangent spaces at every point on a simply connected open set. Based on this equivalence, we derive an explicit formula for the principal Koopman eigenfunctions and prove its uniform convergence on the region of attraction of a locally asymptotically stable equilibrium point, thereby offering a constructive method for computing Koopman eigenfunctions.

16:00-16:15, Paper WeC15.3	Add to My Program
Variations on the Retarded Oscillator through Diffeology

Clark, William	Ohio University
Keywords: Delay systems, Algebraic/geometric methods, Variational methods Abstract: The dynamics and evolution of many real-world systems depend on past values of the states due to the finite propagation speed of information. One way to model such systems is through delay differential equations. Unlike ordinary differential equations, delay systems are inherently infinite-dimensional which makes their analysis more difficult. Standard approaches for studying these systems is through either functional analysis or perturbation analysis as the delay vanishes. This work introduces the idea of diffeology as a possible framework for the qualitative geometric study of delay differential equations. As an application of this, the variational equation is presented for retarded delay differential equations with state-dependent delays. As a case-study, this theory is applied to the retarded oscillator.

16:15-16:30, Paper WeC15.4	Add to My Program
Traffic Characterization of Event-Triggered Control Systems: A Geometric-Algebraic Perspective

Chen, Tao	Central South University
Wang, Hongju	Central South University
Hu, Wenfeng	Central South University
Keywords: Hybrid systems, Modeling Abstract: The triggering behaviors of event-triggered control systems are investigated in this article and characterized by inter-event times (IETs) and the transitions among them. Such behaviors are particularly significant in resource-constrained networked control systems, where IETs determine packet transmission instants and shape communication traffic, directly affecting bandwidth utilization and closed-loop stability. While the feasibility of inter-event time (IET) transitions is conventionally formulated as a complex constraint satisfaction problem, we reformulate it into an equivalent linear cone problem, which provides a clearer description of the feasible region for reliable analysis. Based on this formulation, necessary and sufficient conditions are established to determine the feasibility of IET transitions, together with an algorithm to compute the set of all feasible IET transitions. Numerical simulations illustrate how transition feasibility varies with the control parameter and provide insight for parameter selection and system design.

16:30-16:45, Paper WeC15.5	Add to My Program
Homogeneous Approximations and Fixed Time Semistability with Applications to Network Consensus

Verma, Kriti	Georgia Institute of Technology
Haddad, Wassim M.	Georgia Inst. of Tech
Keywords: Stability of nonlinear systems, Algebraic/geometric methods, Lyapunov methods Abstract: This paper studies properties of homogeneous approximations of systems with a continuum of equilibrium points in a geometric, coordinate-free setting. Specifically, we develop the concept of homogeneity in the 0-limit and ∞-limit (i.e., homogeneity in the bi-limit) with respect to general dilations. A key contribution of the paper is a result relating the regularity properties of homogeneous in the bi-limit functions and vector fields to their degree of homogeneity and the local behavior of the dilations near the equilibrium and infinity. Specifically, we develop new finite time and fixed time semistability results for a class dynamical systems possessing a continuum of equilibria which are homogeneous in the bi-limit with respect to a semi-Euler vector field. Finally, we use these results to develop thermodynamically inspired distributed consensus control protocols for multiagent network systems for achieving coordination tasks in fixed time.

16:45-17:00, Paper WeC15.6	Add to My Program
Source-Seekers with Obstacle Avoidance for Nonholonomic Vehicles with Perception: A Model-Free Geometric Hybrid Control Approach

Zhang, Xiyuan	University of California San Diego
Abdelgalil, Mahmoud	University of California, San Diego
Poveda, Jorge I.	University of California, San Diego
Keywords: Optimization algorithms, Learning, Hybrid systems Abstract: We study the problem of steering nonholonomic vehicles toward the source of an unknown signal, accessible only through noisy measurements, while navigating cluttered environments using onboard perception instead of position data. Such stabilization cannot be robustly achieved with smooth feedback laws due to topological obstructions and geometric constraints induced by the obstacles and the nonholonomic dynamics. To address this, we propose a model-free hybrid controller based on Lie-bracket averaging, integrating perception maps from a transformer-based mixture-of-experts model with geometric control. Perception maps provide camera-derived state information, while the hybrid controller ensures robust source seeking. We show that, under bounded and even adversarial perception errors, the closed-loop system achieves semi-global practical asymptotic stability and guarantees obstacle avoidance. Experiments in dense environments demonstrate consistent collision-free trajectories and reliable convergence, closely matching the ground truth.


WeC16 Regular Session, Grand Salon 24	Add to My Program
Reinforcement Learning III

Chair: Wongpiromsarn, Tichakorn (Nok)	Iowa State University
Co-Chair: Xu, Zhe	Arizona State University

15:30-15:45, Paper WeC16.1	Add to My Program
Encoding High-Level Knowledge in Offline Multi-Agent Reinforcement Learning Using Reward Machines

Meshkat Alsadat, Shayan	Arizona State University
Xu, Zhe	Arizona State University
Keywords: Reinforcement learning Abstract: Offline reinforcement learning (RL) learns policies from fixed-size datasets without interacting with the environment, while multi-agent reinforcement learning (MARL) faces challenges from large joint state-action spaces and agent interdependencies. Most offline MARL methods apply regularizations, ignoring system-wide dependencies, risking extrapolation errors. We propose Automata-Guided Multi-Agent Offline RL with Reward Machine (AGMORL), a novel framework extending automata learning to offline MARL with reward machines. AGMORL uses a deterministic finite automaton to learn the reward machine from a dataset, capturing team dynamics and agent interactions, while guiding individual policies to avoid out-of-distribution actions by encoding dataset knowledge. Unlike other methods, it avoids secondary components like generative models. We provide convergence guarantees to an optimal policy and show AGMORL outperforms state-of-the-art offline MARL methods.

15:45-16:00, Paper WeC16.2	Add to My Program
Enhancing Multi-Agent Reinforcement Learning by Mining Decomposed Causal Signal Temporal Logic Formula

Partovi Aria, Hadi	Arizona State University
Xu, Zhe	Arizona State University
Keywords: Reinforcement learning, Agents-based systems Abstract: Reinforcement Learning (RL) has seen significant success in training agents to perform complex sequential tasks. However, challenges remain in multi-agent contexts where agents must coordinate to satisfy temporal and causal specifications. This paper introduces MASTL-CIRL (Multi-Agent Signal Temporal Logic for Causal Inference in Reinforcement Learning), a framework that extends single-agent causal STL inference to multi-agent systems. Our framework mines interpretable causal specifications that capture temporal dependencies while enabling decentralized execution through principled task decomposition. Experimental results in cooperative multi-agent environments demonstrate that our method outperforms traditional approaches, achieving higher sample efficiency, more robust policies, and better generalization to novel scenarios. The decomposed formulas retain causal interpretability while significantly reducing communication requirements between agents during execution.

16:00-16:15, Paper WeC16.3	Add to My Program
Incorporating System-Level Safety Requirements in Perception Models Via Reinforcement Learning

Fan, Weisi	Iowa State Univeristy
Lane, Jesse	Iowa State University
Liu, Qisai	Iowa State University
Sarkar, Soumik	Iowa State University
Wongpiromsarn, Tichakorn (Nok)	Iowa State University
Keywords: Reinforcement learning, Autonomous systems, Machine learning Abstract: Control of autonomous systems is often designed under the assumption of accurate perception. However, perception models are typically trained in isolation using metrics such as accuracy, precision, and recall, which fail to capture the downstream safety consequences of perception errors. Conventional loss functions, including cross-entropy and negative log-likelihood, treat all misclassifications equally and ignore how perception errors propagate into decision-making and control. To address this limitation, we propose a safety-driven training paradigm that explicitly integrates system-level safety objectives into the optimization of perception models. Formally specified safety requirements, expressed using the rulebook formalism, are translated into quantitative violation scores that reflect the severity of perception-induced failures. These scores are incorporated into a reinforcement learning–based fine-tuning process, enabling perception models to adjust their internal representations to reduce high-risk system-level violations. Experiments conducted in the high-fidelity CARLA simulator show that, compared to state-of-the-art perception models trained with conventional objectives, our approach reduces average safety violations by up to 50% under challenging driving scenarios. The proposed framework provides a principled and controller-agnostic mechanism for aligning perception training with formal safety specifications, offering a practical pathway toward safer learning-enabled autonomous systems.

16:15-16:30, Paper WeC16.4	Add to My Program
Wasserstein-Barycenter Consensus for Cooperative Multi-Agent Reinforcement Learning

Baheri, Ali	Rochester Institute of Technology
Keywords: Reinforcement learning, Cooperative control, Distributed control Abstract: Cooperative multi-agent reinforcement learning (MARL) demands principled mechanisms to align heterogeneous policies while preserving the capacity for specialized behavior. We introduce a novel consensus framework that defines the team strategy as the entropic-regularized p-Wasserstein barycenter of agents’ joint state–action visitation measures. By augmenting each agent’s policy objective with a soft penalty proportional to its Sinkhorn divergence from this barycenter, the proposed approach encourages coherent group behavior without enforcing rigid parameter sharing. We derive an algorithm that alternates between Sinkhorn-barycenter computation and policy-gradient updates, and we prove that, under standard Lipschitz and compactness assumptions, the maximal pairwise policy discrepancy contracts at a geometric rate. Empirical evaluation for a cooperative navigation case study demonstrates that our OT-barycenter consensus outperforms an independent learners baseline in convergence speed and final coordination success.

16:30-16:45, Paper WeC16.5	Add to My Program
Dynamics-Augmented Reinforcement Learning for Partially Measurable Nonlinear Systems

Patil, Aditya Satish	University of Minnesota
Sun, Zongxuan	University of Minnesota
Kim, Kenneth	DEVCOM Army Research Laboratory
Kweon, Chol-Bum	DEVCOM Army Research Laboratory
Keywords: Reinforcement learning, Data driven control Abstract: Designing effective controllers for nonlinear systems without access to accurate models or full state measurements remains a core challenge in control engineering. While model-free reinforcement learning methods such as Deep Deterministic Policy Gradient (DDPG) can achieve high performance when full state information is available, their effectiveness typically degrades under partial state measurement. This paper proposes a two-phase control framework that decouples dynamics representation learning from policy optimization. A key novelty of our approach is the explicit use of measurement history to learn a predictive model of system dynamics, enabling a compact, dynamics-aware embedding to be constructed offline. This learned representation is then used to augment the input to a DDPG agent, facilitating stable and sample-efficient control under partial measurement. We evaluate the proposed approach on a high speed trajectory tracking task for a nonlinear electrohydraulic system. Results show that the method achieves desired tracking and rapid convergence under partial state measurement, outperforming recurrent reinforcement learning approaches in both sample efficiency and final tracking performance.

16:45-17:00, Paper WeC16.6	Add to My Program
Expert-Augmented Distributed Multi-Agent Reinforcement Learning with Jensen–Shannon Regularization: A Policy Gradient Approach

Wang, Dongming	University of California, Riverside
Zhu, Yuhan	University of California, Riverside
Zhang, Yanyu	University of California, Riverside
Ren, Wei	University of California, Riverside
Keywords: Reinforcement learning, Networked control systems, Distributed control Abstract: In this paper, we propose an expert-augmented distributed multi-agent reinforcement learning (MARL) framework that integrates local prior knowledge from demonstrations, pretrained models, or transfer learning, while relying only on local observations and one-hop neighbor communication. To quantify the quality of expert policies, we introduce a Jensen–Shannon divergence–based metric. In the general case, we establish that the algorithm retains sublinear convergence to a neighborhood of the optimum, with irreducible bias from local communication constraints, and achieves faster convergence in practice. In particular, when the expert policy is sufficiently close to the optimal, our algorithm establishes local linear convergence. Experiments in Level-Based Foraging environments support our theoretical findings, providing empirical validation. Overall, this framework offers a principled method to incorporate expert knowledge into distributed MARL, with promising potential applications in robotics and autonomous systems where demonstrations are available.


WeC17 Regular Session, Churchill A1	Add to My Program
Biological Systems I

Chair: Del Vecchio, Domitilla	Massachusetts Institute of Technology
Co-Chair: Gracy, Sebin	South Dakota School of Mines and Technology

15:30-15:45, Paper WeC17.1	Add to My Program
Game-Theoretic Social Distancing in Competitive Bi-Virus SIS Epidemics

Catalano, Benjamin	South Dakota School of Mines
Paarporn, Keith	University of Colorado, Colorado Springs
Gracy, Sebin	South Dakota School of Mines and Technology
Keywords: Biological systems, Autonomous systems, Network analysis and control Abstract: The numerous elements that drive infectious dis- eases are highly complex, especially social behaviors that evolve in tandem with the spreading of diseases. Moreover, recent studies highlight the importance of understanding how multiple strains will simultaneous spread through a population (e.g.Delta and Omicron variants of SARS-CoV-2). In this paper, we propose a bi-virus SIS epidemics model that is coupled with individual social distancing behaviors. The behaviors are governed by replicator equations from evolutionary game-theory. The prevalence of the strains impact individuals’ choices to social distance or not, and, in turn, their choices affect the spreading of the bi-virus epidemic. Our analysis identifies the system’s equilibria and their local stability properties, which reveals several isolated fixed points with varying levels of social distancing. We find that the only outcomes where co-existence of the two strains is possible are on line segments of equilibria that emerge when the reproduction numbers of both strains are equivalent, and under suitable conditions on our model parameters, said line segments are locally exponentially stable. We demonstrate our findings with several numerical simulations.

15:45-16:00, Paper WeC17.2	Add to My Program
Stochastic Analysis of Plasmids Competition in a Bacterial Switch

Ruolo, Iacopo	Massachusetts Institute of Technology
Lu, Eric	Massachusetts Institute of Technology
Del Vecchio, Domitilla	Massachusetts Institute of Technology
Keywords: Biological systems, Cellular dynamics, Stochastic systems Abstract: Recombinase-based toggle switches recently emerged as synthetic gene circuits for encoding reversible cellular memory in bacteria, yet an understanding of their dynamics has been lacking. In particular, when the state is flipped via external inputs, it is possible that plasmids with identical origins of replication coexist, leading to competition for plasmid replication machinery. The role of these dynamics in such circuits has not been analyzed. In this work, we model intracellular plasmid dynamics using deterministic and stochastic approaches to examine how competition by different plasmids for the same replication machinery shapes the long-term outcome of switching events. While deterministic analysis predicts monostability, analysis of a stochastic chemical reaction model reveals the existence of two absorbing sets in which only one of the plasmid species exists. These results indicate that the final outcome of a switching event depends on the ratio of incompatible plasmids within the cell and on the relative binding constants for the replication machinery, potentially leading to failure of state maintenance. This analysis investigates the characteristics of recombinase-based toggle switches accounting for plasmid competition and identifies key parameters that must be optimized to achieve long-term memory.

16:00-16:15, Paper WeC17.3	Add to My Program
Prescribed-Time Control of an Agent-Based SIWS Epidemic Model

Abbasi, Zohreh	University of Waterloo
Liu, Xinzhi	University of Waterloo
Keywords: Biological systems, Stability of nonlinear systems, Control applications Abstract: This paper proposes a multi-agent approach to a multilayer, interconnected, and heterogeneous SIWS (Susceptible-Infected-Water-Susceptible) epidemic model incorporating population flow. The layers represent city-to-city connectivity, water resources, and designated leaders. A layer-specific prescribed-time synchronization strategy is applied to ensure infection eradication across cities and water systems within a user-defined time horizon. A saturated control input model is considered to account for limited medical resources and to enforce positive infection rates, addressing real-world constraints. Theoretical results are validated through a case study modeled on Louisiana, U.S., which demonstrates effective epidemic suppression within the prescribed time frame.

16:15-16:30, Paper WeC17.4	Add to My Program
Feedback Regulation of Protein Synthesis with State-Dependent Delays

Chatterjee, Poulami	University of Delaware
Singh, Abhyudai	University of Delaware
Keywords: Biological systems, Stochastic systems, Modeling Abstract: Cells must maintain stable protein levels despite the stochastic nature of gene expression, as large fluctuations can disrupt cellular function and decision-making. Mechanisms such as negative feedback and delays in protein activation help buffer this fluctuation, but the role of state-dependent delays remains unclear. We develop a stochastic model where proteins are produced in bursts as inactive molecules, pass through intermediate activation steps, and then degrade. The activation delay depends on the current active protein level, creating a state-dependent feedback loop. We derive analytical expressions linking delay structure and feedback strength to fluctuations in active protein levels, quantified by the Fano factor. Our results show that state-dependent delays can reduce fluctuations below the baseline from burst frequency, and stochastic simulations confirm these predictions. Incorporating negative feedback in burst production further suppresses variability while maintaining stable system behavior. These findings highlight how temporal, state-dependent regulation stabilizes protein expression and can inform the design of robust synthetic gene circuits.

16:30-16:45, Paper WeC17.5	Add to My Program
Bayesian Nonlinear State-Space Modeling and Chance-Constrained Predictive Control for Hemodynamic Resuscitation

Estiri, Elham	Kent State University
Mirinejad, Hossein	Kent State University
Keywords: Predictive control for nonlinear systems, Biomedical, Stochastic optimal control Abstract: This paper presents a unified framework for uncertainty-aware modeling and stochastic control of fluid resuscitation. A Bayesian nonlinear state–space model (BNSSM) is developed to generate calibrated multi-step predictions of mean arterial pressure (MAP) while jointly propagating aleatoric and epistemic uncertainty. Monte Carlo rollouts of the learned model are integrated into a chance-constrained stochastic model predictive control (CC-SMPC) formulation with joint uncertainty, which enforces safety bounds on MAP and infusion rates. The accuracy of the BNSSM was evaluated on real hemorrhage–resuscitation data, demonstrating its ability to capture physiologically consistent MAP dynamics. Furthermore, the proposed CC-SMPC controller outperformed its deterministic counterpart in fluid dose adjustment, achieving safer and more precise regulation of MAP. These results highlight the framework’s ability to jointly address epistemic and aleatoric uncertainty, providing a more reliable foundation for closed-loop fluid management in critical care.


WeC18 Regular Session, Churchill A2	Add to My Program
Constrained Control III

Chair: Nicotra, Marco M	University of Colorado Boulder

15:30-15:45, Paper WeC18.1	Add to My Program
From Bundles to Backstepping: Geometric Control Barrier Functions for Safety-Critical Control on Manifolds

de Sa, Massimiliano	California Institute of Technology
Ong, Pio	California Institute of Technology
Ames, Aaron D.	California Institute of Technology
Keywords: Constrained control, Algebraic/geometric methods, Lyapunov methods Abstract: Control barrier functions (CBFs) have a well-established theory in Euclidean spaces, yet still lack general formulations and constructive synthesis tools for systems evolving on manifolds common in robotics and aerospace applications. In this paper, we develop a general theory of geometric CBFs on bundles and, for control-affine systems, recover the standard optimization-based CBF controllers and their smooth analogues. Then, by generalizing kinetic energy-based CBF backstepping to Riemannian manifolds, we provide a constructive CBF synthesis technique for geometric mechanical systems, as well as easily verifiable conditions under which it succeeds. Further, this technique utilizes mechanical structure to avoid computations on higher-order tangent bundles. We demonstrate its application to an underactuated satellite on SO(3).

15:45-16:00, Paper WeC18.2	Add to My Program
Importance of the Input Cost When Designing Control Barrier Function Safety Filters

Freire, Victor	University of Colorado Boulder
Nicotra, Marco M	University of Colorado Boulder
Keywords: Constrained control Abstract: Control barrier function (CBF)-based safety filters seek to enforce constraints by ``minimally adjusting'' a nominal control input. This minimal adjustment is usually measured using a simple Euclidean norm in input space. In this work, we study how the use of weighed norms can affect the performance of CBF safety filters. Specifically, we seek to minimize the impact of the input error on the nominal performance, as opposed to minimizing the input error itself. Numerical simulations show that the proper weighing of the cost function of the CBF quadratic program can have a profound effect on the overall performance of the closed-loop system.

16:00-16:15, Paper WeC18.3	Add to My Program
Probabilistic Control Barrier Functions: Safety in Probability for Discrete-Time Stochastic Systems

Mestres, Pol	California Institute of Technology
Werner, Blake	California Institute of Technology
Cosner, Ryan	California Institute of Technology
Ames, Aaron D.	California Institute of Technology
Keywords: Constrained control Abstract: Control systems operating in the real world face countless sources of unpredictable uncertainties. These random disturbances can render deterministic guarantees inapplica- ble and cause catastrophic safety failures. To overcome this, this paper proposes a method for designing safe controllers for discrete-time stochastic systems that retain probabilistic guarantees of safety. To do this we modify the traditional notion of a control barrier function (CBF) to explicitly account for these stochastic uncertainties and call these new modified functions probabilistic CBFs. We show that probabilistic CBFs can be used to design controllers that guarantee safety over a finite number of time steps with a prescribed probability. Next, by leveraging various uncertainty quantification methods, such as concentration inequalities and the scenario approach, we provide a variety of sufficient conditions that result in computationally tractable controllers with tunable probabilistic guarantees across a plethora of practical scenarios. Finally, we showcase the applicability of our results in simulation and hardware for the control of a quadruped robot.

16:15-16:30, Paper WeC18.4	Add to My Program
Safe Trajectory Planning with Bernstein Polynomials under Control Barrier Function Constraints

Lee, Jungeun	UNIST
Lee, Seongjae	Ulsan National Institute of Science and Technology (UNIST)
Jeon, Jeong hwan	Ulsan National Institute of Science and Technology
Keywords: Constrained control, Predictive control for nonlinear systems, Optimal control Abstract: Ensuring safe trajectory planning of autonomous mobile robots in environments with static and dynamic obstacles is a critical challenge for real-world deployment. Existing approaches, such as nonlinear model predictive control (NMPC) with control barrier functions (CBFs), can provide safety guarantees. However, their computational complexity scales with the prediction horizon, which can hinder real-world feasibility. We propose an NMPC algorithm that combines discrete-time CBFs with Bernstein polynomial parameterization. By constraining a constant number of Bernstein control points within a dynamically generated safe corridor, the resulting trajectory is guaranteed to remain forward invariant within the safe set. This formulation also reduces computational complexity compared with conventional approaches. Simulation results in both static and dynamic obstacle environments show that the proposed method achieves a high success rate while maintaining safety margins comparable to existing baselines. At the same time, computation time is significantly reduced and remains nearly constant as the prediction horizon increases. These results demonstrate the efficiency, robustness, and scalability of the proposed approach for safe trajectory planning in diverse scenarios.

16:30-16:45, Paper WeC18.5	Add to My Program
Barrier-Riccati Synthesis for Nonlinear Safe Control with Expanded Region of Attraction

Almubarak, Hassan	King Fahd University of Petroleum and Minerals
AL-Sunni, Maitham	Carnegie Mellon University
Dubbin, Justin	Georgia Institute of Technology
Sadegh, Nader	Georgia Inst. of Tech
Dolan, John	Carnegie Mellon University
Theodorou, Evangelos A.	Georgia Institute of Technology
Keywords: Constrained control, Stability of nonlinear systems, Optimal control Abstract: We present a Riccati-based framework for safety-critical nonlinear control that integrates the barrier states (BaS) methodology with the State-Dependent Riccati Equation (SDRE) approach. The BaS formulation embeds safety constraints into the system dynamics via auxiliary states, enabling safety to be treated as a control objective. To overcome the limited region of attraction in linear BaS controllers, we extend the framework to nonlinear systems using SDRE synthesis applied to the barrier-augmented dynamics and derive a matrix inequality condition that certifies forward invariance of a large region of attraction and guarantees asymptotic safe stabilization. The resulting controller is computed online via pointwise Riccati solutions. We validate the method on an unstable constrained system and cluttered quadrotor navigation tasks, demonstrating improved constraint handling, scalability, and robustness near safety boundaries. This framework offers a principled and computationally tractable solution for synthesizing nonlinear safe feedback in safety-critical environments.

16:45-17:00, Paper WeC18.6	Add to My Program
Autonomy Architectures for Safe Planning in Unknown Environments under Budget Constraints

Cherenson, Daniel	University of Michigan
Agrawal, Devansh Ramgopal	University of Michigan
Panagou, Dimitra	University of Michigan, Ann Arbor
Keywords: Constrained control, Autonomous robots Abstract: Mission planning can often be formulated as a constrained control problem under multiple path constraints (i.e., safety constraints) and budget constraints (i.e., resource expenditure constraints). In a priori unknown environments, verifying that an offline solution will satisfy the constraints for all time can be difficult, if not impossible. We present ReRoot, a novel sampling-based framework that enforces safety and budget constraints for nonlinear systems in unknown environments. The main idea is that ReRoot grows multiple reverse RRT* trees online, starting from renewal sets, i.e., sets where the budget constraints are renewed. The trees provide paths of minimal budget expenditure back to the renewal set, aiding the construction of safe dynamically feasible backup trajectories and providing a principled backup policy when integrated into the Gatekeeper safety verification architecture. We demonstrate our approach in simulation with a fixed-wing UAV in a GNSS-denied environment with a budget constraint on localization error that can be renewed at visual landmarks.


WeC19 Regular Session, Churchill B1	Add to My Program
Optimal Control II

Chair: Drgona, Jan	Johns Hopkins University
Co-Chair: Cowlagi, Raghvendra V.	Worcester Polytechnic Institute

15:30-15:45, Paper WeC19.1	Add to My Program
Safe Online Control-Informed Learning

Zhou, Tianyu	Purdue University
Liang, Zihao	Purdue University
Lu, Zehui	Independent Researcher
Mou, Shaoshuai	Purdue University
Keywords: Optimal control, Constrained control, Data driven control Abstract: This paper proposes a Safe Online Control- Informed Learning framework for safety-critical autonomous systems. The framework unifies optimal control, parameter estimation, and safety constraints into an online learning process. It employs an extended Kalman filter to incrementally update system parameters in real time, enabling robust and data-efficient adaptation under uncertainty. A softplus barrier function enforces constraint satisfaction during learning and control while eliminating the dependence on high-quality initial guesses. Theoretical analysis establishes convergence and safety guarantees, and the framework’s effectiveness is demonstrated on cart-pole and robot-arm systems.

15:45-16:00, Paper WeC19.2	Add to My Program
Trajectory Optimization for Cooperative Navigation by Solving the HJB Equation Using PINNs

Wojciechowski, Maria R.	Worcester Polytechnic Institute
Deshamouni, Jayaratna	Shrewsbury High School
Steffens, Michael	Draper
Cowlagi, Raghvendra V.	Worcester Polytechnic Institute
Keywords: Optimal control, Cooperative control, Neural networks Abstract: Cooperative navigation is a methodology where multiple mobile vehicles share navigational aiding information with the aim of improving the precision and accuracy of localization. This methodology can enable teams of autonomous mobile vehicles to navigate in, say, undersea and subterranean environments where conventional localization methods based on global satellite navigation systems and/or vision do not suffice. In such environments, information can be shared when vehicles are in close proximity. To this end, one approach is to assign the role of a cooperative navigation aid (CNA) to specific vehicles in the team, and to then optimize the trajectories of the remaining vehicles to achieve sufficient proximity to the CNA while conducting their tasks. We formulate this problem as an optimal control problem, where proximity to a moving CNA constitutes a "benefit" to be maximized while traversing between two prespecified fixed locations in the environment. We develop a physics-informed neural network (PINN)-based solution of the Hamilton-Jacobi-Bellman (HJB) equation for the problem of optimizing a weighted composite objective of maximizing the aid received from a CNA while also minimizing the time of travel. We describe the details of the PINN architecture and demonstrate its effectiveness via numerical simulations.

16:00-16:15, Paper WeC19.3	Add to My Program
Adaptive Input Shaper Design for Unknown Second-Order Systems with Real-Time Parameter Estimation

Aung, Nyi Nyi	Louisiana State University
Wight, Bradley	Louisiana State University
Stein, Adrian	Louisiana State University
Keywords: Optimal control, Estimation, Adaptive control Abstract: This work addresses the problem of designing an input shaper for an unknown system, where the switch time is highly sensitive to the system’s natural frequency. Applying the input-shaped reference either before or after the switch time typically results in poor performance. To overcome this challenge, a feedforward control method with an online parameter estimator is proposed for black-box second-order systems, in which the system parameters are estimated in real time and the input shaper is adapted accordingly. This approach allows the input-shaped reference to be applied even if the first switch time is missed, as the method accounts for the system’s periodic switch time and enables periodic application of the input shaper. Simulation results demonstrate accurate online parameter estimation and reference tracking with complete vibration suppression, achieved through the proposed adaptive input shaper design.

16:15-16:30, Paper WeC19.4	Add to My Program
Regret of H_infty Preview Controllers

Liu, Jietian	University of Michigan
Seiler, Peter	University of Michigan, Ann Arbor
Keywords: Optimal control, H-infinity control, Linear systems Abstract: This paper studies preview control in both the H_infty and regret-optimal settings. The plant is modeled as a discrete-time, linear time-invariant system subject to external disturbances. The performance baseline is the optimal non-causal controller that has full knowledge of the disturbance sequence. We first review the construction of the H_infty preview controller with p-steps of disturbance preview. We then show that the closed-loop H_infty performance of this preview controller converges as pto infty to the performance of the optimal non-causal controller. Furthermore, we prove that the optimal regret of the preview controller converges to zero. These results demonstrate that increasing preview length allows controllers to asymptotically achieve non-causal performance in both the H_infty and regret frameworks. A numerical example illustrates the theoretical results.

16:30-16:45, Paper WeC19.5	Add to My Program
Combined Learning and Control: A New Paradigm for Optimal Control with Unknown Dynamics

Kounatidis, Panagiotis	Cornell University
Malikopoulos, Andreas A.	Cornell University
Keywords: Optimal control, Learning Abstract: In this paper, we present the combined learning-and-control (CLC) approach, which is a new way to solve optimal control problems with unknown dynamics by unifying model-based control and data-driven learning. The key idea is simple: we design a controller to be optimal for a proxy objective built on an available model while penalizing mismatches with the real system, so that the resulting controller is also optimal for the actual system. Building on the original CLC formulation, we demonstrate the framework to the linear–quadratic regulator problem and make three advances: (i) we show that the CLC penalty is a sequence of stage-specific weights rather than a single constant; (ii) we identify when these weights can be set in advance and when they must depend on the (unknown) dynamics; and (iii) we develop a lightweight learning loop that tunes the weights directly from data without abandoning the benefits of a model-based design. We provide a complete algorithm and an empirical study against common baseline methods. The results clarify where prior knowledge suffices and where learning is essential, and they position CLC as a practical, theoretically grounded bridge between classical optimal control and modern learning methods.

16:45-17:00, Paper WeC19.6	Add to My Program
Zero-Shot Transferable Solution Method for Parametric Optimal Control Problems

Li, Xingjian	University of Texas at Austin
Kan, Kai Fung (Kelvin)	UCLA
Verma, Deepanshu	Clemson University
Kumar, Krishna	University of Texas at Austin
Osher, Stanley	University of California, Los Angeles
Drgona, Jan	Johns Hopkins University
Keywords: Optimal control, Machine learning, Neural networks Abstract: This paper presents a transferable solution method for optimal control problems with varying objectives using function encoder (FE) policies. Traditional optimization-based approaches must be re-solved whenever objectives change, resulting in prohibitive computational costs for applications requiring frequent evaluation and adaptation. The proposed method learns a reusable set of neural basis functions that spans the control policy space, enabling efficient zero-shot adaptation to new tasks through either projection from data or direct mapping from problem specifications. The key idea is an offline–online decomposition: basis functions are learned once during offline imitation learning, while online adaptation requires only lightweight coefficient estimation. Numerical experiments across diverse dynamics, dimensions, and cost structures show our method delivers near-optimal performance with minimal overhead when generalizing across tasks, enabling semi-global feedback policies suitable for real-time deployment.


WeC20 Regular Session, Churchill B2	Add to My Program
Model Predictive Control I

Chair: Trimboli, Michael	University of Colorado, Colorado Springs
Co-Chair: Schitz, Philipp	German Aerospace Center (DLR)

15:30-15:45, Paper WeC20.1	Add to My Program
A Comparison of MPC Architectures for a Heat Pump

Quah, Titus	University of California Santa Barbara
Bortoff, Scott A.	Mitsubishi Electric Research Laboratories
Rawlings, James B.	University of California, Santa Barbara
Keywords: Predictive control for linear systems, Building and facility automation, Process Control Abstract: Modern heat pumps are strongly interactive multivariable systems, requiring rigorous application of multivariable control theory. Important factors to consider are robustness with respect to plant uncertainty, and enforcement of process constraints. This paper summarizes the design of two Model Predictive Control (MPC) architectures for a heat pump application, and compares their design methodologies and performance to a conventional selector-based multi-loop PID architecture. One MPC architecture is an H-infinity loop-shaped MPC that uses inverse optimal control to realize the robustifing compensator as a constrained optimization to enforce constraints. The second is an offset-free MPC architecture that preserves output tracking despite plant-model mismatch and unmeasured disturbances and can be tuned using operational data. We compare the two MPCs and the PID on a constrained heat-pump model, assessing closed-loop transients, robustness margins, and tuning complexity. In simulation, all controllers track set-points with similar performance. During sensible heat disturbance, a temperature limit prevents full rejection; even so, both MPC designs counterintuitively reduce the outdoor fan speed -- which, through coupled system interactions, increases net heat removal and yields improved disturbance rejection without violating constraints. As for robustness, all three designs meet disk-margin targets. For tuning complexity, offset-free MPC > H-infinity loop-shaped MPC > PID. Overall, on this plant, robustness is a tie; offset-free MPC delivers the strongest constrained disturbance handling but requires the most tuning, H-infinity loop-shaped MPC is the middle ground, and PID is simplest.

15:45-16:00, Paper WeC20.2	Add to My Program
Adaptive-R Dual-Mode Model Predictive Control

Undare, Suchita Anil	University of Colorado Colorado Springs
Karami, Kiana	Penn State Harrisburg
Trimboli, Michael	University of Colorado, Colorado Springs
Keywords: Predictive control for linear systems, Constrained control, Optimal control Abstract: We introduce Adaptive-R Dual-Mode Model Predictive Control (Adaptive-R DMPC), a systematic framework in which the input weight R is scheduled based on set membership over nested regions χCPI (constrained), χADAP (adaptive), and χMPI (invariant). Conventional DMPC reduces computation by switching from constrained MPC to LQR, but this switch at the terminal set may induce abrupt changes in the control input at the MPC–LQR transition, which can result in non- smooth state evolution and thereby motivate the proposed adaptive formulation. In our approach, the input weight R is scheduled inside an intermediate region χADAP, blending between a conservative and nominal value so that the feedback gain converges smoothly to KLQR without solving QPs in this region. This construction preserves recursive feasibility and stability while markedly reducing the number of QP solves and total computation time.

16:00-16:15, Paper WeC20.3	Add to My Program
Inexact-ADMM for Embedded Linear Model Predictive Control

Adegbege, Ambrose Adebayo	The College of New Jersey
Oluleti, Victor Pelumi	Obafemi Awolowo University
Harish, Nia	The College of New Jersey
Keywords: Predictive control for linear systems, Constrained control, Optimization algorithms Abstract: We consider an inexact variant of the Alternating Direction Method of Multipliers (ADMM) for quadratic programming problems commonly encountered in embedded model predictive control (MPC). The inexact-ADMM method retains the convergence properties and behavior of standard-ADMM but with reduced complexity of the first inner subproblem, involving only simple over-relaxed Uzawa steps. We establish a convergence rate of O(1/k) for the primal-dual gap, where k is the iteration count, and time complexity per iteration of O(N), where N is the prediction horizon. We showcase the efficiency and performance of the inexact-ADMM compared to the standard ADMM using a well-known numerical example.

16:15-16:30, Paper WeC20.4	Add to My Program
Robust Distributed Model Predictive Control under Limited Communication Rates and DoS Attacks

Cao, Xinyuan	Shanghai Jiao Tong University
Dai, Yufan	University of Victoria
Li, Shaoyuan	Shanghai Jiao Tong University
Shi, Yang	University of Victoria
Keywords: Predictive control for linear systems, Distributed control, Quantized systems Abstract: This paper proposes a distributed model predictive control (DMPC) strategy for multi-agent systems (MASs) under the combined challenges of disturbances, limited network communication rates, and denial of service (DoS) attacks. In contrast to existing approaches, this paper integrates robust resilient control strategies and quantized communication mechanisms within the DMPC framework to jointly address the multiple challenges. Specifically, tightened robustness constraints are first introduced to handle the external disturbances. Then, a lengthened and quantized transmission method is developed, where the lengthened strategy preserves system coordination against DoS attacks and the quantized encoder/decoder scheme significantly reduces communication cost and suppresses quantization error. Furthermore, a rigorous theoretical analysis is provided to establish the recursive feasibility and ensure stability of the closed-loop system. Finally, simulation results validate the performance of the proposed framework.

16:30-16:45, Paper WeC20.5	Add to My Program
Robust Helicopter Ship Deck Landing with Guaranteed Timing Using Shrinking-Horizon Model Predictive Control

Schitz, Philipp	German Aerospace Center (DLR)
Mercorelli, Paolo	Leuphana University of Lüneburg
Dauer, Johann C.	DLR (German Aerospace Center)
Keywords: Predictive control for linear systems, Flight control, Autonomous systems Abstract: Autonomous helicopter ship deck landing requires precise timing, robustness to disturbances, and strict constraint satisfaction. This paper presents a trajectory planning approach based on robust shrinking-horizon Model Predictive Control (SHMPC) with move blocking, enabling computationally efficient planning over long horizons while guaranteeing maneuver completion within a specified time. A simplified control-oriented helicopter model is introduced for trajectory generation, while attitude stabilization is handled by an inner-loop controller. Robustness is achieved using a tube-based formulation with a disturbance-observer-based ancillary controller. A terminal set designed via backward reachability guarantees that the helicopter reaches a state from which a safe touchdown can occur within a prescribed time window. Simulations using a nonlinear model of DLR’s midiARTIS demonstrator under strong wind and ship motion demonstrate accurate tracking, constraint satisfaction, and real-time feasibility.


WeC21 Regular Session, Churchill C1	Add to My Program
Optimization III

Chair: Lin, Zongli	University of Virginia
Co-Chair: Canova, Marcello	The Ohio State University

15:30-15:45, Paper WeC21.1	Add to My Program
Non-Ergodic Convergence Algorithms for Distributed Consensus and Coupling-Constrained Optimization

Qiu, Chenyang	University of Virginia
Lin, Zongli	University of Virginia
Keywords: Optimization algorithms, Large-scale systems Abstract: We study distributed convex optimization with two ubiquitous forms of coupling: consensus constraints and global affine equalities. We first design a linearized method of multipliers for the consensus optimization problem. Without smoothness or strong convexity, we establish non-ergodic sublinear rates of order O(1/sqrt{k}) for both the objective optimality and the consensus violation. Leveraging duality, we then show that the economic dispatch problem admits a dual consensus formulation, and that applying the same algorithm to the dual economic dispatch yields non-ergodic O(1/sqrt{k}) decay for the error of the summation of the cost over the network and the equality-constraint residual under convexity and Slater’s condition. Numerical results on the IEEE 118-bus system demonstrate faster reduction of both objective error and feasibility error relative to the state-of-the-art baselines, while the dual variables reach network-wide consensus.

15:45-16:00, Paper WeC21.2	Add to My Program
DC Fast Charging Station Queue Minimization through Dynamic Pricing

Abbasi, Mohammad Hossein	Clemson University
Zhang, Jiangfeng	Clemson University
Krovi, Venkat	Clemson University
Keywords: Optimization algorithms, Lyapunov methods, Optimization Abstract: This paper presents a novel approach to minimizing queues in DC fast charging stations (FCSs). The number of queuing electric vehicles (EVs) is explicitly modeled as a decision variable, defined as a function of departing EVs after completing their charging sessions. To enhance profitability, the objective function also accounts for FCS operating costs, while charging prices are dynamically adjusted based on user price sensitivity. The optimization problem is formulated within the Lyapunov Optimization (LO) framework, which decomposes it into one time step subproblems and eliminates the need for forecasting EV arrivals, while guaranteeing global optimality. Rigorous analysis establishes the theoretical guarantees of the proposed method. Furthermore, a Pareto Front study is conducted to support parameter tuning. Simulation results show that an FCS with four charging ports can serve 100 EVs per day with an average of only eight vehicles in the queue, illustrating the strength of the proposed strategy.

16:00-16:15, Paper WeC21.3	Add to My Program
Optimal Multimarginal Schrödinger Bridge: Minimum Spanning Tree Over Measure-Valued Vertices

Bondar, Georgiy Antonovich	UC Santa Cruz
Halder, Abhishek	Iowa State University
Keywords: Optimization algorithms, Machine learning, Computational methods Abstract: The Multimarginal Schrödinger Bridge (MSB) finds the optimal coupling among a collection of random vectors with known statistics and a known correlation structure. In the MSB formulation, this correlation structure is specified a priori as an undirected connected graph with measure-valued vertices. In this work, we formulate and solve the problem of finding the optimal MSB in the sense we seek the optimal coupling over all possible graph structures. We find that computing the optimal MSB amounts to solving the minimum spanning tree problem over measure-valued vertices. We show that the resulting problem can be solved in two steps. The first step constructs a complete graph with edge weight equal to a sum of the optimal value of the corresponding bimarginal SB and the entropies of the endpoints. The second step solves a standard minimum spanning tree problem over that complete weighted graph. Numerical experiments illustrate the proposed solution.

16:15-16:30, Paper WeC21.4	Add to My Program
Temporal Variabilities Limit Convergence Rates in Gradient-Based Online Optimization

Van Scoy, Bryan	Miami University
Bianchin, Gianluca	University of Louvain
Keywords: Optimization algorithms, Optimization, Linear systems Abstract: This paper investigates the fundamental performance limits of gradient-based algorithms for time-varying optimization. Leveraging the internal model principle and root locus techniques, we show that temporal variabilities impose intrinsic limits on the achievable rate of convergence. For a problem with condition ratio kappa and temporal variability described by a model of degree n, we show that the worst-case convergence rate of any minimal-order gradient-based algorithm (having access to a model of temporal variation) is rho_text{TV} = (frac{kappa-1}{kappa+1})^{1/n}. This bound reveals a fundamental tradeoff between problem conditioning, temporal complexity, and rate of convergence. We further construct explicit controllers that attain the bound for low-degree models of time variation.

16:30-16:45, Paper WeC21.5	Add to My Program
Quadratic Surrogate Attractor for Particle Swarm Optimization

Clemente, Maurizio	The Ohio State University
Canova, Marcello	The Ohio State University
Keywords: Optimization algorithms, Optimization, Numerical algorithms Abstract: This paper presents a particle swarm optimization algorithm that leverages surrogate modeling to replace the conventional global best solution with the minimum of an n-dimensional quadratic form, providing a better-conditioned dynamic attractor for the swarm. This refined convergence target, informed by the local landscape, enhances resilience against premature convergence and noisy environments while incurring only minimal computational overhead. The surrogate-augmented approach is evaluated against the standard algorithm through a numerical study on a set of benchmark optimization functions that exhibit diverse landscapes. To ensure statistical significance, 400 independent runs are conducted for each function and algorithm, and the results are analyzed based on their statistical characteristics and corresponding distributions. The quadratic surrogate attractor consistently outperforms the conventional algorithm across all tested functions. The improvement is particularly pronounced for quasi-convex functions, where the surrogate model can exploit the underlying convex-like structure of the landscape.

16:45-17:00, Paper WeC21.6	Add to My Program
Speed Limitations of Gradient-Based Algorithms for Heterogeneous Optimization

Wu, Wuwei	City University of Hong Kong
Wang, Miaomiao	City University of Hong Kong
Colaneri, Patrizio	Politecnico Di Milano
Chen, Jie	City University of Hong Kong
Keywords: Optimization algorithms, Optimization, Robust control Abstract: Modern large-scale optimization problems often involve heterogeneous components with diverse properties essential to the performance of optimization algorithms, which pose challenges for designing fast and efficient algorithms. This paper tackles the heterogeneity issue through a control-theoretic lens by modeling gradient-based optimization algorithms as feedback systems and utilizing frequency-domain tools, including Nevanlinna-Pick interpolation theory. This perspective enables a unified treatment of parallel and serial gradient-based algorithms, and allows us to establish the fastest achievable convergence rates for both architectures under the circle and Zames-Falb criteria. The results reveal a fundamental trade-off: when parallel computation is available, parallel algorithms achieve superior real-time performance, whereas in resource-constrained settings, serial algorithms offer faster iteration rates. The proposed approach provides a systematic foundation for analyzing and designing high-performance optimization algorithms in heterogeneous computing environments.


WeC22 Invited Session, Churchill C2	Add to My Program
Estimation and Control of Distributed Parameter Systems III

Chair: Hu, Weiwei	University of Georgia
Co-Chair: Demetriou, Michael A.	Worcester Polytechnic Institute
Organizer: Demetriou, Michael A.	Worcester Polytechnic Institute
Organizer: Hu, Weiwei	University of Georgia

15:30-15:45, Paper WeC22.1	Add to My Program
Backstepping Stabilization of infty+infty Continua of Linear Hyperbolic Systems (I)

Humaloja, Jukka-Pekka	Technical University of Crete
Bekiaris-Liberis, Nikolaos	Technical University of Crete
Keywords: Distributed parameter systems Abstract: We consider the boundary stabilization problem for continua of PDE systems that are viewed as the limits of n+m linear hyperbolic systems, as n and m tend to infinity. We construct a full-state feedback law via introduction of a continuum-PDE backstepping transformation. The design procedure gives rise to two kernel equations evolving on a 4-D domain, whose well-posedness is established. We show stability of the continuum closed-loop system via construction of a novel Lyapunov functional. Effectiveness of the control design is illustrated in a numerical example.

15:45-16:00, Paper WeC22.2	Add to My Program
Optimal Sensor Location for Predictive Iterative Learning Control of Distributed Parameter Systems (I)

Patan, Maciej	University of Zielona Gora
Klimkowicz, Kamil	University of Zielona Gora
Patan, Krzysztof	University of Zielona Gora
Rogers, Eric	University of Southampton
Keywords: Iterative learning control, Distributed parameter systems, Estimation Abstract: This study aims to develop a robust computational scheme to address the optimal tracking control problem for repetitive distributed parameter systems, in situations where the controlled quantity is not directly observable. In such a case, the reliability of the model predictions becomes a crucial factor affecting the quality of control design in consecutive process replications. To maximize the accuracy of prediction, the optimal sensor location problem is considered. It consists in selecting gaged sites from among all available sites so that the suitable criterion defined on the Fisher information matrix associated with the model parameters is minimal. To effectively solve it, a relaxed convex optimization problem is formulated, and then suitable optimality conditions are provided. In result, the existing efficient convex optimization solvers can be easily applied. Then, the measurement locations providing the most informative system observations are further incorporated into the predictive control scheme based on the iterative learning control. The proposed approach is verified on the example of repetitive control for a thermophotovoltaic cell.

16:00-16:15, Paper WeC22.3	Add to My Program
Concurrent Design of Event-Triggered and Gain Scheduled Control for 2x2 Coupled Hyperbolic PDEs (I)

Somathilake, Eranda	Department of Mechanical and Aerospace Engineering, University of California San Diego
Diagne, Mamadou	University of California San Diego
Keywords: Distributed parameter systems, Switched systems, Stability of linear systems

16:15-16:30, Paper WeC22.4	Add to My Program
Existence and Uniqueness of the Solution to a Class of Fredholm Integral Equations Related to Difference Equations (I)

Bresch-Pietri, Delphine	Mines Paris -- PSL
Auriol, Jean	Université Paris-Saclay, CNRS, CentraleSupélec, Laboratoire Des Signaux Et Systèmes
Keywords: Delay systems, Distributed parameter systems Abstract: This paper addresses the existence and uniqueness of the solution to a class of Fredholm integral equations associated with scalar Linear Integral Delay Equations (LIDEs). Based on an operator-theoretic framework involving transport partial differential equations, we provide general conditions that guarantee well-posedness of these equations. Leveraging this result, we introduce a constructive approach for the numerical computation of the Lyapunov matrix corresponding to scalar LIDEs with commensurate delays. Specifically, the Lyapunov matrix equations are reformulated as the Fredholm integral equation under consideration, for which the proposed conditions are satisfied under exponential stability of the LIDE. The resulting integral equation can then be discretized, and the linear system efficiently solved to compute the Delay Lyapunov matrix. Numerical examples illustrate the effectiveness and applicability of this methodology.

16:30-16:45, Paper WeC22.5	Add to My Program
SEC-Two: Secure Estimation Over Two Channels (I)

Porzio, Jacopo	KTH Royal Institute of Technology
Kanellopoulos, Aris	KTH Royal Institute of Technology
Alisic, Rijad	Massachusetts Institute of Technology
Dán, György	KTH - Royal Institute of Technology
Sandberg, Henrik	KTH Royal Institute of Technology
Keywords: Filtering, Emerging control applications, Fault detection Abstract: We characterize the impact of single-time-step sensor attacks on Kalman filtering and fixed-point optimal smoothing for state estimation in linear time-invariant systems over a finite time window. We introduce a single noiseless, unattacked, full-state (perfect) measurement into the estimation process to reduce the attack's impact on the estimation process and increase the probability of detection under static chi-squared tests. In the scalar case, we show that smoothing previous estimates with the perfect measurement to update detection statistics achieves higher attack detectability at the attack time than online Kalman filter--based statistics. Numerical experiments suggest that our findings generalize to multi-dimensional systems.

16:45-17:00, Paper WeC22.6	Add to My Program
Safe Trajectory Tracking of the Stefan Problem with Second-Order Moving Boundary Dynamics

Koga, Shumon	Kobe University
Krstic, Miroslav	University of California, San Diego
Keywords: Distributed parameter systems, Constrained control, Process Control Abstract: This paper considers a safe trajectory tracking of the Stefan problem with a second-order moving boundary dynamics. The model is given by a parabolic Partial Differential Equation (PDE) defined on a time-varying domain of moving boundary governed by a second-order Ordinary Differential Equation (ODE) associated with the Neumann boundary condition. A feedforward control is designed by a series expansion approach to solve the inverse Stefan problem under given reference trajectory of the moving boundary, and the convergence of infinite series is proven. A trajectory tracking controller is derived based on an energy-shaping, which ensures the safety of the model constraint in the closed-loop system. The closed-loop system is also shown to be globally exponentially stable with respect to the tracking error by performing PDE backstepping transformation and Lyapunov analysis. Numerical simulation illustrates an effective tracking performance of the proposed method under a sinusoidal reference trajectory.

Technical Program for Wednesday May 27, 2026