ACC 2026 Program | Friday May 29, 2026


FrSP1 Plenary Session, Grand Ballroom C	Add to My Program
From Covariance Control to Covariance Steering: The First 40 Years

Chair: Hale, Matthew	Georgia Institute of Technology
Co-Chair: Panagou, Dimitra	University of Michigan, Ann Arbor

08:30-09:30, Paper FrSP1.1	Add to My Program
From Covariance Control to Covariance Steering: The First 40 Years

Tsiotras, Panagiotis	Georgia Institute of Technology


FrE1 Eckman Plenary Session, Grand Ballroom D	Add to My Program
Convex and Nonconvex Optimization for Control

Chair: Marden, Jason R.	University of California, Santa Barbara
Co-Chair: Touri, Behrouz	University of Illinois at Urbana Champaign

08:30-09:30, Paper FrE1.1	Add to My Program
Convex and Nonconvex Optimization for Control

Zheng, Yang	University of California San Diego


FrAP01 RI Session, Grand Ballroom B	Add to My Program
RI Poster Session - Day 3

Chair: Frew, Eric W.	University of Colorado, Bolder
Co-Chair: Casbeer, David W.	Air Force Research Laboratory


FrB03 Tutorial Session, Grand Salon 3	Add to My Program
Foundations in AI and Quantum Systems: From Control and Game-Theory to Stochastic Deception

Chair: Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Co-Chair: Tang, Michael	University of California, San Diego
Organizer: Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Organizer: Poveda, Jorge I.	University of California, San Diego

13:30-15:00, Paper FrB03.1	Add to My Program
A Tutorial for Foundations in AI and Quantum Systems: From Control and Game-Theory to Stochastic Deception (I)

Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Poveda, Jorge I.	University of California, San Diego
Anderson, Sean	University of California Santa Barbara
Hespanha, Joao P.	Univ. of California, Santa Barbara
Krstic, Miroslav	University of California, San Diego
Petersen, Ian R.	Australian National University
Tang, Michael	University of California, San Diego
Wadi, Ali	Georgia Institute of Technology
Zhu, Quanyan	New York University
Keywords: Quantum information and control, Game theory Abstract: In this tutorial, we explore the intersection of quantum control, game theory, and artificial intelligence deception, emphasizing their combined potential to enhance secure and strategic decision-making in complex systems. It examines how deceptive behavior can be understood and influenced within strategic frameworks, and outlines principles for designing resilient systems that leverage advantages in both quantum and learning-driven environments. While the discussion centers on specific examples, the foundational ideas are broadly applicable across a range of domains.


FrB04 Invited Session, Grand Salon 4	Add to My Program
Safety Verification and Optimization Protocols for Spacecraft

Chair: Lippay, Zachary	Verus Research
Co-Chair: Soderlund, Alexander	The Ohio State University
Organizer: Petersen, Chris	University of Florida
Organizer: Phillips, Sean	Air Force Research Laboratory
Organizer: Soderlund, Alexander	The Ohio State University

13:30-13:45, Paper FrB04.1	Add to My Program
Safe Landing on Small Celestial Bodies with Gravitational Uncertainty Using Disturbance Estimation and Control Barrier Functions (I)

Arenas Uribe, Felipe	University of Kentucky
Seigler, Thomas Michael	University of Kentucky
Hoagg, Jesse B.	University of Kentucky
Keywords: Spacecraft control, Optimal control, Constrained control Abstract: Soft landing on small celestial bodies (SCBs) poses unique challenges, as uncertainties in gravitational models and poorly characterized, dynamic environments require a high level of autonomy. Existing control approaches lack formal guarantees for safety constraint satisfaction, necessary to ensure the safe execution of the maneuvers. This paper introduces a control that addresses this limitation by integrating trajectory tracking, disturbance estimation, and safety enforcement. An extended high-gain observer is employed to estimate disturbances resulting from gravitational model uncertainties. We then apply a feedback-linearizing and disturbance-canceling controller that achieves exponential tracking of reference trajectories. Finally, we use a control barrier function based minimum-intervention controller to enforce state and input constraints through out the maneuver execution. This control combines trajectory tracking of offline generated reference trajectories with formal guarantees of safety, which follows common guidance and control architectures for spacecraft and allows aggressive maneuvers to be executed without compromising safety. Numerical simulations using fuel-optimal trajectories demonstrate the effectiveness of the controller in achieving precise and safe soft-landing, highlighting its potential for autonomous SCB missions.

13:45-14:00, Paper FrB04.2	Add to My Program
Shrinking Horizon MPC for Safe Multi-Spacecraft Capture of a Non-Cooperative Target (I)

Kim, Taehyeun	University of Michigan
Girard, Anouck	University of Michigan, Ann Arbor
Kolmanovsky, Ilya V.	The University of Michigan
Keywords: Aerospace, Constrained control, Optimal control Abstract: This paper describes the application of a Shrinking Horizon Model Predictive Controller (SHMPC) to a cooperative spacecraft capture. SHMPC is used to coordinate multiple capturing spacecraft to specified non-equilibrium states corresponding to the capture of a rotating non-cooperative target subject to state and control constraints. Robustness to model mismatch, thrust uncertainty, and inexact optimization is illustrated through numerical simulations demonstrating the benefits of SHMPC over the following open-loop trajectories using conventional feedback. Enhancements to the baseline SHMPC formulation are proposed to reduce the computational effort.

14:00-14:15, Paper FrB04.3	Add to My Program
Perception-Integrated Safety Critical Control Via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting (I)

Tscholl, Dario	Georgia Institute of Technology
Nakka, Yashwanth Kumar	Georgia Institute of Technology
Gunter, Brian	Georgia Institute of Technology
Keywords: Predictive control for linear systems, Emerging control applications, Autonomous vehicles Abstract: This letter presents a perception-driven safety filter that converts each 3D Gaussian Splat (3DGS) into a closed-form forward collision cone. The resulting constraint admits a relative-degree-one control barrier function (CBF) that is enforced online via a quadratic program (QP). By leveraging the analytic geometry of Gaussian splats, the proposed formulation yields a continuous, closed-form representation of collision avoidance constraints that is both simple and computationally efficient. In contrast to distance-based CBFs, which typically become active only when the robot is already near an obstacle, the collision cone CBF activates proactively, enabling earlier corrective action and thus smoother avoidance behavior at lower computational cost. The method is validated on a large synthetic scene consisting of approximately 170k Gaussian splats. Compared to a state-of-the-art 3DGS planner, the proposed filter reduces computation time by a factor of 3 and notably decreases trajectory jerk while maintaining safety. The approach is entirely analytic, requires no high-order CBF (HOCBF), and extends naturally to robots with physical extent via principled Minkowski-sum inflation of the splats. These properties make the method well suited for real-time navigation in cluttered, perception-derived environments on Earth and in space.

14:15-14:30, Paper FrB04.4	Add to My Program
Analysis and Flight Validation of Momentum-Biased Attitude Stability During Open-Loop Deployment on NISAR

Rallapalli, Aditya	UR Rao Satellite Centre
Kakula, Ashok Kumar	U R Rao Satellite Centre(ursc) , Isro
Bates, David	Jet Propulsion Laboratory
Keywords: Spacecraft control, Stability of linear systems, Control applications Abstract: The deployment of large unfurlable reflectors presents significant challenges for spacecraft attitude stability, particularly when performed in open-loop (no control) mode due to uncertainties in deployment dynamics and limitations on active control authority. For the NASA–ISRO Synthetic Aperture Radar (NISAR) mission, a 12 m reflector was deployed in orbit using an open-loop sequence. To minimize drift from the Sun and ensure uninterrupted power generation, a momentumbiased configuration was implemented using reaction wheels. Unlike conventional spin-stabilization, which is unsuitable for reflector missions due to telemetry and telecommand pointing constraints, the reaction-wheel momentum bias provided controlled passive stability while maintaining spacecraft orientation. The bias value was derived through analytical modeling of spacecraft dynamics with reaction wheels under a smallangle approximation about the Earth-pointing reference frame, explicitly accounting for gravity-gradient torques and inertia properties. This analysis established stability margins and guided the selection of the commanded momentum bias. The approach also maximized the allowable open-loop duration, providing sufficient time for deployment even under contingency scenarios. Pre-flight predictions showed stable behavior with minimized solar and nadir drift, which was subsequently confirmed through on-orbit telemetry. Flight data demonstrated close agreement with analytical results, providing a flightproven validation of the momentum-biased attitude stability approach for large reflector deployment.

14:30-14:45, Paper FrB04.5	Add to My Program
Gradient-Based Constrained Spacecraft Trajectory Optimization with Convergence Guarantees

Zheng, Yifei	University of California San Diego
Dower, Peter M.	University of Melbourne
McEneaney, William M.	Univ. California San Diego
Keywords: Optimization algorithms, Computational methods, Aerospace Abstract: A trajectory optimization method is devised for a class of problems with state constraints. The method is conceptually a projected gradient descent method, where the convergence of the control process is proved without reference to any discretization, and is obtained directly from the convergence of the cost function under mild regularity conditions. A corresponding algorithm for spacecraft trajectory optimization is proposed, along with two numerical examples where terminal states are prescribed. The algorithm is able to generate trajectories that require less fuel than stationary-action trajectories.

14:45-15:00, Paper FrB04.6	Add to My Program
Convex Sum-Of-Squares Controller Design for Relative Orbital Motion about Circular Chief Orbits

Hoyos, Jose Daniel	Purdue University
Sikka, Sidhdharth	Purdue
Rai, Ayush	Purdue University
Mou, Shaoshuai	Purdue University
Keywords: Spacecraft control, Formal verification/synthesis, Optimization Abstract: A key challenge in relative orbital motion is the design of nonlinear controllers that provide formal stability guarantees for the nonlinear dynamics while systematically incorporating operational constraints such as actuator saturation and sensor field-of-view requirements. In this paper, we present a computational framework for designing certified nonlinear controllers for orbital control under circular chief orbit. Our approach provides a unified solution leveraging three key steps: polynomial lifting to express the non-polynomial orbital dynamics in a polynomial form, the use of a dual Lyapunov theory for stability certification, and sum-of-squares optimization for tractable controller synthesis. Numerical simulations confirm effective constraint handling, including input saturation and geometric keep-in regions, and enhanced robustness to significant state measurement noise in comparison to LQR.


FrB05 Tutorial Session, Grand Salon 6	Add to My Program
Control Via Incentives and Information in Socio-Technical Systems

Chair: Basar, Tamer	Univ of Illinois, Urbana-Champaign
Co-Chair: Savla, Ketan	University of Southern California
Organizer: Chiu, Chih-Yuan	Georgia Institute of Technology
Organizer: Ferguson, Bryce L.	Dartmouth College
Organizer: Brown, Philip N.	University of Colorado Colorado Springs
Organizer: Savla, Ketan	University of Southern California
Organizer: Wu, Manxi	University of California Berkeley
Organizer: Bose, Subhonmesh	University of Illinois at Urbana Champaign
Organizer: Velicheti, Raj Kiriti	University of Illinois at Urbana Champaign
Organizer: Basar, Tamer	Univ of Illinois, Urbana-Champaign

13:30-15:00, Paper FrB05.1	Add to My Program
Control Via Incentives and Information in Socio-Technical Systems: A Tutorial (I)

Basar, Tamer	Univ of Illinois, Urbana-Champaign
Bose, Subhonmesh	University of Illinois at Urbana Champaign
Brown, Philip N.	University of Colorado Colorado Springs
Chiu, Chih-Yuan	Georgia Institute of Technology
Ferguson, Bryce L.	Dartmouth College
Savla, Ketan	University of Southern California
Velicheti, Raj Kiriti	University of Illinois at Urbana Champaign
Wu, Manxi	University of California Berkeley
Maheshwari, Chinmay	Johns Hopkins University
Keywords: Game theory, Agents-based systems, Emerging control applications Abstract: Many shared public amenities such as transporta- tion networks and power grids suffer from congestion and depletion effects resulting from uncoordinated overconsumption by self-interested users. In response, researchers have long studied methods of encouraging pro-social consumption of shared societal resources. In this tutorial, we survey recent work in the area of behavior-influencing mechanisms in modern infrastructure-scale socio-technical systems. Two basic frame- works are considered. First, an informed social planner may use their informational advantage to influence behavior: our paper demonstrates a variety of situations in which interesting and counterintuitive principles emerge in this context. For instance, it is often the case that agents are better off if the planner reveals noisy information rather than acting with full transparency. We present a variety of distinct approaches and problem formulations for this information design setting, ranging from convergence results to analysis under nonstandard informational assumptions. Second, an infrastructure operator may leverage their monopoly power to use pricing policies to influence societal behavior. Here, we ask a robustness question and present recent work on methods of characterizing the sensitivity of social outcomes to variations in pricing policy.


FrB06 Invited Session, Grand Salon 7	Add to My Program
Networked Systems: Analysis and Control

Chair: Gracy, Sebin	South Dakota School of Mines and Technology
Co-Chair: Pare, Philip E.	Purdue University
Organizer: Gracy, Sebin	South Dakota School of Mines and Technology
Organizer: Pare, Philip E.	Purdue University
Organizer: Ishii, Hideaki	University of Tokyo

13:30-13:45, Paper FrB06.1	Add to My Program
Generating Differentially Private Networks with a Modified Erdős–Rényi Model (I)

Rao, Huaiyuan	Georgia Institute of Technology
Hawkins, Calvin	Georgia Institute of Technology
Benvenuti, Alexander	Georgia Institute of Technology
Hale, Matthew	Georgia Institute of Technology
Keywords: Network analysis and control Abstract: Differential privacy has been used to privately calculate numerous network properties, but existing approaches often require the development of a new privacy mechanism for each property of interest. Therefore, we present a framework for generating entire networks in a differentially private way. Differential privacy is immune to post-processing, which allows for any network property to be computed and analyzed for a private output network, without weakening its protections. We consider undirected networks and develop a differential privacy mechanism that takes in a sensitive network and outputs a private network by randomizing its edge set. We prove that this mechanism does provide differential privacy to a network's edge set, though it induces a complex distribution over the space of output graphs. We then develop an equivalent privacy implementation using a modified Erdős–Rényi model that constructs an output graph edge by edge, and it is efficient and easily implementable, even on large complex networks. Experiments implement epsilon-differential privacy with epsilon of 2.5 when computing graph Laplacian spectra, and these results show the proposed mechanism incurs 49.34% less error than the current state of the art.

13:45-14:00, Paper FrB06.2	Add to My Program
The Tragedy of the Commons in Multi-Population Resource Games (I)

Vahmian, Yamin	University of Colorado Colorado Springs
Paarporn, Keith	University of Colorado, Colorado Springs
Keywords: Game theory, Network analysis and control, Optimization Abstract: Self-optimizing behaviors can lead to outcomes where collective benefits are ultimately destroyed, a well-known phenomenon known as the ``tragedy of the commons". These scenarios are widely studied using game-theoretic approaches to analyze strategic agent decision-making. In this paper, we examine this phenomenon in a bi-level decision-making hierarchy, where low-level agents belong to multiple distinct populations, and high-level agents make decisions that impact the choices of the local populations they represent. We study strategic interactions in a context where the populations benefit from a common environmental resource that degrades with higher extractive efforts made by high-level agents. We characterize a unique symmetric Nash equilibrium in the high-level game, and investigate its consequences on the common resource. While the equilibrium resource level degrades as the number of populations grows large, there are instances where it does not become depleted. We identify such regions, as well as the regions where the resource does deplete.

14:00-14:15, Paper FrB06.3	Add to My Program
A Passivity-Agnostic Framework for Distributed Adaptive Synchronization under Unknown Leader Dynamics (I)

Wafi, Moh. Kamalul	Northeastern University
Montazeri Hedesh, Hamidreza	Northeastern University
Siami, Milad	Northeastern University
Keywords: Adaptive control, Networked control systems, Distributed control Abstract: We present a passivity-agnostic framework for distributed adaptive synchronization under position-only communication, bounded disturbances, and unknown leader dynamics. By passivity-agnostic we mean the design does not require the closed loop to be SPR a priori: it certifies SPR when present and recovers it by frequency shaping when absent. Followers are heterogeneous second-order systems with unknown (possibly unstable) dynamics. In the SPR regime, a structured reparameterization yields gradient-based adaptive error dynamics; Lyapunov analysis guarantees global asymptotic synchronization in the disturbance-free case, exact rejection of constant disturbances, and bounded responses to time-varying disturbances, with parameter convergence under persistent excitation. In the non-SPR regime, frequency shaping recovers effective passivity of the unshaped transfer function, enabling the same stability guarantees via standard passivity/Lyapunov arguments using Meyer-Kalman-Yakubovich (MKY) Lemma. Simulations across star, cyclic, path, and arbitrary graphs demonstrate scalable synchronization, robust tracking, and parameter adaptation under multiple disturbance profiles, confirming that the frequency-shaped non-SPR designs match the performance of the SPR case.

14:15-14:30, Paper FrB06.4	Add to My Program
Regulation of Rumor Propagation Via (Multi-Leader) Stackelberg Graphon Games (I)

Liu, Huaning	University of Illinois at Urbana-Champaign
Dayanikli, Gokce	University of Illinois Urbana-Champaign
Keywords: Mean field games, Game theory, Control applications Abstract: We study the control of rumor propagation in large networked populations by using Stackelberg graphon games. We first introduce a principal who wants to incentivize the spread of her preferred news and discourage the spread of non-preferred news. We define the Stackelberg graphon game equilibrium (SGGE), characterize the graphon game Nash equilibrium (GGNE) with a forward-backward differential equation system, and establish existence results. We further formulate a multi-leader model with two competing principals, each incentivizing her own preferred news. Finally, we propose a bi-level algorithm for computing (multi-leader) Stackelberg graphon game equilibria and conclude with numerical experiments where we show that existence competing principals will result in strong opinion divisions in the population.

14:30-14:45, Paper FrB06.5	Add to My Program
Finding Super-Spreaders in SIS Epidemics (I)

Sridhar, Anirudh	New Jersey Institute of Technology
Ghosh, Arnob	New Jersey Institute of Technology
Keywords: Network analysis and control, Markov processes, Estimation Abstract: In network epidemic models, controlling the spread of a disease often requires targeted interventions such as vaccinating high-risk individuals based on network structure. However, typical approaches assume complete knowledge of the underlying contact network, which is often unavailable. While network structure can be learned from observed epidemic dynamics, existing methods require long observation windows that may delay critical interventions. In this work, we show that full network reconstruction may not be necessary: control-relevant features, such as high-degree vertices (super-spreaders), can be learned far more efficiently than the complete structure. Specifically, we develop an algorithm to identify such vertices from the dynamics of a Susceptible-Infected-Susceptible (SIS) process. We prove that in an n-vertex graph, vertices of degree at least n^alpha can be identified over an observation window of size Omega (1/alpha), for any alpha in (0,1). In contrast, existing methods for exact network reconstruction requires an observation window that grows linearly with n. Simulations demonstrate that our approach accurately identifies super-spreaders and enables effective epidemic control.

14:45-15:00, Paper FrB06.6	Add to My Program
Bounds of Validity for Bifurcations of Equilibria in a Class of Networked Dynamical Systems (I)

Gupta, Pranav	Indian Institute of Technology Bombay
Banavar, Ravi N.	Indian Institute of Technology Bombay
Bizyaeva, Anastasia	Cornell University
Keywords: Networked control systems, Neural networks, Stability of nonlinear systems Abstract: Local bifurcation analysis plays a central role in understanding qualitative transitions in networked nonlinear dynamical systems, including dynamic neural network and opinion dynamics models. In this article we establish explicit bounds of validity for the classification of bifurcation diagrams in two classes of continuous-time networked dynamical systems, analogous in structure to the Hopfield and the Firing Rate dynamic neural network models. Our approach leverages recent advances in computing the bounds for the validity of Lyapunov–Schmidt reduction, a reduction method widely employed in nonlinear systems analysis. Using these bounds we rigorously characterize neighbourhoods around bifurcation points where predictions from reduced-order bifurcation equations remain reliable. We further demonstrate how these bounds can be applied to an illustrative family of nonlinear opinion dynamics on k−regular graphs, which emerges as a special case of the general framework. These results provide new analytical tools for quantifying the robustness of bifurcation phenomena in dynamics over networked systems and highlight the interplay between network structure and nonlinear dynamical behaviour.


FrB07 Regular Session, Grand Salon 9	Add to My Program
Automotive Systems

Chair: Tang, Jian	Robert Bosch LLC
Co-Chair: Rajamani, Rajesh	Univ. of Minnesota

13:30-13:45, Paper FrB07.1	Add to My Program
Design of a Sensorless Trailer Anti-Lock Braking System Based on Trailer Swing Detection

Jeong, Semin	Korea Advanced Institute of Science and Technology
Oh, Jiyeol	Korea Advanced Institute of Science and Technology
Choi, Seibum Ben	Korea Advanced Institute of Science and Technology
Keywords: Automotive systems, Automotive control, Control applications Abstract: This study proposes a trailer anti-lock braking system (ABS) for commercial travel trailers, addressing the practical concern of unknown trailer parameters and the absence of trailer sensors. In current commercial trailer systems, a control unit in the towing vehicle allows the driver to manually adjust the trailer brake gain. However, improper adjustment can cause excessive braking at the trailer wheels, leading to wheel lock-up and a loss of lateral stability, commonly referred to as the “trailer swing” phenomenon. Previous studies are based on trailer wheel speed or hitch angle sensors, which are not typically available in commercial trailers, thus limiting their applicability. Aftermarket trailer ABS also exists, but its high cost and complex installation requirements have hindered widespread adoption. To overcome this limitation, this study introduces a novel trailer ABS that does not rely on direct wheel slip measurements. Instead, trailer swing is detected by the trailer swing index, an indirect indicator of wheel lock-up, enabling braking control without additional trailer-mounted sensors. The proposed method is evaluated through Carsim/Simulink. Through the proposed method, trailer swing is detected and controlled effectively without trailer information.

13:45-14:00, Paper FrB07.2	Add to My Program
On the Importance of Slip Angle Estimation for Accurate Target Vehicle Tracking

Kyong, Hongjoon	University of Minnesota
Alai, Hamidreza	University of Minnesota
Kazemi Tameh, Ehsan	University of Minnesota
Rajamani, Rajesh	Univ. of Minnesota
Keywords: Automotive systems, Mechatronics, Estimation Abstract: With an increasing demand for Level 3 autonomous driving systems, the need for accurate tracking of the trajectories of nearby vehicles using just low-cost radar has become critical. While well-designed estimation algorithms for target vehicle tracking are essential, the accuracy of the underlying dynamic model plays a pivotal role as well. Conventional vehicle kinematic models are formulated in the inertial (global) frame and do not explicitly account for the ego-vehicle’s motion. The new relative motion model in this paper considers the effects of ego-vehicle motion, leading to improved target tracking performance when the ego-vehicle executes non-zero yaw motions. This model requires the use of the ego-vehicle’s dynamic states, including velocity, yaw rate, and slip angle. Among these, slip angle estimation poses a particular challenge due to limitations in cost-effective sensing technologies. This paper highlights the importance of slip angle estimation for accurate vehicle tracking using a new relative motion model. The ego-vehicle’s slip angles are employed for target vehicle tracking. The proposed method is validated across a range of driving scenarios, including sharp cornering maneuvers and even U-turns. Experimental results demonstrate that the proposed method consistently provides reliable vehicle tracking results.

14:00-14:15, Paper FrB07.3	Add to My Program
Observer Design for Vector Nonlinear System with Application to Vehicle Tracking

Sharma, Gaurav	University of Minnesota
Zemouche, Ali	CRAN UMR CNRS 7039 & Université De Lorraine
Rajamani, Rajesh	Univ. of Minnesota
Keywords: Automotive systems, Multivehicle systems, Estimation Abstract: This paper introduces an observer design framework for a system with multivariable vector arguments in the nonlinear process dynamics. When the partial derivatives of the nonlinear functions are bounded and monotonic, exponential convergence of the observer is guaranteed by design using a set of linear matrix inequalities (LMIs). For non-monotonic cases, a switched-gain observer is proposed, complemented by a state-scaling technique that enhances numerical stability and extends the feasibility of the LMI conditions. The framework is validated in an autonomous driving application, where the observer estimates the trajectories of surrounding vehicles. Experiments on a full-scale autonomous Chrysler Pacifica vehicle at the University of Minnesota demonstrate robust performance even in the presence of non-monotonic vector nonlinear dynamics.

14:15-14:30, Paper FrB07.4	Add to My Program
A Regenerative Braking Framework for Intent-Based Platooning in Real-World Traffic Situations

Sharma, Sachin	Toyota Motor North America
Moradipari, Ahmadreza	Toyota Motor North America
Avedisov, Sergei S.	Toyota North America R&D InfoTech Labs
Mishra, Shatadal	Toyota Motor North America
Nour, Mariam	Toyota InfoTech Labs
Keywords: Cooperative control, Automotive systems, Multivehicle systems Abstract: Traditional platooning algorithms, such as Cooperative Adaptive Cruise Control (CACC), rely on connected automated vehicles (CAVs) sharing their current position, speed, and acceleration. In contrast, intent-based platooning is an emerging technology where CAVs share their planned trajectories. In this work, both of these algorithms are adapted for Battery Electric Vehicles (BEVs) and Plug-in Hybrid Electric Vehicles (PHEVs) to assess the impact of platooning on energy consumption. The regenerative braking system common in BEVs and PHEVs is modeled and incorporated into the optimization framework of intent-based platooning. The modified framework is then compared with the traditional CACC and intent-based platooning frameworks through simulations based on real vehicle speed profiles collected in free-flow and congested highway driving. The results show that the intent-based platooning framework adapted for BEVs and PHEVs achieves an 7-16% improvement in energy efficiency compared to the traditional intent-based platooning framework, and an 14-33% improvement compared to CACC.

14:30-14:45, Paper FrB07.5	Add to My Program
Bayesian Optimization for ADAS Development: Balancing Safety and Comfort in an AEB Use Case (I)

Rigotti Caiano, Solano	IPEN
Galbier Lopes, Ygor	Bosch
Mafort, Leimar	Compnay
Tang, Jian	Robert Bosch LLC
Keywords: Optimization algorithms, Simulation, Automotive systems Abstract: This work integrated a hybrid Bayesian–Optimisation and Genetic–Algorithm (BO–GA) workflow in the IPG CarMaker’s software-in-the-loop (SIL) environment to assist Advanced Driver Assistance System (ADAS) development. By replacing exhaustive grid searches with a Gaussian-process surrogate model and an expected-improvement acquisition function, the approach delivers substantial cost savings and delineates operating boundaries for subsequent software development. An Autonomous Emergency Braking (AEB) use case shows how the approach concentrates evaluations in the most critical regions of the scenario space, specifically where the timeto-collision (TTC) satisfies < 1.0s or the jerk exceeds 10m/s3 —thereby meeting both safety and comfort requirements. Relative to conventional parameter sweeps, the method cuts the total efforts by approximately 75% while still producing interpretable safety–comfort trade-off curves that guide further development.

14:45-15:00, Paper FrB07.6	Add to My Program
Lateral String Stability for Vehicle Platoons

Li, Sixu	Texas A&M Univ
Darbha, Swaroop	Texas A&M Univ
Zhou, Yang	Texas A&M Univ
Keywords: Multivehicle systems, Automotive control, Control applications Abstract: Connected and automated vehicle (CAV) platooning promises gains in energy efficiency and traffic throughput and, most critically, in safety. These safety benefits hinge on string stability, which determines how disturbances propagate along a platoon. While longitudinal string stability is well studied, lateral string stability, which governs the propagation of path-tracking errors that can lead to unsafe deviations from the intended path, remains underexplored. Its importance is increasing as autonomous vehicles rely more heavily on onboard sensing and map‑free navigation, where sensor occlusion and dense formations amplify safety risks. This paper presents a new framework for lateral string stability that directly addresses safety‑critical path‑relative tracking errors and enables consistent comparison across vehicles following the same road geometry. Central to this framework is an arc‑length (Eulerian) viewpoint, a departure from traditional analyses, that clarifies how tracking errors at a given point on the path propagate from one vehicle to the next. A formal definition of lateral string stability is introduced along with two control strategies: an onboard‑sensing‑only controller and a novel learn‑from‑predecessor approach utilizing vehicle-to-vehicle (V2V) communication. We show that onboard sensing alone cannot guarantee attenuation of path-tracking errors, imposing a fundamental safety limitation, whereas V2V communication enables true error attenuation.


FrB08 Regular Session, Grand Salon 10-13	Add to My Program
Cooperative Control

Chair: Gaspar, Peter	SZTAKI
Co-Chair: Zelazo, Daniel	Technion - Israel Institute of Technology

13:30-13:45, Paper FrB08.1	Add to My Program
Aligning LLMs with Human Comfort Preferences in Human-Robot Collaboration Via Comfort-Guided Supervised Fine-Tuning

Yan, Yuchen	Clemson University
Jia, Yunyi	Clemson Universtiy
Keywords: Cooperative control, Agents-based systems, Optimization Abstract: Aligning large language models (LLMs) with human preferences has been a critical challenge in artificial intelligence research. While Reinforcement Learning from Human Feedback (RLHF) is a popular approach to this alignment, it often relies on explicit human-provided rewards, which are costly, inconsistent, and difficult to scale. In this paper, we propose a comfort-guided supervised fine-tuning framework that replaces direct human feedback with reward signals computed from pre-built human comfort models. These models capture spatial, cognitive, and interactional preferences, enabling LLMs to generate decisions that align with human comfort expectations in collaborative scenarios. By fine-tuning LLMs using comfort-based and efficiency-based reward functions derived from these models, our method provides a scalable and systematic alternative to traditional feedback-driven learning. We apply this framework to a human-robot collaboration task, where an LLM generates optimal action plans by balancing efficiency and comfort. Our results demonstrate that fine-tuning with an extremely small dataset (1MB) and model-generated reward signals significantly improves alignment with human expectations, yielding decisions that are both effective and intuitively preferred by human collaborators. This study introduces a lightweight and efficient strategy to embed human comfort into AI-driven decision-making, reducing reliance on large-scale human annotation while maintaining high-quality preference alignment.

13:45-14:00, Paper FrB08.2	Add to My Program
Energy-Based Scheduling for Collaborative Robot Mobility

Nguyen, Alexander A.	University of California, Irvine
Jabbari, Faryar	Univ. of California at Irvine
Egerstedt, Magnus	University of California, Irvine
Keywords: Cooperative control, Agents-based systems, Optimization Abstract: This paper presents an energy-based, pairwise collaborative scheduling framework for two robots with different mobility characteristics traversing a landscape. The collaborative scheduling problem is formulated as a mixed integer linear program (MILP), whose solution determines when the two robots should work together to accomplish their respective tasks with energy remaining in their batteries. The MILP is framed in two ways, considering net energy expenditure and final stored energy (i.e., energy available at the end of the task) as performance measures: (I) optimizing the performance of the overall team (e.g., minimizing the combined net energy expenditure of the two robots) and (II) optimizing the performance of individuals (e.g., minimizing the maximum net energy expenditure for the individual robots). Additionally, we introduce the notion of the ``cost of fairness'' and demonstrate the proposed framework on scenarios in which collaboration between two heterogeneous robots is necessary.

14:00-14:15, Paper FrB08.3	Add to My Program
A Generalized Voronoi Graph Based Coverage Control Approach for Non-Convex Environment

Guo, Zuyi	Zhejiang University
Zheng, Ronghao	Zhejiang University, ZJU
Liu, Meiqin	Zhejiang University
Zhang, Senlin	Zhejiang University
Keywords: Distributed control, Cooperative control, Decentralized control Abstract: To address the challenge of efficient coverage by multi-robot systems in non-convex regions with multiple obstacles, this paper proposes a coverage control method based on the Generalized Voronoi Graph (GVG). This method partitions the non-convex region into multiple sub-regions using GVG, overcoming the limitations of traditional Voronoi partitioning, which fails to handle obstacles and non-convex boundaries. Besides, a weighted load-balancing algorithm is developed, which considers the quality differences among sub-regions. By iteratively optimizing the robot allocation ratio, the number of robots in each sub-region is matched with the sub-region quality to achieve load balance. The convergence of the method is proved and its performance is evaluated through simulations.

14:15-14:30, Paper FrB08.4	Add to My Program
Formation Control Via Rotation Symmetry Constraints

Martinez, Zamir	Technion - Israel Institute of Technology
Zelazo, Daniel	Technion - Israel Institute of Technology
Keywords: Cooperative control, Distributed control, Control of networks Abstract: This work introduces a distributed formation control strategy for multi-agent systems based solely on rotation symmetry constraints. We propose a potential function that enforces inter-agent rotational symmetries, whose gradient defines a control law that drives the agents toward a desired planar symmetric configuration. We show that only n-1 edges (the minimal connectivity requirement) are sufficient to implement the strategy, where n is the number of agents. We further augment the design to address the maneuvering problem, enabling the formation to undergo coordinated translations, rotations, and scaling along a predefined virtual trajectory. Simulation examples are provided to validate the effectiveness of the proposed method.

14:30-14:45, Paper FrB08.5	Add to My Program
An Integrated Design of Robust Hinf Control with Proximal Policy Optimization for the Cooperation of Robot Vehicles

Lelkó, Attila	SZTAKI Institute for Computer Science and Control
Nemeth, Balazs	SZTAKI
Gaspar, Peter	SZTAKI
Keywords: Cooperative control, Reinforcement learning, Robust control Abstract: The aim of this study is to propose a reinforcement-learning-based synthesis method for the integrated design of a robust H-infinity control and a deep neural network. This integration improves closed-loop performance through Proximal Policy Optimization (PPO), i.e. the Hinf controller and the deep neural network are trained within the same optimization process; this joint training constitutes the main novelty of the paper. The resulting control system provides performance guarantees against uncertainties which are taken into account both in the Hinf design process and in the training environment. The effectiveness of the integrated design method is illustrated through the coordinated control of simultaneously moving robot vehicles, where reduced mission time and collision avoidance are achieved even under significant uncertainties.

14:45-15:00, Paper FrB08.6	Add to My Program
Multi-Robot Allocation for Information Gathering in Non-Uniform Spatiotemporal Environments

Naveed, Kaleb Ben	University of Michigan,
Lee, Haejoon	University of Michigan
Panagou, Dimitra	University of Michigan
Keywords: Cooperative control, Autonomous robots, Sensor networks Abstract: Autonomous robots are increasingly deployed to estimate spatiotemporal fields (e.g., wind, temperature, gas concentration) that vary across space and time. We consider environments divided into non-overlapping regions with distinct spatial and temporal dynamics, termed non-uniform spatiotemporal environments. Gaussian Processes (GPs) are commonly used to estimate such fields, where the kernel encodes spatial and temporal correlations through its lengthscales. When these lengthscales are misspecified, the resulting uncertainty estimates can be highly inaccurate. Existing GP methods often assume a single global lengthscale or update only periodically; some allow spatial variation but ignore temporal changes. To address these limitations, we propose a two-phase framework for multi-robot field estimation: Phase 1 uses a variogram-driven planner to learn region-specific spatial lengthscales, while Phase 2 reallocates robots based on current uncertainty and updates sampling as temporal lengthscales are refined. To encode uncertainty, we use clarity, an information metric from our earlier work. We evaluate the proposed method across diverse environments and provide convergence analysis for spatial lengthscale estimation, along with dynamic regret bounds quantifying the gap to the oracle allocation sequence.


FrB09 Invited Session, Grand Salon 12	Add to My Program
Estimation for Energy Storage Systems

Chair: Zhang, Dong	University of Oklahoma
Co-Chair: Soudbakhsh, Damoon	Temple University
Organizer: Docimo, Donald	Texas Tech University
Organizer: Soudbakhsh, Damoon	Temple University
Organizer: Zhang, Dong	University of Oklahoma
Organizer: Song, Ziyou	University of Michigan, Ann Arbor
Organizer: Araujo Xavier, Marcelo	Amazon Leo
Organizer: Moura, Scott	University of California, Berkeley
Organizer: Lin, Xinfan	University of California, Davis
Organizer: Cui, Xiaofan	University of California, Los Angeles
Organizer: Filgueira da Silva, Samuel	The Ohio State University
Organizer: Tang, Shuxia	Texas Tech University
Organizer: Dey, Satadru	The Pennsylvania State University

13:30-13:45, Paper FrB09.1	Add to My Program
Combined State and Parameter Estimation in Parallel-Connected Lithium-Ion Battery Cells (I)

Espin, Jorge	University of Oklahoma
Zhang, Dong	University of Oklahoma
Keywords: Estimation, Energy systems, Observers for nonlinear systems Abstract: Lithium-ion batteries in parallel connection present significant challenges for monitoring and control due to the coupled dynamics of individual cells and the lack of direct cell-level sensing. The inability to monitor cell-level state-of-charge (SOC), individual branch currents, and key parameters limits the effectiveness of battery management strategies and compromises system safety. This paper proposes an adaptive observer framework that addresses these limitations through a descriptor system formulation capturing the coupled dynamics of parallel-connected cells while enforcing Kirchhoff's laws as algebraic constraints. The method enables simultaneous estimation of cell-level states and key parameters, particularly charge capacity, using exclusively total voltage and current measurements. Numerical simulations under a dynamic current profile demonstrate accurate reconstruction of SOC, cell-level currents, and charge capacities of individual cells, even with large initial condition mismatches and parameter heterogeneity. These results establish the potential of this observer-based solution for battery management systems seeking comprehensive cell-level awareness without additional sensing infrastructure.

13:45-14:00, Paper FrB09.2	Add to My Program
Machine Learning Detection of Lithium Plating in Lithium-Ion Cells: A Gaussian Process Approach (I)

Patnaik, Ayush	UC Davis
Zufall, Adam	University of California, Davis
Robinson, Stephen Kern	University of California Davis
Lin, Xinfan	University of California, Davis
Fogelquist, Jackson	University of California, Davis
Ji, Yiwei	Washington University in St. Louis
Bai, Peng	Washington University in St. Louis
Keywords: Machine learning, Aerospace, Energy systems Abstract: Lithium plating during fast charging is a critical degradation mechanism that accelerates capacity fade and can trigger catastrophic safety failures. Recent work has shown that plating onset can manifest in incremental-capacity analysis as an additional high-voltage feature above 4.0 V, often appearing as a secondary peak or shoulder distinct from the main intercalation peak complex; however, conventional methods for computing dQ/dV rely on finite differencing with filtering, which amplifies sensor noise and introduces bias in feature location. In this paper, we propose a Gaussian Process (GP) framework for lithium plating detection by directly modeling the charge-voltage relationship Q(V) as a stochastic process with calibrated uncertainty. Leveraging the property that derivatives of GPs remain GPs, we infer dQ/dV analytically and probabilistically from the posterior, enabling robust detection without ad hoc smoothing. The framework provides three key benefits: (i) noise-aware inference with hyperparameters learned from data, (ii) closed-form derivatives with credible intervals for uncertainty quantification, and (iii) scalability to online variants suitable for embedded BMS. Experimental validation on Li-ion coin cells across a range of C-rates (0.2C-1C) and temperatures (0-40^circC) demonstrates that the GP-based method reliably resolves distinct high-voltage secondary peak features under low-temperature, high-rate charging, while correctly reporting no features in non-plating cases. The concurrence of GP-identified differential features, reduced charge throughput, capacity fade measured via reference performance tests, and post-mortem microscopy confirmation supports the interpretation of these signatures as plating-related, establishing a practical pathway for real-time lithium plating detection.

14:00-14:15, Paper FrB09.3	Add to My Program
Pulse-Based Continual Learning (PCL) for Second-Life Battery SOH Estimation without Catastrophic Forgetting under Data Heterogeneity and Scarcity (I)

Tao, Shengyu	Chalmers University of Technology
Wang, Yezhen	Tsinghua University
Jiang, Shida	Univeristy of California, Berkeley
Lee, Jaewoong	University of California, Berkeley
Moura, Scott	University of California, Berkeley
Zhang, Xuan	Tsinghua-Berkeley Shenzhen Institute
Zou, Changfu	Chalmers University of Technology
Keywords: Energy systems, Estimation, Machine learning Abstract: Second-life batteries are critical to sustainable energy transi-tions, but their diverse historical usage history and limited field data pose major challenges for their reliable health as-sessment. To address this problem, we propose a pulse-based continual learning (PCL) framework with replay-buffer strategy that preserves prior knowledge while adapting to heterogeneous and data-scarce test conditions. To ensure the rapidness of data curation of second-life batteries, rapid pulse current injection are performed at varying state of charge (SOC) levels, widths, amplitudes, and polarizations. The extracted features from those response voltage signals are utilized as the input for PCL model training. The proposed PCL method treats these pulse current test conditions as do-main-incremental tasks and applies a replay-buffer strategy to balance model adaptation and knowledge retention. We performed extensive physical experiments on a broad set of second-life cells, encompassing multiple chemistries (NMC, LMO, LFP), capacities from 2.1 Ah to 35 Ah, and diverse physical forms. The results show that the average SOH esti-mation accuracy across experimental conditions is over 90%, while the forgetting measure for historical tasks is near-zero. The PCL effectively learns new tasks while retaining estab-lished knowledge, delivering accurate and consistent SOH estimates with evolving second-life conditions. Therefore, the proposed PCL method is robust and scalable, providing a pathway toward efficient second-life battery estimation, screening, regrouping with implications for sustainable re-using and recycling.

14:15-14:30, Paper FrB09.4	Add to My Program
Multi-Goal Mission Planning in Windy Environments Using Boundary Expansion Estimates of Reachable Regions (I)

Hayes, Kaelea	The Pennsylvania State University
Pangborn, Herschel	The Pennsylvania State University
Brennan, Sean	The Pennsylvania State University
Keywords: Aerospace, Flight control, Optimization algorithms Abstract: Unmanned aerial vehicles frequently operate in outdoor environments subject to wind disturbances. In areas where these wind disturbances are large, for example where wind speeds approach or surpass the maximum vehicle speed, the most energy efficient route from a start to a goal is often not a straight path. Improvements could potentially be obtained by working with the wind, rather than against it. This paper proposes a grid-free method to quickly solve multi-goal path planning problems in strong wind fields. By propagating an approximation of the reachable set boundary forward in discrete time, the method provides a way to determine trajectories between locations in strong wind fields. It also provides an approximation of the region reachable by the vehicle at after a given time interval, meaning that goals outside of this region that must be reached within that interval can be eliminated from consideration. Additionally, a time-based cost for travel between two locations can be determined by the number of set expansions required to connect them. Under the assumption that time of travel and energy consumption are inherently linked, these properties are applied to solve an energy-optimizing version of the traveling salesman problem (TSP) where the solution is the energy-optimizing path between all feasible goals within a strong wind field, namely one whose localized maximum speeds are faster than the vehicle's flight speed.

14:30-14:45, Paper FrB09.5	Add to My Program
Sparse-Frequency EIS Measurements and Impedance Spectrum Reconstruction for Lithium-Ion Cell Modeling

Morgan, Timothy	University of Alabama in Huntsville
Sahoo, Avimanyu	University of Alabama in Huntsville
Keywords: Nonlinear systems identification, Energy systems, Automotive systems Abstract: In this paper, we propose a sparse-frequency EIS measurement and reconstruction framework that (i) acquires impedance data at a limited number of frequencies and (ii) reconstructs the full spectrum using a Chebyshev-based pseudospectral formulation. The frequencies are chosen according to Chebyshev–Lobatto nodes mapped onto a logarithmic frequency axis. The coefficients of the Chebyshev basis are efficiently obtained through a Type-I discrete cosine transform (DCT-I) with end-point half-weighting. Based on this reconstruction, the parameters of the corresponding ECM are then derived analytically, eliminating the need for iterative Nonlinear Least Squares (NLLS) fitting. Furthermore, we provide an analytic characterization of the reconstruction error and establish its dependence on both the frequency range and the number of sampled points. Finally, we experimentally validate the approach on an 18650 lithium-ion cell, showing that sampling at only six frequencies between 0.1 Hz and 1000 Hz is sufficient to reproduce the impedance spectra with high fidelity. In addition, fewer measurements show a 70% reduction in measurement time. Across the full frequency band, the synthesized ECM faithfully overlays the measured Nyquist curves, achieving sub-milliohm absolute error.

14:45-15:00, Paper FrB09.6	Add to My Program
A Physics-Informed Neural-Network Model for Rechargeable Thick-Electrode Lithium-Metal Battery Cells That Is Accurate at High C-Rate

Hileman, Wesley Allen	University of Colorado Colorado Springs
Lee, Byeong Kil	University of Colorado Colorado Springs
Trimboli, Michael	University of Colorado, Colorado Springs
Plett, Gregory L.	University of Colorado Colorado Springs
Keywords: Modeling, Grey-box modeling, Energy systems Abstract: We construct a physics-informed machine learning (PIML) model for rechargeable thick-electrode lithium-metal battery (LMB) cells by combining an enhanced single-particle model (SPMe) with a feedforward neural network (FNN). The hybrid model incorporates an additional pole necessary for modeling variation in electrochemical variables across the thickness of the electrode and guarantees bounded-input bounded-output stability by design. We train the model against the predictions of a Newman model simulated with the PyBaMM package. The trained PIML model predicts the voltage of a cell with a 200µm electrode at 1.8C with 93% less error than a plain SPMe model while remaining tractable for embedded applications. We provide Python code to simulate the Newman model, simulate the SPMe, and train the PIML model.


FrB10 Regular Session, Grand Salon 15	Add to My Program
Power Systems

Chair: Yebra, Luis José	CIEMAT-Plataforma Solar De Almería
Co-Chair: You, Pengcheng	Peking University

13:30-13:45, Paper FrB10.1	Add to My Program
DC Shipboard Microgrid Control Using Online Multilayer Neural Network Lifelong Learning

Shahed, Md Tanvir	Missouri University of Science and Technology
Farzanegan, Behzad	Missouri University of Science and Technology
Jagannathan, Sarangapani	Missouri Univ of Science & Tech
Keywords: Power systems, Control applications, Reinforcement learning Abstract: This work proposes an adaptive optimal control scheme for DC shipboard power systems (SPS) with multiple distributed generators (DGs), leveraging a multilayer neural network (MNN) to enhance the performance of a supplementary energy storage system (ESS) under pulsed power load (PPL) conditions. PPLs impose large, short-duration power demands that challenge system stability due to nonlinear dynamics and operational constraints. The proposed reinforcement learning (RL) framework with lifelong hybrid learning (LHL) ad- dresses these challenges by pursuing three objectives: rapid ESS charging, DC bus voltage regulation, and proportional load current sharing among DG converters. An actor–critic MNN approximates the value function and control policy, where the critic network is updated using a hybrid scheme that refines weights both at and within sampling instants for faster convergence. A weight consolidation mechanism further enables lifelong learning, mitigating catastrophic forgetting and reducing overall cost, while the actor adapts using control input errors. Experimental validation on the OPAL-RT OP4512 hardware-in-the-loop (HIL) platform demonstrates that the proposed controller effectively achieves the control objectives under dynamic load conditions.

13:45-14:00, Paper FrB10.2	Add to My Program
Learning-Augmented Primal-Dual Control Design for Secondary Frequency Regulation

Yu, Yixuan	Peking University
Bansal, Rajni Kant	Indian Institute of Management, Ahmedabad
Jiang, Yan	The Chinese University of Hong Kong, Shenzhen
You, Pengcheng	Peking University
Keywords: Power systems, Smart grid, Machine learning Abstract: Frequency stability is fundamental to the secure operation of power systems. With growing uncertainty and volatility introduced by renewable generation, secondary frequency regulation - responsible for restoring the nominal system frequency - must now deliver enhanced performance not only in the steady state but also during transients. This paper presents a systematic framework to embed learning in the design of a primal–dual controller that provides provable (potentially exponential) stability and steady-state optimality, while simultaneously improving key transient metrics, including frequency nadir and control effort, in a data-driven manner. In particular, we employ the primal-dual dynamics of an optimization problem that encodes steady-state objectives to realize secondary frequency control with asymptotic stability guarantee. To augment transient performance of the controller via learning, a change of variables on control inputs, which will be deployed by neural networks, is proposed such that under mild conditions, stability and steady-state optimality are preserved. It further allows us to define a learning goal that accounts for the exponential convergence rate, frequency nadir and accumulated control effort, and use sample trajectories to enhance these metrics. Simulation results validate the theories and demonstrate superior transient performance of the learning-augmented primal-dual controller.

14:00-14:15, Paper FrB10.3	Add to My Program
Improved Voltage Regulation with Optimal Design of Decentralized Volt-VAr Control

Russell, Daniel	University of Vermont
Hamilton, Dakota	The University of Vermont
Almassalkhi, Mads	University of Vermont
Ossareh, Hamid	University of Vermont
Keywords: Power systems, Stability of linear systems, Decentralized control Abstract: Integration of distributed energy resources has created a need for autonomous, dynamic voltage regulation. Decentralized Volt-VAr Control (VVC) of grid-connected inverters presents a unique opportunity for voltage management but, if designed poorly, can lead to unstable behavior when in feedback with the grid. We model the grid-VVC closed-loop dynamics with a linearized power flow approach, leveraging historical data, which shows improvement over the commonly used LinDistFlow model. This model is used to design VVC slopes by minimizing steady-state voltage deviation from the nominal value, subject to a non-convex spectral radius stability constraint, which has not been previously implemented within this context. We compare this constraint to existing convex restrictions and demonstrate, through simulations on a realistic feeder, that using the spectral radius results in more effective voltage regulation.

14:15-14:30, Paper FrB10.4	Add to My Program
A Spatio-Temporal Attention and Transformer Framework for Hybrid Attack Detection in Cyber-Physical Systems

Lau, Clement Kai Xuen	Shanghai Jiao Tong University
Wu, Jing	Shanghai Jiao Tong University
Long, Chengnian	Shanghai Jiao Tong University
Keywords: Power systems Abstract: In this paper, we present a Spatio-Temporal Graph with Deep Embedded Clustering framework (STaG-DEC) that jointly detects, localizes, and classifies hybrid threats. It employs a Graph Convolutional Attention Network (GCAT) to extract spatial dependencies and fuses the resulting embeddings with temporally differenced measurements via concatenation and linear projection before the Transformer encoder, which suppresses slow operational drift, highlights abrupt attack-induced transients, and preserves long-range dependencies. To address class imbalance and overlapping patterns, standard crossentropy with a DEC-based joint loss is proposed that promotes compact and better-separated latent clusters and stabilizes minority-class decision boundaries, thereby enabling precise node-level localization and classification across varying attack distributions. Experiments on IEEE 14-bus system demonstrate our superior detection performance with notably improved replay detection and robust behavior under imbalanced conditions.

14:30-14:45, Paper FrB10.5	Add to My Program
Microgrids Optimal Radial Reconfiguration and Islanding Via FORWARD Algorithm

Vendrell Gallart, Joan	University of California Irvine
Kia, Solmaz S.	University of California Irvine (UCI)
Bent, Russell	Los Alamos National Laboratory
Keywords: Smart grid, Power systems, Optimization algorithms Abstract: Designing optimal radial configurations for microgrids involves Mixed-Integer Non-Linear Programming (MINLP) formulations that are often computationally prohibitive for large-scale systems. To overcome this, we reformulate the distribution optimization problem—where generator locations and capacities are predetermined—into a set-valued optimization framework. This allows us to leverage the recently proposed FORWARD algorithm to efficiently explore the radial configuration space while strictly adhering to physical power flow constraints. The resulting feasible configurations serve as high-quality suboptimal solutions or as effective warm-start points for conventional MINLP solvers. The key innovation lies in abstracting the microgrid network structure to enable rapid topological search while maintaining coupling with physical power flow requirements. Our approach introduces several strategic modifications: a phase-expanded node representation for multi-phase AC systems, physics-informed weight functions incorporating impedance characteristics, and constraint integration strategies that embed power system limitations directly within the sampling process. Numerical evaluations on IEEE benchmarks and synthetic networks of up to 400 nodes demonstrate polynomial-time performance, whereas traditional MINLP solvers often fail beyond 70 buses. The algorithm achieves solutions in seconds with competitive quality, enabling real-time microgrid management and large-scale distribution~planning.

14:45-15:00, Paper FrB10.6	Add to My Program
Dynamic Modeling and Control of a Parabolic Trough Solar Collection Line for Power Generation

Yebra, Luis José	CIEMAT-Plataforma Solar De Almería
Rhinehart, R. Russell	Oklahoma State Univ. - Retired
Keywords: Modeling, Distributed parameter systems, Process Control Abstract: This is an application paper that provides a simulator and a control algorithm for a thermal collection line in a parabolic trough collector for solar generation of electricity. The simulation is designed to match the TCP-100 facility of the Plataforma Solar de Almeria, a full-scale CIEMAT research center, in Spain. Of interest to the research community would be the reveal of control issues and a solution that includes two nonlinear algorithms, explanation of a simulator that can serve as a benchmark for control algorithm testing, and comprehensive issues that need to be included in evaluating control algorithms. Of interest to the practitioner community would be the use of simple nonlinear control techniques embedded in a traditional cascade structure to solve nonlinear and deadtime issues. The simulator includes nominal direct normal irradiance (DNI) that changes over the day; random variations in DNI, optical efficiency, inlet temperature, ambient losses, and flow friction factor; and error on all measurements (calibration and noise).


FrB11 Regular Session, Grand Salon 16	Add to My Program
Robotics II

Chair: Stiffler, Nicholas	University of Dayton
Co-Chair: Han, Feng	New York Institute of Technology

13:30-13:45, Paper FrB11.1	Add to My Program
Learning Human Gait with Muscle Control and Metabolic Cost Integration

Drewing, Nadine	Technical University of Darmstadt
Firas, Al-Hafez	TU Darmstadt
Guoping, Zhao	Southeast University
Peters, Jan	Max-Planck Institute
Seyfarth, Andre	TU Darmstadt
Findeisen, Rolf	TU Darmstadt
Ahmad Sharbafi, Maziar	TU Darmstadt
Keywords: Robotics, Biomedical, Reinforcement learning Abstract: Human locomotion involves complex coordination of over-actuated muscle systems and joints, making simulation and control design highly challenging. While recent reinforcement and imitation learning methods can replicate human-like kinematics, they often fail to produce physiologically realistic force patterns, largely due to the limited availability and consideration of reference force plate or electromyography (EMG) data. This paper presents a hybrid imitation learning framework that integrates muscle-driven simulations with reinforcement learning to address over-actuation and to account for the fact that the available data reflects closed-loop actions involving muscle control. A key contribution is the incorporation of metabolic cost into the reward function, shaping energetically efficient and physiologically plausible controllers. Simulation results demonstrate that the learned policies generate muscle activations and ground reaction forces that align more closely with OpenSim references and experimental data than standard imitation learning. The method provides a scalable tool for developing and validating closed-loop control strategies for assistive systems such as exoskeletons and prostheses, while highlighting broader implications for learning-based control of over-actuated biomechanical systems.

13:45-14:00, Paper FrB11.2	Add to My Program
Enhanced-Balance Stabilization of Underactuated Robots with Learned Equilibrium Manifold

Chen, Lixuan	New York Institute of Technology
Han, Feng	New York Institute of Technology
Keywords: Robotics, Control applications, Machine learning Abstract: Existing control designs for underactuated balance robots often rely on static equilibrium, restricting system autonomy. While dynamic equilibrium resolves this, it is computationally demanding to obtain, especially under unknown dynamics. This paper proposes a machine learning-based dynamic equilibrium prediction framework that captures inherent dynamic coupling directly from motion data, requiring no explicit equilibrium data for training. Leveraging this predicted equilibrium improves control design, enabling simultaneous task execution of the actuated subsystem and robust balance stabilization of the unactuated subsystem. Simulations validate the framework's effectiveness and enhanced performance.

14:00-14:15, Paper FrB11.3	Add to My Program
Modeling and Control of Multirotor Aerial Vehicles with Telescoping Extension Arm Morphology

Dauchert, Samuel	University of South Carolina
Zhang, Xin	University of Southern Maine
Stiffler, Nicholas	University of Dayton
Wang, Xiaofeng	University of South Carolina
Keywords: Robotics, Modeling, Predictive control for nonlinear systems Abstract: This paper presents a novel multirotor design featuring telescoping extension arm morphology (TEAM), together with a generalized control allocation framework. TEAM enables continuous and asymmetric arm extension, offering a level of adaptability not achievable with prior morphing designs, which were constrained to symmetric or discrete arm configurations. To manage the nonlinear actuator interactions introduced by TEAM, we introduce a joint mapping loop based on model predictive control (MPC). This formulation simultaneously determines propeller speeds and arm lengths, providing optimal control allocation in real time. The framework integrates seamlessly with standard cascaded flight control architectures: conventional position and attitude controllers remain unchanged, while MPC operates as a third loop with minimal structural modification. This combined design–control approach represents the first generalized framework for morphing multirotors with fully telescoping arms. The simulation results demonstrate that the proposed system achieves accurate trajectory tracking under aggressive maneuvers and center-of-mass offsets caused by payloads, highlighting its potential for robust and efficient aerial logistics.

14:15-14:30, Paper FrB11.4	Add to My Program
Synergy-Driven Prosthetic Hand Control for Dexterous In-Hand Manipulation

Wang, Chu	University of Melbourne
Mohammadi, Alireza	University of Melbourne
Eden, Jonathan Paul	The University of Melbourne
Tan, Ying	The University of Melbourne
Choong, Peter	The University of Melbourne
Oetomo, Denny	The University of Melbourne
Keywords: Robotics, Human-in-the-loop control, Simulation Abstract: Dexterous in-hand manipulation (DIM) remains a major challenge for prosthetic hand control, as users must generate complex multi-joint motions from low-dimensional inputs. Standard principal component analysis (PCA) captures static postural synergies but ignores continuous movement over time, limiting smooth finger coordination and practical prosthetic control. We apply multivariate functional PCA (mfPCA) to model joint trajectories as multivariate functional data, capturing both the temporal dynamics and the inter-joint coordination. Simulations with a high-degree-of-freedom robotic hand show that, with sufficient components, mfPCA-reconstructed trajectories closely reproduce the original object manipulation at joint and task levels. To enable practical low-dimensional control, we introduce a clustering-based strategy that maps synergy weights onto a low-dimensional plane, supporting stable task reproduction and adjustable variation. These results provide a framework for natural, low-burden prosthetic hand control in DIM tasks.

14:30-14:45, Paper FrB11.5	Add to My Program
Control Separation and Coordination of Trajectory Tracking and Balance Stabilization for Underactuated Balance Robots

Han, Feng	New York Institute of Technology
Keywords: Robotics, Predictive control for nonlinear systems, Control applications Abstract: Control input for the actuated subsystem and unactuated subsystem in underactuated robots’ control is separated into two groups. A coordinated scheme is proposed to design control for each subsystem to achieve trajectory tracking and stabilization, including a computed torque control for actuated subsystem tracking and an MPC-based control for unactuated subsystem balance. The MPC-based scheme in a subspace simultaneously predicts equilibrium and solves control design by embedding the unactuated dynamics as a nonlinear algebraic constraint. The effectiveness of the coordinated control is verified by extensive validations and comparisons.

14:45-15:00, Paper FrB11.6	Add to My Program
Force-Augmented LIPM–MPC for Stable Humanoid Loco–Manipulation

Kim, Baekseok	University of Nevada, Las Vegas
Oh, Paul	University of Nevada Las Vegas
Keywords: Robotics, Control applications, Optimal control Abstract: Humanoid robots must maintain stability under external forces during loco-manipulation tasks such as cart pushing or ladder carrying. This paper proposes a model predictive control (MPC) framework that extends the Linear Inverted Pendulum Model (LIPM) by incorporating external wrenches as additive offsets in the zero-moment point (ZMP) formulation. Through an equivalent ZMP transformation, the approach preserves the linear and convex structure of the MPC problem, enabling real-time feasibility while retaining physical interpretability. The method was validated in MuJoCo simulations with a full-size humanoid robot under various manipulation forces. Results demonstrate stable locomotion during lateral disturbances, ladder transportation, wheelbarrow maneuvering, and heavy-object pushing. These findings show that the proposed framework bridges the gap between efficient LIPM-based walking and robust multi-contact manipulation, advancing humanoid capability in real-world tasks.


FrB12 Regular Session, Grand Salon 18	Add to My Program
Discrete Event Systems

Chair: Liu, Ruotian	Polytechnic University of Bari
Co-Chair: Zenati, Abdelhafid	City Univesity of London

13:30-13:45, Paper FrB12.1	Add to My Program
Efficient Robustness Verification for Non-Blocking Discrete Event Systems

Dony, Md Nur-A-Adam	Tennessee Technological University
Rizvi, Syed Ali Asad	Tennessee Technological University
Keywords: Discrete event systems, Automata, Supervisory control Abstract: This paper introduces a mathematical framework that helps us understand when a discrete event system (DES) begins to experience blocking due to transition deletions. We have developed a string-based analysis to determine if non-blocking conditions persist following these deletions. Our contributions can be summarized in three key points: a state-specific, necessary-and-sufficient characterization of blocking caused by transition deletions, a definition of robust deletions that maintain non-blocking conditions, and an algorithm that identifies critical transitions through minimal blocking sets. This algorithm streamlines computation by focusing on structured candidates instead of considering all possible deletions. We present case studies in manufacturing and autonomous vehicles that effectively identify critical transitions and blocking scenarios while significantly reducing complexity.

13:45-14:00, Paper FrB12.2	Add to My Program
Multiple Global Secrets Protection in Discrete Event Systems Via Integer Linear Programming

Liu, Ruotian	Polytechnic University of Bari
Duan, Wei	Xidian Universisty
Hu, Shaopeng	Xidian University
Mangini, Agostino Marcello	Politecnico Di Bari
Fanti, Maria Pia	Polytechnic of Bari
Keywords: Discrete event systems, Optimization, Automata Abstract: This work addresses the problem of protecting multiple global secrets in discrete event systems modeled by nondeterministic finite automata. Specifically, a global secret in a system is assumed to be composed of one or more states, each assigned a specific security level. We say that a state is protected if any sequence of events leading to it from the initial state contains a quantity of protected events that is equal to or greater than the required security level. Correspondingly, a global secret is said to be protected if the cumulative weight of its containing protected states (or all containing states are considered in the worst-case scenario) satisfies a user-defined protection threshold. Our objective is to develop an event protection policy that is capable of protecting global secrets. To do so, we build an augmented automaton which eliminates the difficulty of analyzing the infinite sequences when verifying protection. Then, an optimal protection policy with minimum cost can be obtained by solving an integer linear programming problem. Examples are given to illustrate the effectiveness of our proposed protection strategy.

14:00-14:15, Paper FrB12.3	Add to My Program
Liveness, Reachability, and Reversibility of Signal Interpreted Petri Nets

Köhler, Andreas	University of Duisburg-Essen
Zhang, Ping	University of Duisburg-Essen
Keywords: Discrete event systems, Petri nets, Process Control Abstract: This paper proposes a set of novel sufficient conditions that guarantee three behavioral properties in signal interpreted Petri nets (SIPNs), i.e., liveness, reachability, and reversibility. SIPNs provide a modeling formalism for representing the control algorithm of discrete manufacturing systems. The liveness, reachability, and reversibility properties ensure that the desired control actions remain perpetually executable, the system states are reachable, and that the system can always return to its initial state, respectively. The sufficient conditions are derived based on the Petri net state equation and the enabling rules of the transitions in SIPNs. Moreover, it is shown how the reachability and reversibility can be computationally verified based on an integer linear programming problem. The computational complexity for verifying the properties is polynomial with respect to the number of markings in the SIPN when the reachable set is already available.

14:15-14:30, Paper FrB12.4	Add to My Program
Modular Control of Critical Observability and Weak Opacity in Discrete Event System

Miao, Shaowen	The Hong Kong University of Science and Technology (Guangzhou)
Komenda, Jan	Czech Academy of Sciences
Ji, Yiding	Hong Kong University of Science and Technology (Guangzhou)
Yin, Xiang	Shanghai Jiao Tong University
Keywords: Discrete event systems, Supervisory control, Automata Abstract: Large-scale systems found in technological frameworks often consist of interacting modules, where ensuring security is critical. In discrete-event systems, normality ensures that, based on an system observations, one can unambiguously determine whether the corresponding plant behavior is safe. In this paper, we first investigate the computation of an earlier form of normality, known as (L,P)-normality, in modular discrete-event systems. We show that, under the condition that all shared events are observable, the supremal (L,P)-normal sublanguages can be computed in a modular fashion. We then apply these results to the enforcement of two related notions that share similar physical interpretations with (L,P)-normality---namely, critical observability and weak opacity. By integrating our contributions with existing results on controllability and normality, we develop a modular supervisory control approach to enforce these properties and provide examples to illustrate the enforcement framework.

14:30-14:45, Paper FrB12.5	Add to My Program
Well-Posedness and Stability Analysis of Positive System Networks under Link-Dependent Delays

Bell, Madeleine	City, University of London
Zenati, Abdelhafid	City Univesity of London
Aouf, Nabil	City University of London
Keywords: Networked control systems, Stability of nonlinear systems, Delay systems Abstract: Nonlinear positive system networks with heterogeneous time-delay communications exhibit strong nonlinear effects that are highly sensitive to delays, which complicates their analysis and limits the generalisation of existing results. This paper investigates the stability and well-posedness of such networks. We first establish two fundamental properties of the considered dynamics. Well-posedness guarantees that communication delays do not introduce singularities into the solution and ensures that, for any admissible initial conditions, the system admits a unique trajectory evolving continuously over time. Positivity ensures that all trajectories remain non-negative for admissible initial conditions. Building upon these properties, we develop a new analysis framework that employs the theory of sequences of functions to construct a less-conservative linear Lyapunov function candidate. We show that stability hinges on the definiteness of a newly derived matrix-valued function guided by Barbalat's theorem. The theoretical results are validated through numerical simulations, demonstrating the effectiveness of the obtained results.


FrB13 Regular Session, Grand Salon 19	Add to My Program
Agent Based Systems I

Chair: Lian, Bosen	Auburn University
Co-Chair: Xue, Wenqian	University of Florida

13:30-13:45, Paper FrB13.1	Add to My Program
Fully Distributed GNE Algorithms for Multi-Robot Placement without Consensus on Multipliers

Yin, Shao-An	University of Minnesota
Hong, Mingyi	Iowa State University
Elia, Nicola	University of Minnesota
Keywords: Agents-based systems, Autonomous robots, Autonomous systems Abstract: Recent machine learning research has increasingly focused on equilibrium analysis in non-cooperative games rather than solely on optimal solutions. Many such problems involve shared constraints and can be formulated as Generalized Nash Equilibrium Problems (GNEPs). For strongly monotone games, existing methods compute consensus-based variational GNEs (v-GNEs) by exchanging Lagrange multipliers. We propose a fully distributed continuous-time algorithm for shared linear equality constraints that converges without multiplier exchange and reaches any GNE, reducing communication overhead and improving privacy. Discrete-time schemes are also provided, and the method is validated on a multi-robot placement task.

13:45-14:00, Paper FrB13.2	Add to My Program
Feature-Based Perception-Aware Multi-UAV Trajectory Planning

Yang, Teaya	University of California, Berkeley
Brommer, Christian	Oak Ridge Associated Universities
Mueller, Mark W.	University of California, Berkeley
Keywords: Agents-based systems, Autonomous systems, Cooperative control Abstract: Multi-agent UAV systems are well-suited for large-scale data collection and transportation, though navigation in unstructured, GPS-denied environments remains challenging. Vision-based navigation enables operation without external infrastructure, but the uncertainty in state estimation limits its reliability in multi-agent settings. We propose a trajectory planning framework that incorporates estimator uncertainty by exploiting visual feature observations between agents. The framework maintains a coherent shared map through multi-agent frame alignment to prevent independent vision drift, and employs a perception-aware reward that favors trajectories with stronger feature visibility and cross-agent redundancy. Flight data from a controlled two-UAV experiment demonstrate that our alignment module can effectively reduce relative distance error, validating its role in maintaining inter-agent consistency. Simulations show that perception-aware rewards improve feature visibility and coordination while maintaining goal-reaching performance.

14:00-14:15, Paper FrB13.3	Add to My Program
Dynamic Coalitions in Games on Graphs with Preferences Over Temporal Goals

Yilmaz, And Kaan Ata	University of Texas at Austin
Kulkarni, Abhishek	Vijil, Inc
Topcu, Ufuk	The University of Texas at Austin
Keywords: Agents-based systems, Autonomous systems, Discrete event systems Abstract: In multiplayer games with sequential decision-making, self-interested agents may form and dissolve temporary coalitions to maximize satisfaction of temporal goals beyond individual capabilities. We study a class of multiplayer games played on graphs where, in each round, a designated leader declares a coalition it joins and prescribes a joint action. Coalition members must comply unless their assigned action is inadmissible. The remaining agents act independently using admissible strategies. Each agent has temporal goals prioritized by qualitative preferences. We present a unified framework that integrates coalition formation with rational strategy synthesis under partial-order preferences. First, we generalize classical admissibility by introducing maximal sure-winning strategies to track goal satisfaction across evolving coalitions. Second, we define a multi-dimensional value function that captures preference-aware guarantees for all agents and enables efficient coalition rationality analysis. Third, we develop a polynomial-time algorithm to synthesize admissible strategies for all agents and proves their existence for this class of games. Experiments in a blocks-world domain demonstrate the advantages of dynamic over fixed coalitions, revealing counter-intuitive behaviors: agents with aligned preferences may fail to cooperate, while those with conflicting goals may form beneficial coalitions.

14:15-14:30, Paper FrB13.4	Add to My Program
Control of Heterogeneous Multi-Agent Systems with Active Leader

Xu, Yicheng	University of California Irvine
Jabbari, Faryar	Univ. of California at Irvine
Keywords: Agents-based systems, LMIs, Distributed control Abstract: This paper investigates the distributed output feedback control problem for heterogeneous multi-agent systems under a leader-following architecture with guaranteed L_2 performance. A scalable synthesis framework is developed, where the design of the leader state observer and the local dynamic output feedback controller are decoupled, thereby significantly reducing the computational complexity for large-scale networks. The proposed approach enables each agent to implement the controller using only local information. Linear matrix inequality conditions are derived to guarantee desirable tracking and disturbance attenuation. The dimension of the LMIs to be solved does not increase with the number of agents. The feasibility condition is established, while the effectiveness is demonstrated through a simulation example.

14:30-14:45, Paper FrB13.5	Add to My Program
Distributed Model Predictive Control with Delay for Formation of Multi-Agent Systems

Chiang, Ming-Li	National Taiwan University
Yang, Qi-Hong	National Taiwan Ocean University
Keywords: Agents-based systems, Cooperative control, Distributed control Abstract: In this paper, we propose a distributed model predictive control (DMPC) framework for cooperative formation of multi-agent systems that composed of unmanned aerial vehicles (UAVs) and an unmanned ground vehicle (UGV). The designed system considers formation maintenance, collision avoidance under information delays, uncertain deployments, and obstacle environments. A linear recursive estimation with error-shift compensation reconstructs the delayed neighbor trajectories, and the risk-aware re-optimization mechanism refines control when inter-agent distances fall below a threshold. Simulations results along with real-world experiments are provided to demonstrate robust formation tracking and collision avoidance.

14:45-15:00, Paper FrB13.6	Add to My Program
Data-Driven Min-Max Strategy for Multiplayer Multiagent Systems

Zhang, Yizhong	Auburn University
Colon, Christopher	Auburn University
Lian, Bosen	Auburn University
Keywords: Agents-based systems, Reinforcement learning, Optimal control Abstract: The paper designs data-driven distributed optimal consensus control for networked multiple dynamic systems, with each system (representing an agent) having multiple control inputs (representing players). We formulate differential graphical games that incorporate the evaluation of continuous-time multiplayer multiagent systems (MMSs), the cost functions of all players in all agents, and the communication network. The min-max strategy is designed for each player to prepare the optimal response to the worst-case reactions from the other players in the same and different agents. This strategy provides each player with distributed algebraic Riccati equations (AREs) and designs the distributed optimal control based on local feedback, further certified by ensured solutions in AREs, the achievement of min-max strategy, and the asymptotic stability of local error systems. The model-free, data-driven reinforcement learning (RL) algorithm is further developed to solve the min-max solutions using online sampled local trajectories without knowing system dynamics. Simulations verify the strategy and algorithm.


FrB14 Regular Session, Grand Salon 21	Add to My Program
Machine Learning III

Chair: Tembine, Hamidou	UQTR and Timadide
Co-Chair: Baheri, Ali	Rochester Institute of Technology

13:30-13:45, Paper FrB14.1	Add to My Program
Diffusion-Based Decentralized Federated Multi-Task Representation Learning

Kang, Donghwa	Iowa State University
Moothedath, Shana	Iowa State University
Keywords: Statistical learning, Decentralized control, Agents-based systems Abstract: We develop Dif-AltGDmin, a diffusion-based decentralized federated algorithm for multi-task representation learning (MTRL). We consider multi-task linear regression where multiple models share a common low-dimensional representation, and propose an alternating projected gradient descent and minimization approach to recover the shared low-rank feature matrix without a central coordinator. We provide provable guarantees on sample and iteration complexity to achieve ε-accuracy, and show that Dif-AltGDmin achieves significantly reduced communication complexity compared to existing decentralized approaches. Numerical simulations confirm that our algorithm matches the convergence of its centralized counterpart while remaining communication-efficient.

13:45-14:00, Paper FrB14.2	Add to My Program
Sliced Distribution Matching Based on Cumulative Distribution Functions with Applications to Control

Tzikas, Alexandros	Stanford University
Jamgochian, Arec	Stanford University
Ure, Nazim Kemal	Stanford University
Kochenderfer, Mykel	Stanford University
Boyd, Stephen	Stanford University
Keywords: Statistical learning, Optimization, Stochastic optimal control Abstract: Computing the similarity between two probability distributions is a recurring theme across control. We introduce a unified family of distances between the probability distributions of two random variables that is based on the discrepancy between the cumulative distribution functions of random linear one-dimensional projections of the random variables. Our proposed distance is interpretable, computationally simple, and admits a differentiable approximation. We establish asymptotic theoretical guarantees for sample-based estimators of the distance. We empirically study the use of the distance in a two-sample test and demonstrate its ability to distinguish different distributions. Finally, we show that the distance allows for simple gradient-based solutions in control by studying distribution steering and ergodic control.

14:00-14:15, Paper FrB14.3	Add to My Program
Logarithmic Regret and Polynomial Scaling in Online Multi-Step-Ahead Prediction

Qian, Jiachen	University of California San Diego
Zheng, Yang	University of California San Diego
Keywords: Statistical learning, Stochastic systems, Kalman filtering Abstract: This paper studies the problem of online multi-step-ahead prediction for unknown linear stochastic systems. Using conditional distribution theory, we derive an optimal parameterization of the prediction policy as a linear function of future inputs, past inputs, and past outputs. Based on this characterization, we propose an online least-squares algorithm to learn the policy and analyze its regret relative to the optimal model-based predictor. We show that the online algorithm achieves logarithmic regret with respect to the optimal Kalman filter in the multi-step setting. Furthermore, with new proof techniques, we establish an almost-sure regret bound that does not rely on fixed failure probabilities for sufficiently large horizons N. Finally, our analysis also reveals that, while the regret remains logarithmic in N, its constant factor grows polynomially with the prediction horizon H, with the polynomial order set by the largest Jordan block of eigenvalue 1 in the system matrix.

14:15-14:30, Paper FrB14.4	Add to My Program
CHMAS: A Coupled Hierarchical Framework for Multi-Agent Reinforcement Learning

Wang, Dongming	University of California, Riverside
Xu, Jie	University of California, Riverside
Zhang, Yanyu	University of California, Riverside
Ren, Wei	University of California, Riverside
Keywords: Reinforcement learning, Hierarchical control, Agents-based systems Abstract: Multi-agent reinforcement learning (MARL) systems face fundamental challenges in balancing global coordination with local execution across different temporal scales. This paper introduces the Coupled Hierarchical Multi-Agent System (CHMAS), a novel framework that decomposes multi-agent decision-making into centralized strategic planning and distributed tactical execution with bidirectional information flow. The strategic layer integrates all agents' states with an exclusive global environmental state to generate guidance actions every T timesteps, while tactical agents execute distributed policies augmented by strategic guidance and local neighborhood observations. Unlike existing hierarchical approaches with unidirectional control, CHMAS establishes a feedback mechanism where accumulated tactical rewards influence strategic objectives through a coupling coefficient lambda, ensuring strategic plans remain grounded in tactical feasibility. To address the non-stationarity inherent in hierarchical learning, we propose an asynchronous update protocol where strategic parameters update every N_f tactical episodes, allowing tactical policies to converge to quasi-stationary points between strategic changes. We present both a general bi-level formulation capturing full system dynamics and a tractable additive approximation enabling rigorous analysis. Theoretical analysis proves that this asynchronous scheme achieves mathcal{O}(log K/sqrt{K}) convergence for the strategic layer after K strategic updates under standard assumptions. Experimental validation in a multi-agent foraging domain demonstrates successful learning of spatially partitioned exploration strategies, with both layers converging stably despite hierarchical coupling.

14:30-14:45, Paper FrB14.5	Add to My Program
Density-Ratio Weighted Behavioral Cloning: Learning Control Policies from Corrupted Datasets

Karpoora Sundara Pandian, Shriram	Rochester Institute of Technology
Baheri, Ali	Rochester Institute of Technology
Keywords: Reinforcement learning, Learning, Machine learning Abstract: Offline reinforcement learning (RL) enables policy optimization from fixed datasets, making it suitable for safety-critical applications where online exploration is infeasible. However, these datasets are often contaminated by adversarial poisoning, system errors, or low-quality samples, leading to degraded policy performance in standard behavioral cloning (BC) and offline RL methods. This paper introduces Density-Ratio Weighted Behavioral Cloning (Weighted BC), a robust imitation learning approach that uses a small, verified clean reference set to estimate trajectory-level density ratios via a binary discriminator. These ratios are clipped and used as weights in the BC objective to prioritize clean expert behavior while down-weighting or discarding corrupted data, without requiring knowledge of the contamination mechanism. We provide theoretical guarantees on convergence to the clean policy, finite-sample bounds, and robustness under varying contamination levels. A comprehensive evaluation framework is established, incorporating diverse poisoning protocols (reward, state, transition, and action) on D4RL continuous control benchmarks. Experiments demonstrate that Weighted BC maintains near-optimal performance even at high contamination ratios outperforming baselines such as traditional BC, BCQ, and BRAC.

14:45-15:00, Paper FrB14.6	Add to My Program
Debiased GPT in Mean-Field-Type Games

Tembine, Hamidou	UQTR and Timadie
Keywords: Neural networks, Game theory Abstract: Bias pervades classical approximation methods in machine learning, including neural networks, kernel-based estimators, and density estimators, as well as advanced techniques such as multi-level Monte Carlo when applied to density-dependent functionals. Although some approaches can be rendered asymptotically unbiased under infinite computational budgets, such conditions are unattainable in practice. This limitation is particularly acute in mean-field-type games with Nemytskii-type state coefficients and payoffs, which arises in generative machine intelligence in audio, video, and image domains. Existing methodologies for these problems such as score function estimation inherit and propagate structural bias through all stages of computation. Here we introduce a debiasing framework for mean-field-type transformers that eliminates bias across these settings.


FrB15 Regular Session, Grand Salon 22	Add to My Program
Stability of Nonlinear Systems I

Chair: Rogers, Jonathan	Naval Surface Warfare Center, Philadelphia Division
Co-Chair: Liu, Jun	University of Waterloo

13:30-13:45, Paper FrB15.1	Add to My Program
A Lyapunov-Based Small-Gain Theorem for Fixed-Time Stability

Tang, Michael	University of California, San Diego
Krstic, Miroslav	University of California, San Diego
Poveda, Jorge I.	University of California, San Diego
Keywords: Stability of nonlinear systems, Lyapunov methods Abstract: This paper introduces a novel Lyapunov-based small-gain methodology for establishing fixed-time stability (FxTS) guarantees in interconnected dynamical systems. Specifically, we consider interconnections in which each subsystem admits an individual fixed-time input-to-state stability (ISS) Lyapunov function that certifies FxT-ISS. We then show that if a nonlinear small-gain condition is satisfied, then the entire interconnected system is FxTS. Our results are analogous to existing Lyapunov-based small-gain theorems developed for asymptotic and finite-time stability, thereby filling an important gap in the stability analysis of interconnected dynamical systems. The proposed theoretical tools are further illustrated through analytical and numerical examples, including the first result on fixed-time feedback optimization of dynamical systems without time-scale separation between the plant and the controller.

13:45-14:00, Paper FrB15.2	Add to My Program
Stability Analysis of Fast Extremum Seeking Control for Wiener Systems Using Online Complex Curve Fitting

Palacios Roman, Juan Javier	Eindhoven University of Technology
van Berkel, Matthijs	Dutch Institute for Fundamental Energy Research
Heemels, W.P.M.H. (Maurice)	Eindhoven University of Technology
van Keulen, Thijs	Eindhoven University of Technology
Keywords: Stability of nonlinear systems, Adaptive control, Optimization Abstract: In this paper, we show uniform semi-global practical asymptotic stability of fast extremum seeking control (ESC) for single-input single-output Wiener systems. While classic ESC requires a time-scale separation between plant and dither, the fast ESC method circumvents this time-scale separation by exploiting limited knowledge of the frequency response of the linear part of the Wiener system, thereby achieving faster convergence. The assumptions under which the fast ESC method works are relaxed compared to existing work and explicit bounds on the design parameters of the fast ESC scheme are provided. A numerical case study illustrates the enhanced convergence and the robustness of the fast ESC method.

14:00-14:15, Paper FrB15.3	Add to My Program
Period-Aware Asymptotic Gain with Application to a Periodically Forced Synchronization Circuit

Ponomarev, Anton	Karlsruhe Institute of Technology
Gröll, Lutz	Karlsruhe Institute of Technology
Hagenmeyer, Veit	Karlsruhe Institute of Technology (KIT)
Keywords: Stability of nonlinear systems, Computational methods, Power electronics Abstract: The classical asymptotic gain (AG) is a concept known from the input-to-state stability theory. Given a uniform input bound, AG estimates the asymptotic bound of the output. Sometimes, however, more information is known about the input than just a bound. In this paper we consider the case of a periodic input. Under the assumption that the system converges to a periodic solution, we introduce a new gain, called period-aware asymptotic gain (PAG), which employs periodicity to enable a sharper asymptotic estimation of the output. Since the PAG can distinguish between short-period ("high-frequency") and long-period ("low-frequency") signals, it is able to rigorously quantify such properties as bandwidth, resonant behavior, and high-frequency damping. We discuss how the PAG can be computed and illustrate it with a numerical example from the field of power electronics.

14:15-14:30, Paper FrB15.4	Add to My Program
On the Instability of Nesterov's ODE under Non-Conservative Vector Fields

Ochoa, Daniel E.	University of California Santa Cruz
Abdelgalil, Mahmoud	University of California, San Diego
Poveda, Jorge I.	University of California, San Diego
Keywords: Stability of nonlinear systems, Hybrid systems, Optimization Abstract: We study the instability of Nesterov's ODE in non-conservative settings, where the driving term is not necessarily the gradient of a potential function. While convergence properties under Nesterov's ODE are well-characterized for settings with gradient-based driving terms, we show that the presence of emph{arbitrarily small} non-conservative terms can lead to instability. To resolve the instability issue, we study a regularization mechanism based on restarting. The resulting system is a hybrid dynamical system that mirrors Nesterov's ODE during intervals of flow, and implements restarts of the momentum state through discrete periodic jumps. For this system, we establish novel explicit bounds on the resetting period that ensure the decrease of a suitable Lyapunov function, guaranteeing stability as well as "accelerated" convergence rates under suitable smoothness and monotonicity properties on the driving term. Numerical simulations support our results.

14:30-14:45, Paper FrB15.5	Add to My Program
A Converse Control Lyapunov Theorem for Joint Safety and Stability

Quartz, Thanin	University of Waterloo
Fitzsimmons, Maxwell	University of Waterloo
Liu, Jun	University of Waterloo
Keywords: Stability of nonlinear systems, Lyapunov methods, Constrained control Abstract: We show that the existence of a strictly compatible pair of control Lyapunov and control barrier functions is equivalent to the existence of a single smooth Lyapunov function that certifies both asymptotic stability and safety. This characterization complements existing literature on converse Lyapunov functions by establishing a partial differential equation (PDE) characterization with prescribed boundary conditions on the safe set, ensuring that the safe set is exactly certified by this Lyapunov function. The result also implies that if a safety and stability specification cannot be certified by a single Lyapunov function, then any pair of control Lyapunov and control barrier functions necessarily leads to a conflict and cannot be satisfied simultaneously in a robust sense.

14:45-15:00, Paper FrB15.6	Add to My Program
Control Oriented Real-Time Nonlinear Dynamic Pseudo Inverse for Non-Minimum Phase Systems

Sempertegui, Miguel	Ohio University
Zhu, J. Jim	Ohio University
Lawrence, Douglas A.	Ohio University
Keywords: Nonlinear output feedback, Control applications, Stability of nonlinear systems Abstract: This paper presents a novel algorithm for the dynamic inversion of nonlinear systems within the output tracking control framework, where the inverse serves as an open-loop nominal controller to facilitate feedback stabilization of the tracking error dynamics along a nominal trajectory. The desired dynamic response for the open-loop controlled system is treated as a pseudo identity whereby the causal and stable pseudo inverse is defined. For non-minimum phase systems, the Fliess canonical form is used to enable a practical pseudo inverse design without internal dynamics stabilization or reverse-time previewing. Unlike exact inversion, which does not have the desired dynamics, and can be unstable or anti-causal, the proposed pseudo inverse not only provides the desired dynamics of the open-loop controlled systems but is also stable and implementable in real time. A nontrivial application example illustrates the algorithm.


FrB16 Regular Session, Grand Salon 24	Add to My Program
Neural Networks

Chair: Xu, Xiangru	University of Wisconsin-Madison
Co-Chair: Ruths, Justin	University of Texas at Dallas

13:30-13:45, Paper FrB16.1	Add to My Program
Forward and Backward Reachability Analysis of Closed-Loop Recurrent Neural Networks Via Hybrid Zonotopes

Zhang, Yuhao	University of Wisconsin-Madison
Xu, Xiangru	University of Wisconsin-Madison
Keywords: Neural networks, Machine learning, Formal verification/synthesis Abstract: Recurrent neural networks (RNNs) are widely employed to model complex dynamical systems due to their hidden-state structure, which inherently captures temporal dependencies. This work presents a hybrid zonotope–based approach for computing exact forward and backward reachable sets of closed-loop RNN systems with ReLU activation functions. The method formulates state-pair sets to compute reachable sets as hybrid zonotopes without requiring unrolling. To improve scalability, a tunable relaxation scheme is proposed that ranks unstable ReLU units across all layers using a triangle-area score and selectively applies convex relaxations within a fixed binary limit in the hybrid zonotopes. This scheme enables an explicit trade-off between computational complexity and approximation accuracy, with exact reachability as a special case.

13:45-14:00, Paper FrB16.2	Add to My Program
State-Constrained Online Adaptive Control for Robotic Manipulators

Dey, Ritirupa	University of South Carolina
Sahoo, Avimanyu	University of Alabama in Huntsville
Narayanan, Vignesh	University of South Carolina
Keywords: Neural networks, Constrained control, Data driven control Abstract: In this letter, we present a neural network (NN)-based online adaptive controller for an n-link robot manipulator. The proposed controller utilizes a control barrier function (CBF) to enforce joint position and velocity constraints while simultaneously guaranteeing accurate trajectory tracking of the robot. In particular, we first map the constraints from the state space into the filtered-tracking error space. We demonstrate that maintaining forward invariance of the constrained error set guarantees constraint satisfaction in the state space. Additionally, we analytically establish that this transformation creates a boundary-to-boundary relationship between the two sets. The resulting NN-driven control torque ensures accurate trajectory tracking of the closed-loop robot manipulator while adhering to the prescribed state constraints even in the presence of uncertain dynamics. Finally, we validate the effectiveness of the proposed control scheme both analytically and numerically.

14:00-14:15, Paper FrB16.3	Add to My Program
Safety-Critical Adaptive Spiking Multilayer Neural Network Control of Nonlinear Systems

Ganie, Irfan Ahmad	Wilkes University
Jagannathan, Sarangapani	Missouri University of Science & Technology
Keywords: Neural networks, Adaptive control, Robotics Abstract: This paper introduces a novel safety-critical adaptive spiking neural network (SNN) control framework for uncertain nonlinear systems. On the safety side, we formulate the Augmented Barrier States (ABS) methodology into a unified safety-embedded tracking framework, establishing rigorous safety equivalence results that embed safety constraints directly within the closed-loop dynamics. On the learning side, a deep SNN architecture is developed with online adaptation enabled through direct error-driven weight update laws applied at every layer. Unlike conventional gradient approaches that struggle with instability due to the discontinuous nature of spike generation, the proposed method incorporates singular value decomposition (SVD) to regularize gradient flow, enhancing stability and convergence during online learning. The resulting controller guarantees uniformly ultimately bounded tracking while strictly preserving safety constraints for nonlinear systems, ensuring both adaptability and formal safety guarantees. Simulation studies on two-link manipulator validate the framework, demonstrating robust safety preservation, efficient real-time adaptation, and 80% computational energy savings compared to existing ANN-based approaches.

14:15-14:30, Paper FrB16.4	Add to My Program
TRASE-NODEs: Trajectory Sensitivity-Aware Neural Ordinary Differential Equations for Efficient Dynamic Modeling

Al-Janahi, Fatima	The University of Texas at Austin
Ko, Min-Seung	The University of Texas at Austin
Zhu, Hao	The University of Texas at Austin
Keywords: Neural networks, Nonlinear systems identification, Power systems Abstract: Modeling dynamical systems is crucial across the science and engineering fields for accurate prediction, con- trol, and decision-making. Recently, machine learning (ML) approaches, particularly neural ordinary differential equations (NODEs), have emerged as a powerful tool for data-driven modeling of continuous-time dynamics. Nevertheless, standard NODEs require a large number of data samples to remain consistent under varying control inputs, posing challenges to generate sufficient simulated data and ensure the safety of control design. To address this gap, we propose trajectory-sensitivity- aware (TRASE-)NODEs, which construct an augmented system for both state and sensitivity, enabling simultaneous learning of their dynamics. This formulation allows the adjoint method to update gradients in a memory-efficient manner and ensures that time-invariant control set-point effects are captured in the learned dynamics. We evaluate TRASE-NODEs using damped oscillator and inverter-based resources (IBRs). The results show that TRASE-NODEs generalize better from the limited training data, yielding lower prediction errors than standard NODEs for both examples. The proposed framework offers a data-efficient, control-oriented modeling approach suitable for dynamic systems that require accurate trajectory sensitivity prediction.

14:30-14:45, Paper FrB16.5	Add to My Program
Exact Minimal Perturbations to Quantify Robustness of Neural Networks

Chen, Justin	University of Texas at Dallas
Ruths, Justin	University of Texas at Dallas
Keywords: Neural networks, Optimization algorithms, Emerging control applications Abstract: Certification of neural networks supports their integration into commercial and safety-critical applications with formal guarantees. The Lipschitz constant, often used to certify neural networks, is a problematic proxy for robustness. Recent work on adversarial attacks has made it clear that the output of neural networks can change dramatically under minor perturbations. We expand the use of the hybrid zonotope representation of ReLU feedforward neural networks to calculate the minimum perturbation required to change a network's classification.

14:45-15:00, Paper FrB16.6	Add to My Program
Neural Hybrid Equations: Models, Basic Properties, and Approximation Results

Ochoa, Daniel E.	University of California Santa Cruz
Sanfelice, Ricardo G.	University of California at Santa Cruz
Keywords: Hybrid systems, Neural networks Abstract: This paper establishes approximation results for neural network approximators of hybrid dynamical systems. We propose neural hybrid equations, which are hybrid systems that use neural networks as the flow map and the jump map while preserving the hybrid structure. Our main result proves that solutions to neural hybrid equations, with Lipschitz non-affine activation functions, can approximate solutions of nominal hybrid systems with Lipschitz vector fields with arbitrary precision on compact time domains. For the subclass of neural hybrid equations with rectifying linear unit (ReLU) activation functions, we establish O(1/N) convergence rates where N is the number of neurons. We illustrate our results with a numerical example.


FrB17 Regular Session, Churchill A1	Add to My Program
Stochastic Optimal Control I

Chair: Leahy, Kevin	Worcester Polytechnic Institute
Co-Chair: Ma, Tong	Northeastern University

13:30-13:45, Paper FrB17.1	Add to My Program
Belief Space Control of Safety-Critical Systems under State-Dependent Measurement Noise

Walia, Rohan	Worcester Polytechnic Institute
Black, Mitchell	MIT Lincoln Laboratory
Schoer, Andrew	MIT Lincoln Laboratory
Leahy, Kevin	Worcester Polytechnic Institute
Keywords: Stochastic optimal control, Emerging control applications, Autonomous systems Abstract: Safety-critical control is imperative for deploying autonomous systems in the real world. Control Barrier Functions (CBFs) offer strong safety guarantees when accurate system and sensor models are available. However, widely used additive, fixed-noise models are not representative of complex sensor modalities with state-dependent error characteristics. Although CBFs have been designed to mitigate uncertainty using fixed worst-case bounds on measurement noise, this approach can lead to overly-conservative control. To solve this problem, we extend the Belief Control Barrier Function (BCBF) framework to accommodate state-dependent measurement noise via the Generalized Extended Kalman Filter (GEKF) algorithm, which models measurement noise as a linear function of the state. Using the original BCBF framework as baseline, we demonstrate the performance of the BCBF-GEKF approach through simulation results on a 1D single integrator setpoint tracking scenario and a trajectory tracking scenario using 2D unicycle kinematics. Our results confirm that the BCBF-GEKF approach offers less conservative control with greater safety.

13:45-14:00, Paper FrB17.2	Add to My Program
Chance-Constrained Covariance Steering for Discrete-Time Markov Jump Linear Systems

Shrivastava, Shaurya	Purdue University
Oguri, Kenshiro	Purdue University
Keywords: Stochastic optimal control, Switched systems, Optimization Abstract: In this paper, we solve the chance-constrained covariance steering problem for discrete-time Markov Jump Linear Systems (MJLS) using a convex optimization framework. We derive the analytical expressions for the mean and covariance trajectories of time-varying discrete-time MJLS and show that they cannot be separated even without chance constraints, unlike the single-mode (non-hybrid) dynamics case. To solve the covariance steering problem, we propose a two-step convex optimization framework and then incorporate chance constraints through an iterative framework. Since the stochastic hybrid nature of MJLS introduces non-Gaussian distributions unlike single-mode covariance steering, we also propose deterministic formulations for non-Gaussian chance constraints based on Cantelli's inequality and the multivariate Chebyshev inequality. Both problems are originally nonconvex, and we derive convex relaxations which are proved to be lossless at optimality using the Karush–Kuhn–Tucker (KKT) conditions. Numerical simulations demonstrate the proposed method by achieving target covariances while respecting chance constraints under additive noise, bias, and Markovian jump dynamics.

14:00-14:15, Paper FrB17.3	Add to My Program
Stabilization and Optimality of Pursuer and Evader Policies for Stochastic Differential Games

Haddad, Wassim M.	Georgia Inst. of Tech
Chitre, Ronit	Graduate Research Assistant
Keywords: Stochastic optimal control, Optimal control, Stochastic systems Abstract: This paper examines a two-player, zero-sum stochastic differential game problem over an infinite time horizon, where the players employ controller (pursuer) and stopper (evader) policies for a nonlinear system driven by Brownian motion. The pursuer's goal is to minimize a nonlinear-nonquadratic performance criterion while guaranteeing stochastic stabilization and the evader's goal is to maximize the performance criterion. By establishing a connection between stochastic Lyapunov stability theory and the stochastic Hamilton-Jacobi-Isaacs equation, we derive explicit optimal strategies for both players that guarantee stochastic stabilization as well as enforcing a saddle point condition on a nonlinear nonquadratic performance criterion. We demonstrate that global asymptotic stability in probability is ensured through a Lyapunov function, which also serves as the solution to the steady-state stochastic Hamilton-Jacobi-Isaacs equation, thereby guaranteeing both closed-loop stability in probability and optimality. Furthermore, we develop optimal feedback controller and stopper policies for affine nonlinear systems using an inverse optimality framework tailored to the game problem. These results extend existing linear feedback pursuer-evader policies to nonlinear controllers and stoppers that optimize general polynomial and multilinear performance criteria.

14:15-14:30, Paper FrB17.4	Add to My Program
Partial Stability and Optimal Control for Stochastic Dynamical Systems

Haddad, Wassim M.	Georgia Inst. of Tech
Chitre, Ronit	Graduate Research Assistant
Keywords: Stochastic optimal control, Optimal control, Stochastic systems Abstract: This paper presents a unified framework for the optimal nonlinear analysis and feedback control design of stochastic dynamical systems, specifically targeting partial stability and partial-state stabilization. We ensure partial asymptotic stability in probability for the closed-loop nonlinear system using a Lyapunov function that is both positive definite and decrescent with respect to a subset of the system state. This Lyapunov function corresponds to the steady-state solution of the stochastic Hamilton-Jacobi-Bellman equation, thereby establishing both probabilistic partial stability and optimality. The proposed framework lays the underpinning for generalizing linear-quadratic optimal control strategies to more complex settings involving nonlinear, nonquadratic, and partial-state stochastic stabilization problems. We also explore the connection between our approach and optimal regulation in both linear and nonlinear time-varying stochastic systems with quadratic and more general cost functionals. In addition, we develop optimal feedback controllers for affine stochastic nonlinear systems through an inverse optimality approach specifically designed for the partial-state stabilization setting.

14:30-14:45, Paper FrB17.5	Add to My Program
Stochastic Model Predictive Control for Colloidal Self-Assembly: A Markov State Model-Based Case Study

Rao, Jingzhi	Beijing University of Chemical Technology
Sun, Wei	Louisiana State University
Tang, Xun	Louisiana State University
Keywords: Stochastic optimal control, Process Control, Markov processes Abstract: Controlling colloidal self-assembly into specific configurations for multi-functional materials has attracted vast interests among material scientists, control engineers, and computer scientists. While various feedback control strategies have demonstrated efficacy in rapidly steering the colloidal particles into desired configurations, dealing the stochasticity still stands as a challenge. Here we investigate the application of stochastic model predictive control (sMPC) on an electric field-mediated colloidal self-assembly process, simulated by Markov state models. Specifically, we examine the robustness of sMPC with respect to model accuracy, considering the challenge of obtaining accurate process models for a high-dimensional, stochastic many-body assembly system in practice. Our findings indicate that with a terminal reward objective function, the proposed sMPC controller can yield satisfactory results, and the sMPC also tends to hold robustness against modeling error. We anticipate the work presented here to stimulate further investigations on applying sMPC for colloidal and nano-particle self-assembly systems.

14:45-15:00, Paper FrB17.6	Add to My Program
Data-Conforming Model-Free Stochastic Reinforcement Learning

Athalye, Surabhi	Georgia Institute of Technology
Ramadan, Mohammad	Argonne National Laboratory
Anitescu, Mihai	Argonne National Laboratory
Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Keywords: Stochastic optimal control, Reinforcement learning, Lyapunov methods Abstract: In this paper, we address the concern of potential changes in the state-input distribution during the policy update step of model-free reinforcement learning (RL). In most RL settings, the control policy is updated based on previously recorded data, and then evaluated over the system's future trajectory. However, the new policy may alter the closed-loop behavior and result in future trajectories that are out of distribution with respect to the past learning data, possibly compromising the system's safety due to this discrepancy. To tackle this, we propose a framework that penalizes distributional shifts during RL execution. We incorporate a regularization term that systematically dampens deviations in the distribution in the state-input space, facilitating incremental exploration. Simulations demonstrate the efficacy of our approach.


FrB18 Regular Session, Churchill A2	Add to My Program
Fault Tolerant Systems I

Chair: Bejarano, Francisco Javier	Instituto Politécnico Nacional, ESIME Ticomán
Co-Chair: Cheng, Shiyu	Washington University in St. Louis

13:30-13:45, Paper FrB18.1	Add to My Program
Progress-Based Fault Detection and Health-Aware Task Allocation for Heterogeneous Multi-Robot Systems

Cline, Jack	California Polytechnic State University
Macaranas, Christian	California Polytechnic State University
Farzan, Siavash	California Polytechnic State University
Keywords: Fault detection, Autonomous robots, Cooperative control Abstract: We present a progress-based fault detection module and its integration with dynamic task allocation for heterogeneous robot teams. The detector monitors a normalized task-completion signal with a lightweight Kalman filter (KF) and a normalized innovation squared (NIS) test, augmented with a low-rate stall gate, an uncertainty gate, and debounce logic. Health estimates influence the allocator via health-weighted costs and health-dependent masks; reallocation is event-triggered and regularized with an l1 assignment-change penalty to limit reassignment churn while preserving feasibility through slack variables. The detector has constant per-robot update cost, and the allocation remains a convex quadratic program (QP). Experiments on a common team-task setup evaluate measurement-noise increases, velocity-slip biases, communication dropouts, and task abandonment. The results show timely detection in the noise and bias cases, maintained task completion with limited reassignment, and the expected observability delays under communication dropouts.

13:45-14:00, Paper FrB18.2	Add to My Program
Estimation of Unknown Inputs for Singular Systems: Application to Fault Detection

Correu Olivares, Roberto	Instituto Politécnico Nacional
Bejarano, Francisco Javier	Instituto Politécnico Nacional, ESIME Ticomán
Keywords: Fault detection, Estimation, Linear systems Abstract: A fault estimator is proposed for singular linear systems subject to disturbances. This model accounts for faults that explicitly appear both in the state equation and in the system outputs (i.e., system, actuator and sensor faults). A key aspect of this design is that it does not require estimating either the state vector or the disturbances, which provides greater flexibility in structural conditions compared to the requirements needed to implement an unknown input observer. Furthermore, the number of unknown inputs may exceed the number of available outputs. The faults are formulated through an algebraic expression involving high-order derivatives of the system output. In this way, the recovery of the fault signals is achieved using a high-order sliding mode differentiator, which requires that the derivative of the faults maintains a bounded norm.

14:00-14:15, Paper FrB18.3	Add to My Program
Early Fault Detection and Diagnosis in Closed-Loop Control Systems Using Relay Feedback Limit Cycles

Chaudhari, Yogesh	Indian Institute of Technology Guwahati
Majhi, Somanath	Indian Institute of Technology Guwahati
Keywords: Fault detection, Fault diagnosis, Closed-loop identification Abstract: Utilizing relay-induced limit cycles to stimulate system dynamics while preserving closed-loop stability, this paper proposes an innovative fault detection and diagnosis (FDD) architecture for closed-loop control systems. The methodology integrates analytical Describing Function (DF) modeling with Fast Fourier Transform (FFT) to provide a compact, fault-sensitive feature vector capable of detecting minor parametric deviations often masked by feedback dynamics. Fault isolation is achieved by comparing test feature vectors with a database of known fault signatures using weighted cosine similarity (WCS) and a confidence-based classification scheme. This integrated method supports diagnostic and prognostic goals, makes finding early signs of problems easier, and encourages a shift toward maintenance based on conditions. A validation study is performed on a relay-excited DC motor model, demonstrating its suitability for real-time, safety-critical applications while maintaining scalability and interpretability.

14:15-14:30, Paper FrB18.4	Add to My Program
Sensor and Threshold Selection for Safe Fault Detection

Clark, Andrew	Washington University in St. Louis
Cheng, Shiyu	Washington University in St. Louis
Keywords: Fault detection, Fault tolerant systems, Observers for Linear systems Abstract: Control systems can be severely impacted by node faults due to natural failures or malicious attacks. This paper considers two design problems for unknown input observer (UIO)-based fault detection schemes. First, we consider the problem of choosing optimal detection thresholds to ensure the fault is detected before safety violations occur while avoiding false alarms. We derive bounds on the thresholds to ensure that any constant-magnitude fault can be detected. Second, we investigate the problem of ensuring that the fault detector has adequate information to detect and isolate faults. We derive sufficient conditions for the existence of a UIO as a function of the available sensors, and prove that the problem of selecting a minimum-size set of neighbors is equivalent to matroid intersection, enabling polynomial-time approximation algorithms. Our simulation results confirm that our approach is able to detect faults before safety violations occur.

14:30-14:45, Paper FrB18.5	Add to My Program
Data-Driven, Redundancy-Free Fault-Tolerant Inertial Navigation System Based on Sensor-To-Sensor Identification

Bani Hani, Odai	Iowa State University
Al Janaideh, Mohammad	University of Guelph
Aljanaideh, Khaled	American University of Sharjah
Keywords: Fault detection, Fault tolerant systems Abstract: Inertial navigation systems (INS) use measurements from inertial sensors (accelerometers and gyroscopes) to estimate the location of a moving object. The estimated location can be used by a control system to move the object to a desired location. If one of the sensors used by the INS becomes faulty during the system operation, the estimated location becomes inaccurate, which can lead to damages to the system or the environment. Therefore, in safety-critical applications such as aircraft navigation, redundant sensors are used to detect and disconnect faulty sensors, which can be expensive. In this paper, we propose a data-driven, redundancy-free fault-tolerant inertial navigation system based on sensor-to-sensor identification. We identify sensor-to-sensor models between sensors from a single inertial measurements unit (IMU). These models, which are identified under healthy conditions of the INS system, are used along with a subset of sensor measurements to predict the response of the remaining sensor measurements. If a sensor becomes faulty, then we replace its actual faulty measurement with a healthy virtual measurement obtained using the identified sensor-to-sensor model and the remaining sensor measurements. We apply the proposed approach to two experimental setups, namely, a rotating platform and a moving vehicle. The proposed approach can be used in commercial applications to ensure low cost safe operation of the INS system.

14:45-15:00, Paper FrB18.6	Add to My Program
Fundamental Limitations of Sensitivity Metrics for Anomaly Impact Analysis in LTI Systems

Dong, Jingwei	Uppsala University
Zhang, Kangkang	Imperial College London
Nguyen, Anh Tung	Uppsala University
Teixeira, André M. H.	Uppsala University
Keywords: Fault detection, Optimization, Linear systems Abstract: This study establishes a connection between the output-to-output gain (OOG), a sensitivity metric quantifying the impact of stealthy attacks, and a novel input-to-input gain (IIG) introduced to evaluate fault sensitivity under disturbances, and investigates their fundamental performance limitations arising from the transmission zeros of the underlying dynamical system. Inspired by the OOG, which characterizes the maximum performance loss caused by stealthy attacks, the IIG is proposed as a new measure of robust fault sensitivity and is defined as the maximum energy of undetectable faults for a given disturbance intensity. Then, using right (for OOG) and left (for IIG) co-prime factorizations, both metrics are expressed as the H infinity norm of a ratio of the numerator factors. This unified representation facilitates a systematic analysis of their fundamental limitations. Subsequently, by utilizing the Poisson integral relation, theoretical bounds for the IIG and OOG are derived, explicitly characterizing their fundamental limitations imposed by system non-minimum phase (NMP) zeros. Finally, a numerical example is employed to validate the results.


FrB19 Regular Session, Churchill B1	Add to My Program
Optimal Control IV

Chair: Chakravorty, Suman	Texas A&M University
Co-Chair: Usevitch, James	Brigham Young University

13:30-13:45, Paper FrB19.1	Add to My Program
A Sequential Quadratic Programming Perspective on Optimal Control

Abhijeet, Fnu	Texas A&M University
Chakravorty, Suman	Texas A&M University
Keywords: Optimal control, Optimization algorithms, Computational methods Abstract: This paper offers a unified perspective on different approaches to the solution of optimal control problems through the lens of constrained sequential quadratic programming. In particular, it allows us to find the relationships between Newton’s method, the iterative LQR (iLQR), and Differential Dynamic Programming (DDP) approaches to solve the problem. It is shown that the iLQR is a principled SQP approach, rather than simply an approximation of DDP by neglecting the Hessian terms, to solve optimal control problems that can be guaranteed to always produce a cost-descent direction and converge to an optimum; while Newton’s approach or DDP do not have similar guarantees, especially far from an optimum. Our empirical evaluations on the pendulum and cart-pole swing-up tasks serve to corroborate the SQP-based analysis proposed in this paper.

13:45-14:00, Paper FrB19.2	Add to My Program
Optimal Optical Inter-Satellite Link Tracking Via Gauss-Seidel and Riccati Iterations

Garcia, Daniel	University of New Mexico
Danielson, Claus	University of New Mexico
Keywords: Optimal control, Optimization algorithms, Predictive control for nonlinear systems Abstract: Modern Optical Intersatellite Links require sub-degree pointing accuracy at a target satellite moving relative to the ego-spacecraft. This precise pointing presents significant challenges for spacecraft attitude control systems. This paper develops a novel nonlinear optimization algorithm that avoids linearizing the dynamics on the SO(3) manifold, exploiting the causal structure of the problem using block Gauss-Seidel iterations and Riccati iterations of the state and co-state to achieve optimal reference tracking of a moving reference. The proposed algorithm achieves linear complexity in the planning horizon and maintains exact SO(3) manifold constraints throughout the optimization process. Simulations demonstrate superior tracking performance compared to a linearized attitude planning, while maintaining feasible computation speed. These results demonstrate that our algorithm provides a viable solution for next-generation spacecraft attitude control systems requiring precise reference tracking capabilities.

14:00-14:15, Paper FrB19.3	Add to My Program
Predictive Control Barrier Functions for Discrete-Time Linear Systems with Unmodeled Delays

Paredes Salazar, Juan Augusto	University of Maryland, Baltimore County
Usevitch, James	Brigham Young University
Goel, Ankit	University of Maryland Baltimore County
Keywords: Optimal control, Sampled-data control, Predictive control for linear systems Abstract: This paper introduces a predictive control barrier function (PCBF) framework for enforcing state constraints in discrete-time systems with unknown relative degree, which can be caused by input delays or unmodeled input dynamics. Existing discrete-time CBF formulations typically require the construction of auxiliary barrier functions when the relative degree is greater than one, which complicates implementation and may yield conservative safe sets. The proposed PCBF framework addresses this challenge by extending the prediction horizon to construct a CBF for an associated system with relative degree one. As a result, the superlevel set of the PCBF coincides with the safe set, simplifying constraint enforcement and eliminating the need for auxiliary functions. The effectiveness of the proposed method is demonstrated on a discrete-time double integrator with input delay and a bicopter system with position constraints.

14:15-14:30, Paper FrB19.4	Add to My Program
Sampling-Based Global Optimal Control and Estimation Via Semidefinite Programming

Groudiev, Antoine	École Normal Supérieure, PSL Research University
Schramm, Fabian	Inria, École Normal Supérieure, PSL Research University
Berthier, Eloïse	ENSTA, Institut Polytechnique De Paris
Carpentier, Justin	Inria, École Normal Supérieure, PSL Research University
Dümbgen, Frederike	Inria, École Normal Supérieure, PSL Research University
Keywords: Optimization, Optimal control, Estimation Abstract: Global optimization has gained attention over the past decades, thanks to the development of both theoretical foundations and efficient numerical routines. Among recent advances, Kernel Sum of Squares (KernelSOS) provides a powerful theoretical framework, combining the expressivity of kernel methods with the guarantees of SOS optimization. In this paper, we take KernelSOS from theory to practice and demonstrate its use on challenging control and robotics problems. We identify and address the practical considerations required to make the method work in applied settings: restarting strategies, systematic calibration of hyperparameters, methods for recovering minimizers, and the combination with fast local solvers. As a proof of concept, the application of KernelSOS to robot localization highlights its competitiveness with existing SOS approaches that rely on heuristics and handcrafted reformulations to render the problem polynomial. Even in the high-dimensional, non-parametric setting of trajectory optimization with simulators treated as black boxes, we demonstrate how KernelSOS can be combined with fast local solvers to uncover higher-quality solutions without compromising overall runtimes.

14:30-14:45, Paper FrB19.5	Add to My Program
Optimal Finite-Thrust Maneuver Augmentation for Aerobraking Missions

Dasyam, Amrutha	Wichita State University
Dutta, Atri	Wichita State University
Keywords: Optimization, Optimal control, Spacecraft control Abstract: Aerobraking maneuvers use atmospheric drag over multiple atmospheric passes to gradually decelerate a spacecraft and reduce its orbital energy for an eventual capture into a low-altitude orbit. This paper proposes a thrust-augmentation strategy closer to the periapsis to enforce dynamic pressure and heating constraints. We present dynamic pressure-based heuristic throttle laws within a per-pass optimization framework to compute thrust augmentation needed alongside apoapsis trim maneuvers for accomplishing the aerobraking maneuver. Numerical simulations for a representative Mars aerobraking scenario and comparison of the heuristics with a per-pass optimization framework demonstrate that a ramp-down throttle strategy at the beginning of periapsis passage can yield good quality solution with reduced computational time compared to optimized throttle levels.

14:45-15:00, Paper FrB19.6	Add to My Program
Exhaustive-Serve-Longest Control for Multi-Robot Scheduling Systems

Merati, Mohammad	Boston University
Castanon, David	Boston University
Keywords: Optimization, Queueing systems, Robotics Abstract: We study online task allocation for multi-robot, multi-queue systems with stochastic arrivals at discrete locations and switching delays. We formulate the problem in discrete time; each location can host at most one robot per time slot, servicing a task consumes one slot, switching between locations incurs a one-slot travel delay, and arrivals are independent Bernoulli processes. We formulate a discounted-cost Markov decision process and propose Exhaustive-Serve-Longest (ESL), a simple real-time policy that serves exhaustively when the current location is nonempty and, when idle, switches to a longest unoccupied nonempty location. We prove the optimality of such policy under symmetry conditions on the arrival processes at each location. We compare in simulation the performance of the ESL policy with two well-known policies: a first-come, first-serve policy and a fixed-dwell cyclic policy. Across variations in server–location ratios and loads, ESL consistently yields lower discounted holding cost and smaller mean queue lengths, with action-time fractions showing more serving and restrained switching. When the symmetry conditions are violated, we compare the performance of the ESL policy on smaller problems with an optimal dynamic programming solution, and determine that the ESL performance is within 2% of the optimal performance. Its simplicity and robustness make ESL a practical default for real-time multi-robot scheduling systems.


FrB20 Regular Session, Churchill B2	Add to My Program
Model Predictive Control III

Chair: Milios, Elias Lido Celestino	Robert Bosch GmbH
Co-Chair: Ramezani, Alireza	Northeastern University

13:30-13:45, Paper FrB20.1	Add to My Program
Tube-Based Robust MPC for Variable Air Volume Systems in Airport Terminals

Zhang, Liguo	Beijing University of Technology
He, Kaichao	Beijing University of Technology
Zhan, Jingyuan	Beijing University of Technology
Shi, Rui	Beijing University of Technology
Keywords: Building and facility automation, Grey-box modeling, Predictive control for linear systems Abstract: This paper presents a tube-based model predictive control (Tube-MPC) framework for multi-zone heating, ventilation, and air conditioning (HVAC) systems in airport terminals. The proposed method explicitly addresses system uncertainties arising from passenger flow fluctuations and weather variations, ensuring robust satisfaction of thermal comfort and operational constraints. By incorporating disturbance-invariant sets into the control design, the controller maintains feasible operation while improving resilience to external perturbations. Simulation studies on a five-zone airport terminal case demonstrate that, compared with conventional MPC, the proposed Tube-MPC approach provides stronger disturbance rejection in the hall zone while maintaining the comfort range.

13:45-14:00, Paper FrB20.2	Add to My Program
Contract-Based Hierarchical Control Using Predictive Feasibility Value Functions

Berkel, Felix	Robert Bosch GmbH
Wabersich, Kim Peter	Robert Bosch GmbH
Xiang, Hongxi	ETH Zurich
Milios, Elias Lido Celestino	Robert Bosch GmbH
Keywords: Predictive control for nonlinear systems, Hierarchical control, Machine learning Abstract: Today's control systems are often characterized by modularity and safety requirements to handle complexity, resulting in the use of hierarchical control structures. Although hierarchical model predictive control offers favorable properties, achieving a provably safe, yet modular design remains a challenge. This paper introduces a contract-based hierarchical control strategy to improve the performance of control systems facing challenges related to model inconsistency and independent controller design across hierarchies. We consider a setup where a higher-level controller generates references that affect the constraints of a lower-level controller, which is based on a soft-constrained MPC formulation. The optimal slack variables of the lower-level MPC serve as the basis for a contract that allows the higher-level controller to assess the feasibility of the reference trajectory without exact knowledge of the model, constraints, and cost of the lower-level controller. To ensure computational efficiency while maintaining model confidentiality, we propose using an explicit function approximation, such as a neural network, to represent the cost of optimal slack values. The approach is tested for a hierarchical control setup consisting of a planner and a motion controller as commonly found in autonomous driving.

14:00-14:15, Paper FrB20.3	Add to My Program
Statistically Consistent Approximate Model Predictive Control

Milios, Elias Lido Celestino	Robert Bosch GmbH
Wabersich, Kim Peter	Robert Bosch GmbH
Berkel, Felix	Robert Bosch GmbH
Gruber, Felix	Robert Bosch GmbH
Zeilinger, Melanie N.	ETH Zurich
Keywords: Predictive control for nonlinear systems, Machine learning, Neural networks Abstract: Model Predictive Control (MPC) offers rigorous safety and performance guarantees but is computationally intensive. Approximate MPC aims to circumvent this drawback by learning a computationally cheaper surrogate policy. Common approaches focus on imitation learning (IL) via behavioral cloning (BC). However, BC fundamentally fails to provide accurate approximations when MPC solutions are set-valued due to non-convex constraints or local minima. We propose a two-stage IL procedure to accurately approximate nonlinear, potentially set-valued MPC policies. The method integrates an approximation of the MPC's optimal value function into a one-step look-ahead loss function, and thereby embeds the MPC's constraint and performance objectives into the IL objective. We prove statistical consistency for policies that exactly minimize our IL objective, implying convergence to a safe and stabilizing control law, and establish input-to-state stability guarantees for approximate minimizers. Simulations demonstrate improved performance compared to BC.

14:15-14:30, Paper FrB20.4	Add to My Program
Scalable and Modular Control Framework for Managing Active Cell Balancing Challenges

Ahmed, Afaq	COMSATS University Islamabad
Uppal, Ali Arshad	COMSATS University Islamabad
Ahmed, Qadeer	The Ohio State University
Keywords: Predictive control for nonlinear systems, Modeling, Control applications Abstract: Active cell balancing (ACB) is an important paradigm in an electric vehicle’s (EV) battery pack. However, in designing the ACB network (ACBN), the interconnections among cells and the selection of power electronic components lead to large variations. Therefore, to evaluate the performance of these diversifications, a modular and scalable framework is required. Consequently, this work utilizes graph theory to model and evaluate the ACBN performance for EV range extension. In this respect, a buck-boost (BB) converter-based architecture, which comprises N series adjacent cells, is modeled using the multiplex (M)–a variant of a graph. Moreover, the high-fidelity expressions for balancing currents are integrated with M. Building on that, a nonlinear model predictive control (NMPC) is formulated and subsequently solved to perform ACB. Similarly, the stability argument for the NMPC based on N cells is provided, and the provision to perform ACB for various ACBN types under the graph-NMPC-based framework is discussed. The simulations are performed for various reallife driving scenarios, and the results demonstrate that the BB converter-based architecture has been able to extend the average EV range up to 32 km under NMPC-based balancing.

14:30-14:45, Paper FrB20.5	Add to My Program
Contact-Implicit MPC for Multimodal Locomotion

Venkatesh Krishnamurthy, Kaushik	Northeastern University
Salagame, Adarsh	Northeastern University
Sihite, Eric	California Institute of Technology
Ramezani, Alireza	Northeastern University
Keywords: Predictive control for nonlinear systems, Optimal control, Modeling Abstract: Multimodal legged-aerial robots offer unique advantages in navigating complex environments by seamlessly transitioning between terrestrial and aerial locomotion modes. However, coordinating propulsive forces from thrusters with ground contact forces presents significant control challenges. This paper presents a contact-implicit model predictive control(CI-MPC) framework that unifies the control of legged and aerial subsystems without predefined contact schedules. We formulate the problem using a 2D thruster-assisted hopper as a reduced-order model, embedding Moreau time-stepping scheme within the MPC dynamics constraints to handle contact forces through differential inclusion and complementarity conditions. This formulation allows ground reaction forces to emerge naturally from the physical constraints. Simulation results demonstrate the framework’s capability to traverse diverse terrains including multi-level steps, uneven surfaces, and wave-like obstacles while maintaining a forward velocity of 1 m/s.Analysis reveals that the contact constraints lead to physically consistent behaviors, with ground reaction forces residing on friction cone boundaries. This unified framework eliminates mode-switching discontinuities and enables the natural discovery of hybrid locomotion strategies, offering a promising direction for controlling multimodal robotic systems.


FrB21 Regular Session, Churchill C1	Add to My Program
Robust Control

Chair: Bhattacharya, Raktim	Texas A&M
Co-Chair: Bhadani, Rahul	University of Arizona

13:30-13:45, Paper FrB21.1	Add to My Program
Robust Attitude Control of Nonlinear UAV Dynamics with LFT Models and mathcal{H}_infty Performance

Kumar, Tanay	Texas A&M University
Bhattacharya, Raktim	Texas A&M University
Keywords: Robust control, Linear parameter-varying systems, Autonomous systems Abstract: Attitude stabilization of unmanned aerial vehicles (UAVs) in uncertain environments presents significant challenges due to nonlinear dynamics, parameter variations, and sensor limitations. This paper presents a comparative study of H∞ and classical PID controllers for multi-rotor attitude regulation in the presence of wind disturbances and gyroscope noise. The flight dynamics are modeled using a linear parameter-varying (LPV) framework, where nonlinearities and parameter variations are systematically represented as structured uncertainties within a linear fractional transformation formulation. A robust controller based on H∞ formulation is designed using only gyroscope measurements to ensure guaranteed performance bounds. Nonlinear simulation results demonstrate the effectiveness of the robust controllers compared to classical PID control, showing significant improvement in attitude regulation under severe wind disturbances.

13:45-14:00, Paper FrB21.2	Add to My Program
Lagrangian-Based Disturbance Observer for Uncertain Mechanical Systems: A Case Study of 1 Degree of Freedom

Lee, Hyobin	University of Seoul
Park, Gyunghoon	University of Seoul
Keywords: Robust control, Mechanical systems/robotics, Uncertain systems Abstract: This paper introduces a new formulation of the disturbance observer (DOB) for a class of uncertain one degree-of-freedom mechanical systems from the perspective of Lagrangian mechanics. Unlike most existing works where the inverse of the nominal model is explicitly used to estimate the lumped disturbance, we derive an alternative expression of the lumped disturbance by differentiating the Lagrangian (i.e., the difference between the kinetic and potential energies), which yields an inverse-model-free DOB structure. The proposed inverse-model-free approach requires less model information for DOB design (e.g., the centripetal/Coriolis term). While our new formulation inherently introduces a singularity issue, we show that such singularities can be handled using a damped pseudo-inverse. Using singular perturbation theory, we rigorously show that this preserves the desired practical tracking behavior under the stated conditions of the DOB compared with other inverse-model-based designs. Simulation results also show that the damped pseudo-inverse provides additional freedom to handle measurement noise, allowing the proposed DOB to outperform existing methods in noisy environments.

14:00-14:15, Paper FrB21.3	Add to My Program
Robust Regret Control with Uncertainty-Dependent Baseline

Liu, Jietian	University of Michigan
Seiler, Peter	University of Michigan, Ann Arbor
Keywords: Robust control, Optimal control, Uncertain systems Abstract: This paper proposes a robust regret control framework in which the performance baseline adapts to the realization of system uncertainty. The plant is modeled as a discrete-time, uncertain linear time-invariant system with real-parametric uncertainty. The performance baseline is the optimal non-causal controller constructed with full knowledge of the disturbance and the specific realization of the uncertain plant. We show that a controller achieves robust additive regret relative to this baseline if and only if it satisfies a related, robust H_infty performance condition on a modified plant. One technical issue is that the modified plant can, in general, have a complicated nonlinear dependence on the uncertainty. We use a linear approximation step so that the robust additive regret condition can be recast as a standard mu-synthesis problem. A numerical example is used to demonstrate the proposed approach.

14:15-14:30, Paper FrB21.4	Add to My Program
Robust Multi-Objective Control for DC-DC Converters under Parametric Uncertainties: Experimental Validation

Smouni, Omaima	Université De Picardie Jules Verne - MIS / Icam De Grand-Paris Sud
Nachidi, Meriem	Icam / MIS
Rabhi, Abdelhamid	Universite De Picardie Jules Verne - MIS
Bosche, Jerome	Universite De Picardie Jules Verne - MIS
Yazidi, Amine	Université De Picardie Jules Verne - LTI
Keywords: Robust control, Power electronics, LMIs Abstract: This paper introduces a new robust multi-objective control strategy for DC-DC converters under input saturation and parametric uncertainties. An integral state-feedback controller is designed using linear matrix inequalities not only to ensure precise reference track- ing, but also to ensure that control signals remain within defined bounds based on a deadzone nonlinearity and a sector condition-based design. The controller also effectively rejects disturbances using an H∞ approach. Additionally, parametric uncertainties of the converter are incorporated into the controller design. The performance of the proposed controller is evaluated under various scenarios using a real prototype. The experimental results demonstrate the effectiveness and robustness of the proposed approach

14:30-14:45, Paper FrB21.5	Add to My Program
Resilient Composite Control for Stability Enhancement in EV Integrated DC Microgrids

Islam, Md Saiful	The University of Alabama in Huntsville
Bhadani, Rahul	The University of Alabama in Huntsville
Keywords: Robust control, Power electronics Abstract: When electric vehicles (EVs) are integrated into the standalone DC microgrids (DCMGs), stability issues arise due to their constant power load (CPL) behavior, which introduces negative incremental impedance (NII). In addition, the DCMGs suffer from an inherent low-inertia problem. Therefore, this study presents a composite controller incorporating a global integral terminal sliding mode controller with a backstepping controller. A virtual capacitor is employed to mitigate the lower inertia issue and strengthen the DC-bus voltage response. An improved fractional power-based reaching law decreases chattering as well as accelerates convergence. Exact feedback linearization converts the nonlinear power converter model into Brunovsky's canonical form, thereby mitigating NII effects and non-minimum phase issues. The entire system stability is verified using Lyapunov control theory. Finally, simulation outcomes confirm the superior performance of the proposed approach, with 34.4-53.3% reduction in overshoot, 52.9-74.9% in undershoot, and 12-47.4% in settling time compared to the existing controller.

14:45-15:00, Paper FrB21.6	Add to My Program
Robust Position Tracking Control of Electro-Hydraulic Actuators: Elimination of Velocity Measurements

Taskingollu, Sule	Ege University
Bayrak, Alper	Bolu Abant Izzet Baysal University
Selim, Erman	Ege University
Tatlicioglu, Enver	Ege University
Zergeroglu, Erkan	Gebze Technical University
Keywords: Robust control, Robotics, Lyapunov methods Abstract: This work presents a robust backstepping type controller formulation for the position tracking control of engineering systems actuated via electro-hydraulic actuators (EHAs). Specifically, a robust controller that does not require accurate knowledge of the system parameters and uses only position measurements is proposed. A filtered based approach is applied to remove the velocity dependency of the controller formulation. Stability of the closed loop system and the uniform boundedness of the tracking error signals are ensured via Lyapunov based arguments. The overall performance of the proposed method is illustrated, initially through physics-based MATLAB/Simscape studies, and then experimentally on a 1 degree of freedom (dof) EHA test-bed and a 2 dof robotic arm.


FrB22 Invited Session, Churchill C2	Add to My Program
Modeling, Control and Estimation of Soft Material and Continuum Systems

Chair: Zhang, Jun	University of Nevada Reno
Co-Chair: Haghshenas-Jaryani, Mahdi	New Mexico State University
Organizer: Vikas, Vishesh	University of Alabama
Organizer: Zhang, Jun	University of Nevada Reno
Organizer: Haghshenas-Jaryani, Mahdi	New Mexico State University
Organizer: Tan, Xiaobo	Michigan State University

13:30-13:45, Paper FrB22.1	Add to My Program
Design and Modeling of the Coiled String Actuator with a Constant Twisting and Coiling Zone (I)

Noack, Will	University of Nevada, Reno
Zhang, Jun	University of Nevada Reno
Keywords: Modeling, Smart structures, Robotics Abstract: Evolved from the twisted string actuator, the coiled string actuator (CSA) produces appreciable and reasonably consistent actuation by coiling a pair of fully twisted strings. Being a recently discovered compliant actuator, only the CSA with a varying twisting and coiling zone has been considered, namely, the strings are coiled as a continuum. CSA in this configuration cannot be easily applied to applications where coiled strings have direct contact with the system. It is strongly desirable to study CSA with a constant twisting and coiling zone, whose string sliding region can be conveniently embedded into systems like tendons. CSA in this configuration exhibits a unique simultaneous twisting-induced and coiling-induced actuation, which challenges both the actuator design and modeling. In this study, we propose a separator structure design that decreases the friction caused by the string-separator contact. A kinematic model is proposed that captures the simultaneous twisting-induced and coiling-induced actuation of the CSA with a constant twisting and coiling zone. Encouraging preliminary experimental characterizations and model simulation results are provided.

13:45-14:00, Paper FrB22.2	Add to My Program
Direct Data-Driven Predictive Control for a Three-Dimensional Cable-Driven Soft Robotic Arm (I)

Ouyang, Cheng	Mississippi State University
Ul Islam, Moeen	Mississippi State University
Chen, Dong	Mississippi State University
Zhang, Kaixiang	Michigan State University
Li, Zhaojian	Michigan State University
Tan, Xiaobo	Michigan State University
Keywords: Control applications, Robotics, Intelligent systems Abstract: Soft robots offer significant advantages in safety and adaptability, yet achieving precise and dynamic control remains a major challenge due to their inherently complex and nonlinear dynamics. Recently, Data-enabled Predictive Control (DeePC) has emerged as a promising model-free method that bypasses explicit system identification by directly leveraging input–output data. While DeePC has shown success in other domains, its application to soft robots remains underexplored, particularly for 3D soft robotic systems. This paper addresses this gap by developing and experimentally validating an effective DeePC framework on a 3D cable-driven soft arm. Specifically, we design and fabricate a soft robotic arm with a thick tubing backbone for stability, a dense silicone body with large cavities for strength and flexibility, and rigid endcaps for secure termination. Using this platform, we implement DeePC with singular value decomposition (SVD)-based dimension reduction for two key control tasks: fixed-point regulation and trajectory tracking in 3D space. Comparative experiments with a baseline model-based controller demonstrate DeePC’s superior accuracy and adaptability, highlighting its potential as a practical solution for dynamic control of soft robots.

14:00-14:15, Paper FrB22.3	Add to My Program
DiSA-IQL: Offline Reinforcement Learning for Robust Soft Robot Control under Distribution Shifts (I)

He, Linjin	Georgetown University
Qi, Xinda	Michigan State University
Chen, Dong	Mississippi State University
Li, Zhaojian	Michigan State University
Tan, Xiaobo	Michigan State University
Keywords: Control applications, Robotics, Reinforcement learning Abstract: Soft snake robots offer remarkable flexibility and adaptability in complex environments, yet their control remains challenging due to highly nonlinear dynamics. Existing model-based and bio-inspired controllers rely on simplified assumptions that limit their performance. Deep reinforcement learning (DRL) has recently emerged as a promising alternative, but online training is often impractical because of costly and potentially damaging real-world interactions. Offline RL provides a safer option by leveraging pre-collected datasets, but it suffers from distribution shift, which degrades generalization to unseen scenarios. To overcome this challenge, we propose DiSA-IQL (Distribution-Shift-Aware Implicit Q-Learning), an extension of IQL that incorporates robustness modulation by penalizing unreliable state–action pairs to mitigate distribution shift. We evaluate DiSA-IQL on goal-reaching tasks across two settings: in-distribution and out-of-distribution evaluation. Simulation results show that DiSA-IQL consistently outperforms baseline models, including Behavior Cloning (BC), Conservative Q-Learning (CQL), and vanilla IQL, achieving higher success rates, smoother trajectories, and greater robustness.

14:15-14:30, Paper FrB22.4	Add to My Program
Learning to Crawl: Latent Model-Based Reinforcement Learning for Soft Robotic Adaptive Locomotion (I)

Gzenda, Vaughn	Carleton University
Chhabra, Robin	Toronto Metropolitan University
Keywords: Autonomous robots, Reinforcement learning, Adaptive systems Abstract: Soft robotic crawlers are mobile robots that utilize soft body deformability and compliance to achieve locomotion through surface contact. Designing control strategies for such systems is challenging due to model inaccuracies, sensor noise, and the need to discover locomotor gaits. In this work, we present a model-based reinforcement learning (MB-RL) framework in which Gaussian latent dynamics inferred from onboard sensors serve as a predictive model that guides an actor-critic algorithm to optimize locomotor policies. We evaluate the framework on a minimal crawler model in simulation using inertial measurement units and time-of-flight sensors as observations. The learned latent dynamics enable short-horizon motion prediction while the actor-critic discovers effective locomotor policies. This approach highlights the potential of latent-dynamics MB-RL for enabling embodied soft robotic adaptive locomotion based solely on noisy sensor feedback.

14:30-14:45, Paper FrB22.5	Add to My Program
A Physics-Informed Geometric Model for Characterizing Force and Length Variation of McKibben Pneumatic Artificial Muscles (I)

Montoya, Sabrina	New Mexico State University
Haghshenas-Jaryani, Mahdi	New Mexico State University
Keywords: Robotics, Modeling, Model Validation Abstract: Pneumatic Artificial Muscles (PAMs) are a type of soft actuator used in robotics to mimic the behavior of biological muscle, typically consisting of an inner bladder and a form of outer sleeve that causes contraction when pressurized. Many mathematical models have been developed to predict the force and displacement of PAMs. However, these models are mainly sensitive to the variation in materials and geometries of PAMs, making them limited in practical use for specific PAMs. This paper presents a unified physics-informed geometric model of PAMs for a range of sizes with initial lengths of 60, 70, and 80 mm and initial diameters ranging from 3.175, 6.35, and 9.525 mm. The fabrication and experimental testing of a type of PAM produced in our Lab for the muscle-driven snake-like robot locomotion is discussed. A series of experimental isometric and isotonic tests was conducted on customized PAMs with varying geometries made of the same materials. Finally, an empirical modeling approach was utilized, which resulted in a geometric model that preserves the structure of the Pujana-Aresse model. The model is limited to the size range tested in the lab and has only been validated for the aforementioned sizes. These muscle sizes were explored based on the overall geometry of the snake and the desired application for the muscles. For validation, the model prediction was compared to the experimental results, yielding a mean Normalized Root Mean Square Error (NRMSE) of 16.3% for the isometric force and 12% for the isotonic displacement.

14:45-15:00, Paper FrB22.6	Add to My Program
Adaptive Admittance Control of Nasal Swab Robots

Hsiao, Tesheng	National Yang Ming Chiao Tung University
Li, Yi-Jing	National Yang Ming Chiao Tung University
Keywords: Robotics, Adaptive systems, Control applications Abstract: Nasal swabbing is an essential procedure for detecting various respiratory diseases, and performing it with a robot can reduce the risk of infection for medical personnel. However, nasal swab robots face several challenges. For example, the narrow and elongated nasal passage allows only a small pose error during swab insertion, and the internal nasal structure varies slightly depending on each individual’s health condition. To deal with these problems, this paper proposes an admittance control law that enables the robot to adjust the swab’s pose based on the feedback of contact force and torque. However, the choice of admittance parameters significantly affects the behavior of the robot. To reduce the contact force and alleviate oscillation of the swab simultaneously, an adaptive admittance parameter strategy is introduced that modifies the admittance parameter in real time according to the pose of the swab. Consequently, safety, efficiency, and comfort of the nasal swabbing process are enhanced. Then, the proposed adaptive admittance control law was implemented in a 6-axis industrial robot, and a curved channel was constructed as a test environment for experimental verification. The results showed that a satisfactory balance among all performance indices, regarding safety, efficiency, and comfort, was achieved.


FrC03 Regular Session, Grand Salon 3	Add to My Program
Quantum Information and Control

Chair: Quintero, Kaydian	Embry-Riddle Aeronautical University
Co-Chair: Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech

15:30-15:45, Paper FrC03.1	Add to My Program
Exploring Chebyshev Spectral Method with Quantum Annealing Via Carleman Linearization

Nguyen, Hieu	North Carolina Agricultural and Technical State University
Ngo, Anh Phuong	North Carolina A&T State University
Keywords: Quantum information and control, Computational methods, Optimization Abstract: This paper explores the integration of Carleman linearization and quantum annealing to solve nonlinear ordinary differential equations (ODEs) under the Chebyshev spectral method. Using Carleman linearization, the computation of optimal Chebyshev coefficients is reformulated as a quadratic program, which can be further transformed into a Quadratic Unconstrained Binary Optimization (QUBO) problem via binary expansion. This QUBO formulation is well-suited for quantum annealers, where solutions are obtained by minimizing the Hamiltonian of the corresponding Ising model. To address hardware limitations, we employ an iterative quantum annealing strategy with dynamically updated parameters. Case studies on the D-Wave platform demonstrate the potential of our approach to harness the strengths of Carleman linearization and Chebyshev approximation theory, offering continuous and differentiable solutions to nonlinear ODEs, especially promising when quantum resources become more economically viable.

15:45-16:00, Paper FrC03.2	Add to My Program
Robust Multi-Parameter & State Estimation in Dissipative Flux Qubit Systems Via Sliding Mode Control

Quintero, Kaydian	Embry-Riddle Aeronautical University
Drakunov, Sergey V.	Embry-Riddle Aeronautical University
Berhane, Bereket	Embry-Riddle Aeronautical University
Keywords: Quantum information and control, Identification Abstract: Recent advancements in quantum computing, secure communication, and precision sensors indicate a transition from theoretical exploration to practical applications in quantum technologies. This paper addresses state estimation and control of superconducting flux qubits, a promising platform for scalable quantum computation. A novel adaptive sliding mode observer (ASMO) is proposed for simultaneous robust state reconstruction and online estimation of unknown damping parameters β and Λ in the presence of dissipation and measurement noise. Using incomplete measurement data, the observer ensures accurate estimation while maintaining robustness to model uncertainties. A sliding mode control law is then developed to stabilize the system and achieve desired trajectory tracking. Lyapunov-based analysis guarantees convergence of the estimation and control errors and ensures closed-loop stability. Simulation results demonstrate the effectiveness and robustness of the proposed framework, indicating its potential to enhance the reliability and stability of superconducting quantum processors under realistic operating conditions

16:00-16:15, Paper FrC03.3	Add to My Program
Continuous-Time Quantum Reservoirs

Chen, Anthony Siming	University of Nottingham
Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Keywords: Quantum information and control, Machine learning, Neural networks Abstract: This paper studies temporal learning with quantum reservoirs driven by continuous-time dynamics. Our goal is to model long-range dependencies under irregular sampling while avoiding backpropagation through time and repeated numerical ODE integration. We formulate a Continuous-time Quantum Reservoir (CQR) framework in which an input-modulated Hamiltonian drives Schrödinger evolution, and a classical linear readout is fitted from sampled quantum features. We analyze three variants: (i) CQR-unitary, which uses expectation values of Hermitian observables; (ii) CQR-measurement, which uses projective-measurement probabilities; and (iii) CQR-trainable, which additionally tunes input-coupling Hamiltonian parameters via parameter-shift updates. Experiments on Lorenz forecasting, human-activity classification, and two regression benchmarks show competitive accuracy, improved memory capacity, and clear compute-accuracy trade-offs across variants. We also provide a Stone-Weierstrass-based universality result for the induced feature algebra and discuss practical design limitations.

16:15-16:30, Paper FrC03.4	Add to My Program
Quantum Deception: Honey-X Deception Using Quantum Games

Reppas, Efstratios	Georgia Institute of Technology
Wadi, Ali	Georgia Institute of Technology
Gould, Brendan	Georgia Institute of Technology
Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Keywords: Game theory, Quantum information and control, Optimization Abstract: In this paper, we develop a framework for deception in quantum games, extending the Honey-X paradigm from classical zero-sum settings into the quantum domain. Building on a view of deception in classical games as manipulation of a player’s perception of the payoff matrix, we formalize quantum deception as controlled perturbations of the payoff Hamiltonian subject to a deception budget. We show that when victims are aware of possible deception, their equilibrium strategies surprisingly coincide with those of naive victims who fully trust the deceptive Hamiltonian. This equivalence allows us to cast quantum deception as a bilevel optimization problem, which can be reformulated into a bilinear semidefinite program. To illustrate the framework, we present simulations on quantum versions of the Penny Flip game, demonstrating how quantum strategy spaces and non-classical payoffs can amplify the impact of deception relative to classical formulations.

16:30-16:45, Paper FrC03.5	Add to My Program
Data-Driven Actuator Selection for High-Fidelity Quantum Optimal Control

Wadi, Ali	Georgia Institute of Technology
Vamvoudakis, Kyriakos G.	Georgia Inst. of Tech
Keywords: Quantum information and control, Optimal control, Machine learning Abstract: In this paper, we propose a new framework for quantum optimal actuator selection that unifies control allocation with optimal quantum control design. To guarantee the uniqueness of solutions, we transform the infinite-horizon control allocation problem into an equivalent finite-horizon setting. Leveraging the isometry of the Bloch representation, we systematically characterize available actuators and introduce a quantitative measure of controllability based on accessible control degrees of freedom. Building on this foundation, we develop a model-free, trajectory-informed machine learning approach that evaluates control performance directly from system trajectories to identify the optimal actuator set. Finally, we provide rigorous analysis demonstrating that, even under unknown system dynamics, the proposed method converges to the optimal value function for a sufficiently large time horizon.

16:45-17:00, Paper FrC03.6	Add to My Program
Quantum-Assisted Barrier Sequential Quadratic Programming for Nonlinear Optimal Control

Binandeh Dehaghani, Nahid	Aalborg University
Wisniewski, Rafal	Aalborg University
Aguiar, A. Pedro	Faculty of Engineering, University of Porto
Keywords: Quantum information and control Abstract: We propose a quantum-assisted framework for solving constrained finite-horizon nonlinear optimal control problems using a barrier Sequential Quadratic Programming (SQP) approach. A quantum subroutine is incorporated to efficiently solve the Schur complement step using block-encoding and Quantum Singular Value Transformation techniques. We formally analyze the time complexity and convergence behavior under the cumulative effect of quantum errors, establishing local input-to-state stability and convergence to a neighborhood of the stationary point, with explicit error bounds in terms of the barrier parameter and quantum solver accuracy. The proposed framework enables computational complexity to scale polylogarithmically with the system dimension demonstrating the potential of quantum algorithms to enhance classical optimization routines in nonlinear control applications.


FrC04 Regular Session, Grand Salon 4	Add to My Program
Human-In-The-Loop Control

Chair: Hedengren, John	Brigham Young University
Co-Chair: Barton, Kira	University of Michigan, Ann Arbor

15:30-15:45, Paper FrC04.1	Add to My Program
Adaptive Driving Style for SAE Level-2 Driving Automation: Minimizing Preference Mismatch

Akash, Kumar	Honda Research Institute USA, Inc
Zheng, Zhaobo	Honda Research Institute USA, Inc
Misu, Teruhisa	Honda Research Institute USA, Inc
Krishnamoorthy, Vidya	San Jose State University
Dong, Mia	San Jose State University
Lee, Yuni	San Jose State University
Huang, Gaojian	San Jose State University
Keywords: Human-in-the-loop control, Automotive control, Cooperative control Abstract: Driving style is a key factor in the comfort and acceptance of automated vehicle (AV) features. In SAE Level-2 automation, where the driver must supervise the system and remain ready to intervene, mismatches between the automation’s driving style and the driver’s preference can reduce trust and trigger takeovers. This paper proposes an adaptive driving-style control framework that minimizes such preference mismatch. In a driving-simulator study, we compare fixed, trust-based, and preference-based adaptation heuristics and analyze their effects on preference mismatch and trust. We then train a driving-preference prediction model and use it in an implicit adaptation policy that selects among bounded driving styles for upcoming events. A validation study shows that the predictive policy achieves equal or lower preference mismatch than comparison baselines, particularly when starting from a less defensive style, while also yielding higher average trust. The results provide a step toward developing human-aware driving automation that can implicitly adapt its driving style to the driver’s preferences.

15:45-16:00, Paper FrC04.2	Add to My Program
LLM-Enhanced Human-In-The-Loop MPC: TCLab Demonstration for Pharma 4.0

Pershing, Jonathan	Brigham Young University
Stone, Nathan	Brigham Young University
Joseph, Allison	Brigham Young University
Whitaker, Darren	Takeda Development Center Americas
Hedengren, John	Brigham Young University
Keywords: Human-in-the-loop control, Intelligent systems, Control applications Abstract: Industry 4.0 in pharmaceutical manufacturing demands advanced automation and control to handle increasingly specialized drugs and personalized therapies. However, leveraging sophisticated controls like model predictive control (MPC) often requires expertise beyond the typical plant operator. This paper proposes a human-in-the-loop framework where a large language model (LLM) assists operators in configuring and tuning an MPC using natural language commands. The LLM translates high-level objectives (e.g. “speed up the response,” “avoid overshoot”) into MPC tuning adjustments and provides real-time analysis of controller performance. A human supervisor remains in the loop as a safety net, which is essential in regulated pharma environments to ensure trust and compliance. The concept is demonstrated on a Temperature Control Lab (TCLab) hardware setup using a GEKKO-based MPC. Results show that key control objectives can be modified on the fly via natural language, enabling intuitive operator interaction. Four representative scenarios illustrate how LLM-translated commands affect MPC behavior, and an insight-generation feature provides automated textual analysis of performance. This human-in-the-loop, LLM-assisted MPC approach offers a pathway to deploy advanced control in pharma manufacturing while preserving human oversight.

16:00-16:15, Paper FrC04.3	Add to My Program
Energy-Efficient Intent Estimation and Distributed Optimal Control for Human–Multi-Robot Collaboration

Ganie, Irfan Ahmad	Wilkes University
Jagannathan, Sarangapani	Missouri Univ of Science & Tech
Keywords: Human-in-the-loop control, Learning, Optimal control Abstract: This paper introduces an energy-efficient distributed optimal control framework for human–multi-robot cooperative manipulation, combining a spiking neural network (SNN) observer for biologically inspired intent estimation with a game-theoretic distributed optimal control strategy for coordination. At the estimation level, the event-driven SNN captures human motor intent from local force feedback and consensus information, enabling real-time trajectory estimation through sparse spike processing. Unlike conventional neural networks, the SNN processes asynchronous events to significantly reduce computational energy, while singular value decomposition (SVD)–based weight tuning laws ensure stable gradient propagation during online learning. At the control level, a multilayer NN-based actor-critic architecture applies adaptive dynamic programming within a cooperative game-theoretic setting. Coupled Hamilton-Jacobi-Bellman equations are solved through neighborhood optimization, allowing each robot to minimize a performance cost that incorporates local dynamics, neighboring interactions, and human-robot force coordination. Simulation results in human–multi-robot collaborative tasks, achieve superior tracking accuracy with a 45% reduction in operational cost compared to state-of-the-art methods. The SNN-based observer further reduces energy consumption by 60% while maintaining estimation accuracy, enabling practical real-time deployment on resource-constrained robotic systems.

16:15-16:30, Paper FrC04.4	Add to My Program
Characterization of Modes of Human-Autonomy Team Behavior Using the Distance to Generalized Nash Equilibria in Shared Vehicle Control

Dudek, Aleksandra	University of Michigan
Naidja, Nouhed	CentraleSupelec/Laboratoire Des Signaux Et Systèmes (L2S)
James, Scott Clifford	Applied Dynamics International, Inc
Castanier, Matthew	US Army DEVCOM Ground Vehicle Systems Center
Pechberti, Steve	Institut VEDECOM, Versailles, France
Rahal, Mohamed-Cherif	Institut VEDECOM, Versailles, France
Vermillion, Christopher	University of Michigan
Barton, Kira	University of Michigan, Ann Arbor
Keywords: Human-in-the-loop control, Modeling Abstract: This paper quantifies the performance of human-autonomy teams (HATs) by measuring the distance between observed strategies and the generalized Nash equilibrium (GNE). Traditional approaches of assessing HATs do not fully capture the bidirectional interactions or provide actionable insight to improve team behavior. Here, we model HATs as non-zero-sum games and introduce a potential function (informed by those seen in potential games) to characterize how a HAT's behavior diverges from the identified GNE strategy. In a user study with a shared-control vehicle simulator, we apply the metric to assess the effectiveness of the teams formed between an autonomous agent and human participants. We further identify distinct behavior modes of participants based on the trends in the potential function values across different ratios of shared control. The results demonstrate the utility of the potential function in distinguishing patterns of human behavior, providing a pathway towards enhancing the effectiveness of HATs.

16:30-16:45, Paper FrC04.5	Add to My Program
Arbitration with Control Barrier Functions for Safe Shared Control

Uzun, Muhammed Yusuf	Bilkent University
Yildiz, Yildiray	Bilkent University
Keywords: Human-in-the-loop control Abstract: By combining automation accuracy with human adaptability, shared control provides enhanced performance and safety in dynamic, complex environments. Traditional arbitration methods for integrating automation and human inputs often rely on system-specific, parameter-dependent functions that are based on shared control metrics such as trust, workload, or attention. Meanwhile, Control Barrier Functions (CBFs) enforce safety constraints on automated systems but are typically limited to safeguarding plant states. This work introduces a novel arbitration method based on Control Barrier Functions (CBFs), where shared control metrics such as workload, attention, and trust are expressed as real-time inequality constraints. The resulting quadratic-programming formulation determines the automation assistance input that enforces these constraints while preserving feasibility and safety. This CBF-based arbitration provides a systematic, interpretable, and scalable foundation for safe human–autonomy integration.

16:45-17:00, Paper FrC04.6	Add to My Program
Shared Autonomy under Human Performance Uncertainty Via Bayesian Learning and Nested Control Barrier Functions

Li, Xiao	University of Michigan, Ann Arbor
Talbot, John	Toyota Research Institute
Dallas, James	Toyota Research Institute
Subosits, John	Stanford University: Dynamic Design Lab
Keywords: Autonomous systems, Constrained control, Human-in-the-loop control Abstract: Shared autonomy envisions seamless machine assistance to human drivers, enhancing both driving safety and personalization of the driving experience. In this paper, we propose a shared control algorithm for a lane-keeping application that formally guarantees the vehicle remains within a predefined track while minimizing intrusiveness. This is accomplished through the development of a family of Control Barrier Functions that construct a nested sequence of safe subsets, each representing a different level of conservativeness. The level of conservativeness is then adapted online to individual drivers based on their inferred, uncertain driving performance, using a Bayesian filter. We employ a gradient-based algorithm to automatically optimize the hyperparameters, thereby minimizing the intrusiveness of each Control Barrier Functions in parallel. The proposed approach achieves an average computation time of 0.0047 seconds, and its effectiveness is validated through comprehensive simulation studies and real-world hardware experiments with a human driver in the loop.


FrC06 Regular Session, Grand Salon 7	Add to My Program
Network Analysis and Control

Chair: Liu, Ji	Stony Brook University
Co-Chair: Gamarra, Marco	Air Force Research Laboratory

15:30-15:45, Paper FrC06.1	Add to My Program
A 3-State Clock Machine for Combinatorial Optimization

Cheng, Yi	University of Virginia
Lin, Zongli	University of Virginia
Keywords: Optimization algorithms, Network analysis and control Abstract: Many combinatorial optimization problems (COPs) are NP-hard and, in general, hard to solve. Over the past years, many unconventional computing paradigms have been proposed for solving such problems. Notably, Ising machines are one such paradigm for solving COPs with binary variables, which have attracted sustained attention in recent years. Specifically, Ising machines are a class of Langevin models for sampling low-energy configurations of the Ising model. Since many COPs such as the max-cut problem can be directly mapped to the Ising model, Ising machines solve the COPs by finding or sampling lowest-energy configurations (ground states) of the Ising model. Although Ising machines have shown great potential, they remain somewhat inadequate in addressing multi-state COPs. Here, inspired by the 3-state clock model, a generalization of the Ising model to a multi-state model, we introduce a nonlinear dynamical model, referred to as 3-state clock machine, for solving max-3-cut problems.

15:45-16:00, Paper FrC06.2	Add to My Program
Multiagent Social Influence: Modeling Persuasion in Contested Social Networks

Tumu, Renukanandan	University of Pennsylvania
Vasile, Cristian Ioan	Lehigh University
Preciado, Victor M.	University of Pennsylvania
Mangharam, Rahul	University of Pennsylvania
Keywords: Network analysis and control, Control of networks, Large-scale systems Abstract: We present the Social Influence Game (SIG), a framework for modeling adversarial persuasion in social networks with an arbitrary number of competing players. Our goal is to provide a tractable and interpretable model of contested influence that scales to large systems while capturing the structural leverage points of networks. Each player allocates influence from a fixed budget to steer opinions that evolve under DeGroot dynamics, and we prove that the resulting best-response optimization problem is a difference-of-convex program. To enable scalability, we develop an Iterated Linear (IL) solver that approximates player objectives with linear programs. In experiments on random and archetypical networks, IL achieves solutions within 7% of nonlinear solvers while being over 10× faster, scaling to large social networks. This paper lays a foundation for asymptotic analysis of contested influence in complex networks.

16:00-16:15, Paper FrC06.3	Add to My Program
Toward Extremal Graphs for Maximum Total Effective Resistance under Vertex and Edge Constraints

Lu, Susie	MIT
Gamarra, Marco	Air Force Research Laboratory
Liu, Ji	Stony Brook University
Keywords: Network analysis and control, Cooperative control Abstract: This paper studies an algorithm that generates generalized lollipop graphs for any feasible pair of numbers of vertices and edges. An explicit expression for the total effective resistance of these graphs is derived, and it is shown that this value achieves or approximates the maximal possible total effective resistance among all connected graphs with the same vertex and edge counts. In particular, the gap between the total effective resistance of the constructed generalized lollipop graphs and the extremal value converges to zero as the graph size tends to infinity.

16:15-16:30, Paper FrC06.4	Add to My Program
Distributed Online Submodular Maximization under Communication Delays: A Simultaneous Decision-Making Approach

Xu, Zirui	University of Michigan
Tzoumas, Vasileios	University of Michigan, Ann Arbor
Keywords: Network analysis and control, Optimization, Sensor networks Abstract: We provide a distributed online algorithm for multi-agent submodular maximization under communication delays. We are motivated by the future distributed information-gathering tasks in unknown and dynamic environments, where utility functions naturally exhibit the diminishing-returns property, i.e., submodularity. Existing approaches for online submodular maximization either rely on sequential multi-hop communication, resulting in prohibitive delays and restrictive connectivity assumptions, or restrict each agent’s coordination to its one-hop neighborhood only, thereby limiting the coordination performance. To address the issue, we provide the Distributed Online Greedy (DOG) algorithm, which integrates tools from adversarial bandit learning with delayed feedback to enable simultaneous decision-making across arbitrary network topologies. We provide the approximation performance of DOG against an optimal solution, capturing the suboptimality cost due to decentralization as a function of the network structure. Our analyses further reveal a trade-off between coordination performance and convergence time, determined by the magnitude of communication delays. By this trade-off, DOG spans the spectrum between the state-of-the-art fully centralized online coordination approach [1] and fully decentralized one-hop coordination approach [2].

16:30-16:45, Paper FrC06.5	Add to My Program
Opinion Clustering under the Friedkin-Johnsen Model: Agreement in Disagreement

Shrinate, Aashi	IIT Kanpur
Tripathy, Twinkle	IIT Kanpur
Keywords: Network analysis and control Abstract: The convergence of opinions in the Friedkin-Johnsen (FJ) framework is well studied, but the topological conditions leading to opinion clustering remain less explored. To bridge this gap, we examine the role of topology in the emergence of opinion clusters within the network. The key contribution of the paper lies in the introduction of the notion of topologically prominent agents, referred to as Locally Topologically Persuasive (LTP) agents. Interestingly, each LTP agent is associated with a unique set of (non-influential) agents in its vicinity. Using them, we present conditions to obtain opinion clusters in the FJ framework in any arbitrarily connected digraph. A key advantage of the proposed result is that the resulting opinion clusters are independent of the edge weights and the stubbornness of the agents. Finally, we demonstrate using simulation results that, by suitably placing LTP agents, one can design networks that achieve any desired opinion clustering.

16:45-17:00, Paper FrC06.6	Add to My Program
Multi-Agent Stage-Wise Conservative Linear Bandits

Afsharrad, Amirhossein	Stanford University
Moradipari, Ahmadreza	University of California Santa Barbara
Lall, Sanjay	Stanford University
Keywords: Machine learning, Network analysis and control, Reinforcement learning Abstract: In many real-world applications such as recommendation systems, multiple learning agents must balance exploration and exploitation while maintaining safety guarantees to avoid catastrophic failures. We study the stochastic linear bandit problem in a multi-agent networked setting where agents must satisfy stage-wise conservative constraints. A network of N agents collaboratively maximizes cumulative reward while ensuring that the expected reward at every round is no less than (1-alpha) times that of a baseline policy. Each agent observes local rewards with unknown parameters, but the network optimizes for the global parameter (average of local parameters). Agents communicate only with immediate neighbors, and each communication round incurs additional regret. We propose MA-SCLUCB (Multi-Agent Stage-wise Conservative Linear UCB), an episodic algorithm alternating between action selection and consensus-building phases. We prove that MA-SCLUCB achieves regret tilde{O}left(frac{d}{sqrt{N}}sqrt{T}cdotfrac{log(N T)}{sqrt{log(1/\|lambda_2\|)}}right) with high probability, where d is the dimension, T is the horizon, and \|lambda_2\| is the network's second largest eigenvalue magnitude. Our analysis shows: (i) collaboration yields frac{1}{sqrt{N}} improvement despite local communication, (ii) communication overhead grows only logarithmically for well-connected networks, and (iii) stage-wise safety adds only lower-order regret. Thus, distributed learning with safety guarantees achieves near-optimal performance in reasonably connected networks.


FrC07 Regular Session, Grand Salon 9	Add to My Program
Traffic Control

Chair: Malikopoulos, Andreas A.	Cornell University
Co-Chair: Scruggs, Jeff	University of Michigan

15:30-15:45, Paper FrC07.1	Add to My Program
Traffic Density Control Via Filtered Feedback Linearization in Homogeneous Highway Corridors with Input Constraints

Rahmanidehkordi, Arash	University of North Carolina at Charlotte
Ghasemi, Amirhossein	University of North Carolina Charlotte
Keywords: Traffic control, Constrained control, Feedback linearization Abstract: This paper presents a novel control framework that integrates Filtered Feedback Linearization (FFL) with Model Predictive Control (MPC) to manage traffic flow in a homogeneous highway network. FFL is a high-gain robust control method that relies on minimal knowledge of the traffic model and can effectively reject unknown and unmeasured disturbances; however, it does not inherently handle actuator constraints, which may lead to control saturation. Conversely, MPC effectively enforces constraints and optimizes control actions, but its performance depends on model accuracy and it lacks inherent robustness to disturbances. To bridge these limitations, we propose an FFL-MPC framework in which FFL ensures disturbance rejection while MPC regulates virtual control inputs within feasible limits. A key contribution of this work is a novel constraint mapping algorithm that dynamically transforms actuator constraints into the virtual control domain, preserving compatibility with the low-pass filtering dynamics of FFL. The METANET model is employed to model the traffic dynamics. The proposed framework is validated through numerical simulations of highway traffic, demonstrating its effectiveness in alleviating congestion, enforcing control limits, and preserving robust performance under model uncertainty and external disturbances.

15:45-16:00, Paper FrC07.2	Add to My Program
Spatiotemporal Forecasting of Incidents and Congestion with Implications for Sustainable Traffic Control (I)

Kinchen, Tony	Cornell Univerisity
Bai, Ting	Cornell University
Senthil Kumar, Nishanth Venkatesh	Cornell University
Malikopoulos, Andreas A.	Cornell University
Keywords: Simulation, Modeling, Estimation Abstract: Urban traffic anomalies, such as collisions and disruptions, threaten the safety, efficiency, and sustainability of transportation systems. In this paper, we present a simulation-based framework for modeling, detecting, and predicting such anomalies in urban networks. Using the Simulation of Urban MObility (SUMO) platform, we generate reproducible rear-end and intersection crash scenarios with matched baselines, enabling controlled experimentation and comparative evaluation. We record vehicle-level travel time, speed, and emissions for both edge- and network-level analysis. Building on this dataset, we develop a hybrid forecasting architecture that combines bidirectional long short-term memory networks with a diffusion convolutional recurrent neural network to capture temporal dynamics and spatial dependencies. Our simulation studies on the Broadway corridor in New York City demonstrate the framework's ability to reproduce consistent incident conditions, quantify their effects, and provide accurate multi-horizon traffic forecasts. Our results highlight the value of combining controlled anomaly generation with deep predictive models to support reproducible evaluation and sustainable traffic management.

16:00-16:15, Paper FrC07.3	Add to My Program
Route Recommendations for Traffic Management under Learned Partial Driver Compliance

Bang, Heeseung	Cornell University
Cho, Jung-Hoon	MIT
Wu, Cathy	UC Berkeley
Malikopoulos, Andreas A.	Cornell University
Keywords: Traffic control, Multivehicle systems, Transportation networks Abstract: In this paper, we aim to mitigate congestion in traffic management systems by guiding travelers along system-optimal (SO) routes. However, we recognize that most theoretical approaches assume perfect driver compliance, which often does not reflect reality, as drivers tend to deviate from recommendations to fulfill their personal objectives. Therefore, we propose a route recommendation framework that explicitly learns partial driver compliance and optimizes traffic flow under realistic adherence. We first compute an SO edge flow through flow optimization techniques. Next, we train a compliance model based on historical driver decisions to capture individual responses to our recommendations. Finally, we formulate a stochastic optimization problem that minimizes the gap between the target SO flow and the realized flow under conditions of imperfect adherence. Our simulations conducted on a grid network reveal that our approach significantly reduces travel time compared to baseline strategies, demonstrating the practical advantage of incorporating learned compliance into traffic management.

16:15-16:30, Paper FrC07.4	Add to My Program
Mesoscopic Digital Control for Disturbance String Stability Via Constant Time-Headway Spacing Policies

Bonsanto, Pietro	Università Degli Studi Dell'Aquila
Mattioni, Mattia	Università Degli Studi Di Roma La Sapienza
Iovine, Alessio	CNRS
De Santis, Elena	University of L'Aquila
Di Benedetto, Maria Domenica	University of L'Aquila
Keywords: Traffic control, Sampled-data control, Autonomous vehicles Abstract: This paper addresses the enforcement of Disturbance String Stability (DSS) in connected autonomous vehicle platoons subject to asynchronous communication, sampling, and quantization in case of a constant time headway spacing policy. Building on a mesoscopic control framework, we design digital controllers that guarantee DSS despite external disturbances and device limitations. Simulation results demonstrate the effectiveness of the proposed approach.

16:30-16:45, Paper FrC07.5	Add to My Program
Nonlinear Consensus for Traffic Assignment in Multi-Route Networks

Adjei, Emmanuel	University of California Irvine
Butler, Brooks A.	University of California, Irvine
Jin, Wen-Long	University of California, Irvine
Egerstedt, Magnus	University of North Carolina, Chapel Hill
Keywords: Transportation networks, Traffic control, Emerging control applications Abstract: This paper presents a framework connecting nonlinear consensus dynamics with traffic assignment problems in multi-route transportation networks. We define a model of route-choice behavior in which flow evolution is governed by first-in-first-out dynamics, with route costs combining baseline and congestion terms. This formulation yields Wardrop equilibria as consensus states, where all used routes have equalized costs. The analysis establishes key system properties, including flow conservation, invariance of unused routes, and uniqueness of the symmetric equilibrium. Stability is demonstrated through monotonic convergence, while case studies of balanced, congested, and asymmetric networks illustrate flow reallocation and steady-state concentration on optimal routes. This framework provides a foundation for decentralized route selection in intelligent transportation systems and enables control strategies that exploit consensus-like rerouting in urban traffic.

16:45-17:00, Paper FrC07.6	Add to My Program
Discrete-Time Pricing for Nash Stabilization in Routing Games

Lee, Richard	University of Michigan
Scruggs, Jeff	University of Michigan
Yin, Yafeng	University of Michigan
Keywords: Transportation networks, Traffic control, Lyapunov methods Abstract: This work presents a dynamic pricing scheme which guarantees global asymptotic stability of the Nash equilibrium set for a discrete-time routing game. We adopt a game theoretic approach in which each population consists of commuters between a particular origin-destination pair. Populations revise their strategies according to payoff discrepancies between routes, where the strategy revision process is modeled by the class of impartial pairwise comparison (IPC) evolutionary dynamic models (EDMs). We show that the EDM satisfies a discrete-time dissipativity property and leverage this insight to design a dynamic payoff mechanism acting as a supplemental toll imposed on each population. By introducing an augmented storage function which governs the toll dynamics, we ensure the control input (i.e., toll) asymptotically vanishes at the Nash equilibrium set, thus guaranteeing convergence of the closed-loop system. We validate our findings with numerical experiments.


FrC08 Regular Session, Grand Salon 10-13	Add to My Program
Multi-Vehicle Systems

Chair: Malikopoulos, Andreas A.	Cornell University
Co-Chair: Alexandridis, Alex	University of West Attica

15:30-15:45, Paper FrC08.1	Add to My Program
R3R: Decentralized Multi-Agent Collision Avoidance with Infinite-Horizon Safety

Vielmetti, Thomas Marshall	University of Michigan, Ann Arbor
Agrawal, Devansh Ramgopal	University of Michigan, Ann Arbor
Panagou, Dimitra	University of Michigan, Ann Arbor
Keywords: Decentralized control, Multivehicle systems, Autonomous robots Abstract: Existing decentralized methods for multi-agent motion planning lack formal, infinite-horizon safety guarantees, especially for communication-constrained systems. We present R3R which, to our knowledge, is the first decentralized and asynchronous framework for multi-agent motion planning under range-limited communication constraints with infinite-horizon safety guarantees for systems of nonlinear agents. R3R's novelty lies in combining our gatekeeper safety framework with a geometric constraint termed R-Boundedness, which together establish a formal link between an agent's communication radius and its ability to plan safely. We constrain trajectories to lie within a fixed planning radius, determined by a function of the agent's communication radius. This enables trajectories to be certified as provably safe for all time using only local information. Our algorithm is fully asynchronous, and ensures the forward invariance of these guarantees even in time-varying networks where agents asynchronously join and replan. We evaluate our approach in simulations of up to 128 Dubins vehicles, validating our theoretical safety guarantees in dense, obstacle-rich scenarios. We further show that R3R's computational complexity scales with local agent density rather than problem size, providing a practical solution for scalable and provably safe multi-agent systems.

15:45-16:00, Paper FrC08.2	Add to My Program
Stability Analysis of Vehicle Platooning with PID Control Via Dominant Pole Placement

Choudhary, Isha	Indian Institute of Technology Mandi
Dixit, Shilp	Vay Technology Inc., Las Vegas, USA
Halder, Kaushik	Indian Institute of Technology Mandi
Keywords: Multivehicle systems, Cooperative control, PID control Abstract: This paper proposes a novel distributed proportional-integral-derivative (PID) controller design using the dominant pole-placement method for homogeneous vehicle platoon systems with directed and undirected topologies under the constant time headway policy (CTHP). Analytical expressions for PID gains are derived using a coefficient matching approach, considering all real non-dominant poles. String stability criteria are derived for both the directed predecessor-following (PF) and undirected bidirectional (BD) topologies. A genetic algorithm (GA) is employed to optimize three key elements: (i) closed-loop design parameters, (ii) PID controller gains, and (iii) time headway. The algorithm searches the stabilizable region of the controller parameter space to satisfy derived PID analytical expressions and string stability criteria while minimizing the integral square error (ISE). The proposed PID controller ensures both internal and string stability. The effectiveness of the design is validated through numerical simulations on a five-vehicle platoon with PF and BD topologies under CTHP.

16:00-16:15, Paper FrC08.3	Add to My Program
A Lyapunov-Based Formation Control Framework for Multi-Agent Systems in Space-Restricted and Obstacle-Populated Environments

Protoulis, Teo	University of West Attica
Alexandridis, Alex	University of West Attica
Keywords: Multivehicle systems, Lyapunov methods, Autonomous systems Abstract: In this paper, a Lyapunov-based framework for flexible, distance-based, and leaderless formation control of multi-agent systems (MAS) in space-restricted and obstacle-populated environments is proposed. A control Lyapunov function methodology that considers a generic dynamic model of autonomous vehicles is introduced to derive decentralized control laws that are computed locally by each agent and guarantee a number of crucial properties. Among others, the feedback laws ensure that the swarm satisfies strict prespecified formation bounds, while guaranteeing inter-agent collision avoidance, convergence of the vehicles’ velocities to the desired values and adherence to hard upper and lower bounds, and satisfaction of spatial restrictions imposed by the navigation environment. Simulation results on a swarm of fixed-wing unmanned aerial vehicles validate the theoretical findings.

16:15-16:30, Paper FrC08.4	Add to My Program
Target Tracking with Field-Of-View Constraints Using Sentry-Type Robots with Limited Observations

O'Brien, Richard	United States Naval Academy
Dawkins, Jeremy	United States Naval Academy
Galloway, Kevin	United States Naval Academy
Kutzer, Michael	United States Naval Academy
Keywords: Sensor fusion, Multivehicle systems, Kalman filtering Abstract: A multi-robot, target tracking algorithm is proposed that improves tracking accuracy by imposing field-of-view (FOV) constraints based on each robot’s sensor characteristics. The algorithm is independent of the underlying tracking scheme and is evaluated using experimental data collected from sentry-type robots whose trajectories are independent of positive detections of the target. These data are processed using an unscented Kalman filter (UKF) with and without the FOV constraint. The UKF with FOV constraint delivers significantly less tracking error and recovers much more quickly than the standard UKF in periods where the target is unobserved.

16:30-16:45, Paper FrC08.5	Add to My Program
Combining Cooperative Re-Routing with Intersection Coordination for Connected and Automated Vehicles in Urban Networks

Typaldos, Panagiotis	Cornell University
Malikopoulos, Andreas A.	Cornell University
Keywords: Cooperative control, Traffic control, Simulation Abstract: In this paper, we present a hierarchical framework integrating upper-level routing with low-level optimal trajectory planning for connected and automated vehicles (CAVs) in urban networks. The upper-level controller distributes traffic flows using a dynamic re-routing algorithm that leverages real-time density information and fundamental diagrams to predict when edges reach critical density, proactively adjusting routing weights to prevent congestion. The low-level controller coordinates CAVs at signal-free intersections, generating optimal, fuel-efficient trajectories while ensuring safety through constraint satisfaction. We formulate this as an optimal control problem with an analytical solution. SUMO micro-simulation experiments on a realistic network demonstrate that our framework significantly outperforms baseline static routing, achieving 55.8% reduction in delays and 12.1% reduction in travel times, showing substantial improvements in urban traffic efficiency

16:45-17:00, Paper FrC08.6	Add to My Program
A Coordinated Routing Approach for Enhancing Bus Timeliness and Travel Efficiency in Mixed-Traffic Environment

Liang, Tanlu	Cornell University
Bai, Ting	Cornell University
Malikopoulos, Andreas A.	Cornell University
Keywords: Transportation networks, Autonomous systems, Cooperative control Abstract: This paper proposes a coordinated routing approach that investigates the use of connected and automated vehicles (CAVs) in dedicated bus lanes. The aim is to improve bus schedule adherence while enhancing the travel efficiency of CAVs during the transitional phase of mixed traffic environments. Our approach utilizes real-time traffic data to dynamically reroute CAVs in anticipation of congestion. By continuously monitoring traffic conditions on dedicated lanes and tracking the real-time positions of buses, the system adjusts CAV routes in advance to avoid potential interference with operating buses. This cooperation reduces CAV travel times and minimizes delays that impact transit services. The proposed strategy is validated using microscopic traffic simulations in SUMO. The results demonstrate significant improvements in both transit on-time performance and CAV travel efficiency across a range of traffic conditions.


FrC09 Regular Session, Grand Salon 12	Add to My Program
Energy Systems

Chair: Mukherjee, Sayak	Pacific Northwest National Laboratory
Co-Chair: Narimani, Mohammad Rasoul	California State University Northridge

15:30-15:45, Paper FrC09.1	Add to My Program
Control Affine Hybrid Power Plant Subsystem Modeling for Supervisory Control Design

Ampleman, Stephen	Johns Hopkins University
Sharma, Himanshu	Pacific Northwest National Laboratory
Mukherjee, Sayak	Pacific Northwest National Laboratory
Glavaski, Sonja	Pacific Northwest National Lab
Keywords: Energy systems, Constrained control, Supervisory control Abstract: Hybrid power plants (HPPs) combine multiple generation sources and energy storage to support generation shortfalls and grid-service demands. This paper introduces a modeling and control design framework for wind-solar-battery HPPs. Specifically, this work adapts established modeling paradigms for wind farms, solar plants and battery models into a supervisory-control-oriented form. In the case of wind and battery models, generator torque and cell current control laws are developed using nonlinear control and control barrier function techniques to track commands issued by a supervisory controller while maintaining safe and stable operation. The framework is demonstrated on a utility demand-tracking task under time-varying wind and irradiance using a rule-based supervisory control law.

15:45-16:00, Paper FrC09.2	Add to My Program
Certifying the Nonexistence of Feasible Paths between Power System Operating Points

Narimani, Mohammad Rasoul	California State University Northridge
Davis, Katherine	Texas A&M
Molzahn, Daniel	Georgia Institute of Technology
Keywords: Energy systems, Optimization algorithms, Power systems Abstract: By providing the optimal operating point that satisfies both the power flow equations and engineering limits, the optimal power flow (OPF) problem is central to the operation of electric power systems. While extensive research efforts have focused on reliably computing high-quality OPF solutions, assessing the feasibility of transitioning between operating points remains challenging since the feasible spaces of OPF problems may consist of multiple disconnected components. It is not possible to transition between operating points in different disconnected components without violating OPF constraints. To identify such situations, this paper introduces an algorithm for certifying the infeasibility of transitioning between two operating points within an OPF feasible space. As an indication of potential disconnectedness, the algorithm first seeks an infeasible point on the line connecting a pair of feasible points. The algorithm then certifies disconnectedness by using convex relaxation and bound tightening techniques to show that all points on the plane that is normal to this line are infeasible. Using this algorithm, we provide the first certifications of disconnected feasible spaces for a variety of OPF test cases.

16:00-16:15, Paper FrC09.3	Add to My Program
Control Co-Design for Maximizing the Stochastic Endurance of a Hybrid Electric Tiltrotor Drone

Haddad, Noushin	University of Maryland, College Park
Fathy, Hosam K.	University of Maryland
Keywords: Energy systems, Power systems, Optimization Abstract: This paper examines the problem of maximizing the endurance of an unmanned aerial vehicle. The paper focuses on a tiltrotor vehicle configuration. This configuration has the advantage of combining vertical takeoff/landing with more efficient horizontal flight. However, this comes at the disadvantage of a substantial power demand discrepancy between takeoff/landing versus cruise. This discrepancy motivates powertrain hybridization. Specifically, the paper examines a parallel hybrid powertrain that combines the energy and power density advantages of internal combustion and battery-electric propulsion, respectively, using a mix of lithium-ion and lithium-sulfur batteries. We define stochastic endurance in terms of the ability to execute a Monte Carlo-simulated family of flight missions without power shortages. We then pose a control co design problem, where the objective is endurance maximization and the optimization variables are the sizing of the powertrain components and the parameterization of an onboard power management policy. This policy uses Gaussian kernels to compute the power split ratio between the powertrain subsystems based on total power demand and the “states of charge” of the onboard energy stores. A simulation study shows that this approach leads to attractive endurance levels and supports mission planning by quantifying the impact of initial fuel and payload allocations on endurance.

16:15-16:30, Paper FrC09.4	Add to My Program
Small HVAC Control Demonstrations in Larger Buildings Often Overestimate Savings

J. Khabbazi, Arash	Purdue University
J. Kircher, Kevin	Purdue University
Keywords: Building and facility automation, Energy systems, Linear systems Abstract: How much energy, money, and emissions can advanced control of heating and cooling equipment save in real buildings? To address this question, researchers sometimes control a small number of thermal zones within a larger multi-zone building, then report savings for the controlled zones only. That approach can overestimate savings by neglecting heat transfer between controlled zones and adjacent zones. This paper mathematically characterizes the overestimation error when the dynamics are linear and the objectives are linear in the thermal load, as usually holds when optimizing energy efficiency, energy costs, or emissions. Overestimation errors can be large even in seemingly innocuous situations. For example, when controlling only interior zones that have no direct thermal contact with the outdoors, all perceived savings are fictitious. This paper provides an alternative estimation method based on the controlled and adjacent zones' temperature measurements. The new method does not require estimating how much energy the building would have used under baseline operations, so it removes the additional measurement and verification challenge of accurate baseline estimation.

16:30-16:45, Paper FrC09.5	Add to My Program
Risk-Sensitive Model Predictive Control for Grid Services through Distributed Energy Resources: A Conditional Value-At-Risk Approach

Khalil, Ahmed	The University of Texas at Austin
Sharma, Himanshu	Pacific Northwest National Laboratory
Wang, Wei	Pacific Northwest National Laboratory
Ramachandran, Thiagarajan	Pacific Northwest National Laboratory
Keywords: Energy systems, Uncertain systems, Optimal control Abstract: Large aggregations of distributed energy resources (DERs) are increasingly used to provide grid services, such as ramping and regulation, yet their integration is challenged by volatile net demand and stringent real-time safety requirements. This work proposes a risk-sensitive closed-loop model predictive control (MPC) framework to optimally allocate heterogeneous DERs while mitigating the impact of rare but severe demand excursions, such as sudden load spikes of large electric loads. The MPC approach leverages the fact that short-term load forecasts are more accurate, enabling each step in the rolling optimization sequence to use refined predictions while ensuring constraint satisfaction. To achieve this, the closed-loop model-predictive controller is parametrized using the System-Level Synthesis (SLS) framework, and risk aversion is achieved by embedding a Conditional Value-at-Risk (CVaR) constraint that explicitly limits the expected impact of extreme forecast errors. Numerical simulations on demand-forecast datasets demonstrate the proposed approach's ability to reduce the power required from bulk generation in real time and to account for forecast demand uncertainty.

16:45-17:00, Paper FrC09.6	Add to My Program
Coordinated Control of Active and Passive Elements for Energy-Efficient Building Temperature Regulation

Ishtiaq, Fardin	Rensselaer Polytechnic Institute
Rempel, Alexandra	University of Oregon
Kar, Koushik	Rensselaer Polytechnic Institute
Mishra, Sandipan	Rensselaer Polytechnic Institute
Keywords: Building and facility automation, Energy systems, Mechatronics Abstract: Space conditioning accounts for nearly half of building energy use and associated CO_2 emissions. Conventional indoor environmental control strategies rely on heating, ventilation, and air conditioning systems to regulate indoor temperature reliably, but they often do not exploit passive strategies such as shading and natural ventilation, which limits potential energy savings. Purely passive mechanisms, on the other hand, cannot maintain comfort across all climates and seasons. This paper develops a coordinated strategy for the joint control of active heating and cooling systems and operable passive elements. The proposed strategy combines decentralized PI control of active heating and cooling, with reinforcement learning (RL) agents that actuate operable passive elements such as window openings. The control objective is to minimize energy consumption while keeping zone temperatures within a prescribed band. Because the active and passive inputs are coupled through thermal dynamics, analytical conditions for Lyapunov stability and ultimate boundedness are derived for this joint control strategy. The portability of the proposed approach was demonstrated by training the RL agent on a single-zone building and deploying it on a multi-zone building without retraining. EnergyPlus simulations of a multi-zone dwelling in three cities and two seasons show reductions of 40--74% in degree-hour discomfort and 54--86% in active energy loads relative to a baseline PI controller with rule-based logic for passive elements. The coordinated controller kept the indoor temperature within 0.5^circC below and 1.4^circC above the comfort band. These results highlight the potential of coordinated active and passive strategies to deliver substantial energy savings while maintaining occupant comfort.


FrC10 Regular Session, Grand Salon 15	Add to My Program
Process Control

Chair: Pourkargar, Davood	Kansas State University
Co-Chair: Mitrai, Ilias	The University of Texas at Austin

15:30-15:45, Paper FrC10.1	Add to My Program
A Distributed Machine Learning Approach for Cyberattack Detection in Integrated Process Systems

Bagheri, Amirsalar	Kansas State University
Ebrahimi, AmirMohammad	Kansas State University
Pourkargar, Davood	Kansas State University
Keywords: Chemical process control, Machine learning, Sensor networks Abstract: Integrated process systems enhance operational efficiency but increase the cyberattack surface due to dense interconnections and reliance on networked sensing. This study develops a community-aware distributed transformer framework that jointly performs attack detection and localization of compromised sensors. A spectral community detection algorithm is used to decompose the graph representation of the process into strongly interacting communities. Within each community, a transformer learns the dynamics of local measured outputs and identifies sensor-level attack likelihoods and a community-level detection flag. System-level decisions are then obtained by aggregating the outputs across communities. The framework is evaluated on an integrated benzene alkylation process, with training data generated from closed-loop simulations under model predictive control and moving horizon estimation across multiple adversarial scenarios, including single- and multi-sensor attacks. Compared with centralized end-to-end and two-tier transformer baselines, the distributed approach attains higher accuracy, F1 scores, and Jaccard indices, demonstrating improved robustness and more precise localization of compromised sensors.

15:45-16:00, Paper FrC10.2	Add to My Program
Discovering Interpretable Piecewise Nonlinear Model Predictive Control Laws Via Symbolic Decision Trees

Mitrai, Ilias	The University of Texas at Austin
Keywords: Chemical process control, Process Control, Machine learning Abstract: In this paper, we propose symbolic decision trees as surrogate models for approximating model predictive control laws. The proposed approach learns simultaneously the partition of the input domain (splitting logic) as well as local nonlinear expressions for predicting the control action leading to interpretable piecewise nonlinear control laws. The local nonlinear expressions are determined by the learning problem and are modeled using a set of basis functions. The learning task is posed as a mixed integer optimization problem, which is solved to global optimality with state-of-the-art global optimization solvers. We apply the proposed approach to a case study regarding the control of an isothermal reactor. The results show that the proposed approach can learn the control law accurately, leading to closed-loop performance comparable to that of a standard model predictive controller. Finally, comparison with existing interpretable models shows that the symbolic trees achieve both lower prediction error and superior closed-loop performance.

16:00-16:15, Paper FrC10.3	Add to My Program
Approximate Dynamic Optimization Via Deep Neural Operators

Nassaji, Amin	University of Minnesota
Mitrai, Ilias	The University of Texas at Austin
Daoutidis, Prodromos	Univ. of Minnesota
Keywords: Chemical process control, Process Control, Machine learning Abstract: This paper addresses the solution of nonlinear dynamic optimization problems that compute optimal manipu- lated input profiles to enforce desired output profiles. Such tra- jectory optimization problems commonly arise in chemical pro- cess applications, for example, batch processes where optimal temperature or feeding profiles (in case of fed-batch processes) are calculated to enforce time-varying product quality profiles, tightly controlling the reaction rate or rate of heat generation. We propose deep neural operators that approximate function to function mappings as surrogates for the solution of such dynamic optimization problems. We specifically employ deep operator networks (DeepONets) and Fourier-enhanced Deep- ONets in a batch polymerization reactor case study for which number-average and weight-average molecular weight profiles, together with a final conversion target, are enforced through an optimal temperature program. Our results show that the Fourier-enhanced DeepONet architecture performs very well in approximating the solution of the dynamic optimization problem for different instances, achieving a lower prediction error compared to the standard DeepONet architecture and standard feedforward neural networks.

16:15-16:30, Paper FrC10.4	Add to My Program
Cooling under Convexity: An Inventory Control Perspective on Industrial Refrigeration

Shah, Vade	University of California, Santa Barbara
John, Yohan	University of California, Santa Barbara
Freifeld, Ethan	University of California, Santa Barbara
Chen, Lily Yuxuan	University of California Santa Barbara
Marden, Jason R.	University of California, Santa Barbara
Keywords: Process Control, Optimization, Optimal control Abstract: Industrial refrigeration systems have tremendous energy needs, but optimizing their operation remains challenging due to the tension between minimizing energy costs and meeting strict cooling requirements. Load shifting--strategic overcooling in anticipation of future demands--offers substantial efficiency gains. This work seeks to rigorously quantify these potential savings through the derivation of optimal load shifting policies. Our first contribution establishes a novel connection between industrial refrigeration and inventory control problems with convex ordering costs, where the convexity arises from the fundamental relationship between energy consumption and cooling capacity. Leveraging this formulation, we derive three main theoretical results: (1) an optimal algorithm for deterministic demand scenarios, with proof that optimal trajectories are non-increasing which is a valuable structural insight for practical control; (2) performance bounds that quantify the value of load shifting as a function of cost convexity, demand variability, and temporal patterns; (3) a computationally tractable load shifting heuristic with provable near-optimal performance under uncertainty. Numerical simulations validate our theoretical findings, and a case study using real industrial refrigeration data demonstrates an opportunity for improved load shifting.

16:30-16:45, Paper FrC10.5	Add to My Program
Real-Time Classification of Industrial Alarm Floods Using Context-Enriched Embeddings

Mohan Rao, Harikrishna Rao	University of Alberta
Tamascelli, Nicola	ABB AG
Chen, Tongwen	University of Alberta
Keywords: Chemical process control, Process Control, Pattern recognition and classification Abstract: Alarm floods are critical operational challenges in process industries, overwhelming operators and increasing the risk of missed or delayed responses. This paper presents a context-enriched Word2Vec framework for real-time alarm flood classification, designed to capture temporal spacing, activation order, and alarm priority within an embedding space. By extending the generic Skip-gram model with enriched tokenization and priority-weighted training, the method addresses key challenges such as variable sequence lengths, overlapping alarms across categories, and imbalance between high- and low-priority events. The framework is evaluated on the Tennessee Eastman benchmark, where it achieved high classification accuracy, with early divergence of similarity trajectories enabling timely identification of fault categories. Compared to a generic Word2Vec baseline, the enriched embeddings consistently improved both early-stage and overall classification performance while maintaining computational efficiency suitable for online applications. These results highlight the potential of computationally efficient natural language processing (NLP)-inspired approaches to support operators with faster and more reliable decision-making during industrial alarm floods.

16:45-17:00, Paper FrC10.6	Add to My Program
Multi-Agent Reinforcement Learning Based Adaptive PI Controller for Multivariable Process Control Applications

Yadav, Sourabh	Indian Institute of Technology Kanpur
Detroja, Ketan P.	Indian Institute of Technology Hyderabad
Keywords: PID control, Reinforcement learning, Decentralized control Abstract: Designing controllers for industrial processes is a daunting task, particularly because most industrial processes are multi-input multi-output (MIMO) in nature. This introduces coupling effects and loop interactions, which are not encountered in the case of single-input single-output (SISO) processes. In this manuscript, a novel multi-agent reinforcement learning-based approach for designing adaptive PI controllers is proposed. The Smith predictor is incorporated to compensate for the deadtime in the diagonal elements of the process transfer function. The proposed method employs the Deep Deterministic Policy Gradient (DDPG) algorithm with Long-Short-Term Memory (LSTM) based actor-critic network architecture to design adaptive PI controllers for a MIMO process. The performance of the proposed controller is measured against some of the existing methods for designing classical PI controllers, using performance indices such as Integral Absolute Error (IAE) and Integral Square Error (ISE). The proposed controller demonstrates superior performance compared to classical state-of-the-art PI controllers. The proposed controller is performing well even in the presence of plant-model mismatches, delay uncertainties, and disturbances, demonstrating robust performance.


FrC11 Regular Session, Grand Salon 16	Add to My Program
Robotics III

Chair: Chou, Glen	Georgia Institute of Technology
Co-Chair: Chhabra, Robin	Toronto Metropolitan University

15:30-15:45, Paper FrC11.1	Add to My Program
Shared Object Manipulation with a Team of Collaborative Quadrupeds

Wang, Shengzhi	The Chinese University of Hong Kong
Dehio, Niels	Technische Universität Braunschweig
Zeng, Xuanqi	Chinses University of Hong Kong
Yang, Xian	The Chinese University of Hong Kong
Zhang, Lingwei	CUHK
Liu, Yun Hui	The Chinese Univ. of Hong Kong
Au, Kwok Wai Samuel	CUHK
Keywords: Robotics, Control applications, Cooperative control Abstract: Utilizing teams of multiple robots is advantageous for handling bulky objects. Many related works focus on multi-manipulator systems, which are limited by workspace constraints. In this paper, we extend a classical hybrid motion-force controller to a team of legged manipulator systems, enabling collaborative loco-manipulation of rigid objects with a force-closed grasp. Our novel approach allows the robots to flexibly coordinate their movements, achieving efficient and stable object co-manipulation and transport, validated through extensive simulations and real-world experiments.

15:45-16:00, Paper FrC11.2	Add to My Program
Learning Constraints from Stochastic Partially-Observed Closed-Loop Demonstrations

Chiu, Chih-Yuan	Georgia Institute of Technology
Zhang, Zhouyu	Georgia Institute of Technology
Chou, Glen	Georgia Institute of Technology
Keywords: Robotics, Optimal control, Autonomous systems Abstract: We present a method for learning unknown parametric constraints from locally-optimal input-output trajectory data. We assume the data is generated by rollouts of stochastic nonlinear dynamics, under a single state or output feedback law and initial condition but distinct noise realizations, to robustly satisfy underlying constraints despite worst-case noise outcomes. We encode the Karush-Kuhn-Tucker (KKT) conditions of this robust optimal feedback control problem within a feasibility problem to recover constraints consistent with the local optimality of the demonstrations. We prove that our constraint learning method (i) accurately recovers the demonstrator’s policy, and (ii) conservatively estimates the set of policies that ensure constraint satisfaction despite worst-case noise realizations. Moreover, we perform sensitivity analysis, proving that when demonstrations are corrupted by transmission error, the inaccuracy in the learned feedback law scales linearly in the error magnitude. Empirically, our method accurately recovers unknown constraints from simulated noisy, closed-loop demonstrations generated using dynamics, both linear and nonlinear, (e.g., unicycle and quadrotor) and a range of feedback mechanisms.

16:00-16:15, Paper FrC11.3	Add to My Program
Safe Obstacle-Free Guidance of Space Manipulators in Debris Removal Missions Via Deep Reinforcement Learning

Lam, Vincent	Toronto Metropolitan University
Chhabra, Robin	Toronto Metropolitan University
Keywords: Robotics, Reinforcement learning, Autonomous robots Abstract: The objective of this study is to develop a model-free taskspace trajectory planner for space manipulators that operates in real time, using a Twin Delayed Deep Deterministic Policy Gradient (TD3) agent to enable safe and reliable debris capture. A local control strategy with singularity avoidance and manipulability enhancement is employed to ensure stable execution. The manipulator must simultaneously track a capture point on a non-cooperative target, avoid self-collisions, and prevent unintended contact with the target. To address these challenges, we propose a curriculum-based multi-critic network where one critic emphasizes accurate tracking and the other enforces collision avoidance. A prioritized experience replay buffer is also used to accelerate convergence and improve policy robustness. The framework is evaluated on a simulated seven-degree-of-freedom KUKA LBR iiwa mounted on a free-floating base in Matlab/Simulink, demonstrating safe and adaptive trajectory generation for debris removal missions.

16:15-16:30, Paper FrC11.4	Add to My Program
Torques-To-Pixels: Verification of Visual Controllers

Estornell, Alexander	Northeastern University
Jung, Leonard	Northeastern University
Everett, Michael	Northeastern University
Keywords: Vision-based control, Robust control Abstract: Perception-based neural network controllers are increasingly used in autonomous systems that rely on visual inputs to operate in the real world. Ensuring the safety of such systems under uncertainty is challenging. Existing verification techniques typically focus on Lp-bounded perturbations in the pixel space, which fails to capture the low-dimensional structure of many real-world effects. In this work, we introduce a novel verification framework for perception-based controllers that can generate outer-approximations of reachable sets through explicitly modeling uncertain observations with geometric perturbations. Our approach constructs a boundable mapping from states to images, enabling the use of state-based verification tools while accounting for uncertainty in perception. We provide theoretical guarantees on the soundness of our method and demonstrate its effectiveness across benchmark control environments. This work provides a principled framework for certifying the safety of perception-driven control systems under realistic visual perturbations.

16:30-16:45, Paper FrC11.5	Add to My Program
Gaussian Mixture-Based Inverse Perception Contract for Uncertainty-Aware Robot Navigation

Du, Bingyao	Columbia University
Kim, Joonkyung	Texas A&M University
Lyu, Yiwei	Texas A&M University
Keywords: Robotics, Vision-based control Abstract: Reliable navigation in cluttered environments requires perception outputs that are not only accurate but also equipped with uncertainty sets suitable for safe control. An inverse perception contract (IPC) provides such a connection by mapping perceptual estimates to sets that contain the ground truth with high confidence. Existing IPC formulations, however, instantiate uncertainty as a single ellipsoidal set and rely on deterministic trust scores to guide robot motion. Such a representation cannot capture the multi-modal and irregular structure of fine-grained perception errors, often resulting in over-conservative sets and degraded navigation performance. In this work, we introduce Gaussian Mixture-based Inverse Perception Contract (GM-IPC), which extends IPC to represent uncertainty with unions of ellipsoidal confidence sets derived from Gaussian mixture models. This design moves beyond deterministic single-set abstractions, enabling fine-grained, multi-modal, and non-convex error structures to be captured with formal guarantees. A learning framework is presented that trains GM-IPC to account for probabilistic inclusion, distribution matching, and empty-space penalties, ensuring both validity and compactness of the predicted sets. We further show that the resulting uncertainty characterizations can be leveraged in downstream planning frameworks for real-time safe navigation, enabling less conservative and more adaptive robot motion while preserving safety in a probabilistic manner.

16:45-17:00, Paper FrC11.6	Add to My Program
A Simulation Evaluation Suite for Robust Adaptive Quadcopter Control

Zhang, Dingqi	University of California, Berkeley
Tao, Ran	University of Illinois at Urbana-Champaign
Cheng, Sheng	University of Illinois Urbana-Champaign
Hovakimyan, Naira	University of Illinois at Urbana-Champaign
Mueller, Mark W.	University of California, Berkeley
Keywords: Robotics, Robust adaptive control, Simulation Abstract: Robust adaptive control methods are essential for maintaining quadcopter performance under external disturbances and model uncertainties. However, fragmented evaluations across tasks, simulators, and implementations hinder systematic comparison of these methods. This paper introduces an easy-to-deploy, modular simulation testbed for quadcopter control, built on textit{RotorPy}, that enables evaluation under a wide range of disturbances such as wind, payload shifts, rotor faults, and control latency. The framework includes a library of representative adaptive and non-adaptive controllers and provides task-relevant metrics to assess tracking accuracy and robustness. The unified modular environment enables reproducible evaluation across control methods and eliminates redundant reimplementation of components such as disturbance models, trajectory generators, and analysis tools. We illustrate the testbed’s versatility through examples spanning multiple disturbance scenarios and trajectory types, including automated stress testing, to demonstrate its utility for systematic analysis. Code is available at url{https://github.com/Dz298/AdaptiveQuadBench}.


FrC12 Regular Session, Grand Salon 18	Add to My Program
Formal Verification and Synthesis

Chair: Mallada, Enrique	Johns Hopkins University
Co-Chair: Aksaray, Derya	Northeastern University

15:30-15:45, Paper FrC12.1	Add to My Program
Learning Linear Temporal Specifications from Demonstrations with Uncertainty

Fahim, Parastou	Penn State University
Lagoa, Constantino M.	Pennsylvania State Univ
Meira-Goes, Romulo	Pennsylvania State University
Keywords: Formal verification/synthesis, Automata, Learning Abstract: Learning temporal logic specifications from system demonstrations is essential for tasks such as formal verification and controller synthesis, especially in safety-critical domains. Existing approaches typically assume demonstrations are correct or only affected by misclassification errors. In practice, however, system traces are often uncertain or incomplete due to sensor faults, measurement errors, or data loss. We present a framework for learning minimal Linear Temporal Logic (LTL) formulas from demonstrations with uncertainty. Our approach models uncertainty via Hamming distance to generate possible estimates around each observed trace, which are grouped with constraints requiring that at least one trace per group is consistent with the learned formula. Our problem is then reduced to an equivalent Pseudo-Boolean Optimization. We evaluate our method against state-of-the-art LTL learning approaches and show that it recovers specifications that more closely align with ground-truth formulas under uncertainty.

15:45-16:00, Paper FrC12.2	Add to My Program
Motion Planning under Temporal Logic Specifications in Semantically Unknown Environments

Taheri, Azizollah	Northeastern University
Aksaray, Derya	Northeastern University
Keywords: Formal verification/synthesis, Autonomous robots, Optimization Abstract: This paper addresses a motion planning problem to achieve spatio-temporal-logical tasks, expressed by syntactically co-safe linear temporal logic specifications (scLTLnext), in uncertain environments. Here, the uncertainty is modeled as some probabilistic knowledge on the semantic labels of the environment. For example, the task is "first go to region 1, then go to region 2"; however, the exact locations of regions 1 and 2 are not known a priori, instead a probabilistic belief is available. We propose a novel automata-theoretic approach, where a special product automaton is constructed to capture the uncertainty related to semantic labels, and a reward function is designed for each edge of this product automaton. The proposed algorithm utilizes value iteration for online replanning. We show some theoretical results and present some simulations/experiments to demonstrate the efficacy of the proposed approach.

16:00-16:15, Paper FrC12.3	Add to My Program
Temporal-Logic-Aware Frontier-Based Exploration

Taheri, Azizollah	Northeastern University
Aksaray, Derya	Northeastern University
Keywords: Formal verification/synthesis, Autonomous robots Abstract: This paper addresses the problem of temporal logic motion planning for an autonomous robot operating in an unknown environment. The objective is to enable the robot to satisfy a syntactically co-safe Linear Temporal Logic (scLTL) specification when the exact locations of the desired labels are not known a priori. We introduce a new type of automaton state, referred to as commit states. These states capture intermediate task progress resulting from actions whose consequences are irreversible. In other words, certain future paths to satisfaction become not feasible after taking those actions that lead to the commit states. By leveraging commit states, we propose a sound and complete frontier-based exploration algorithm that strategically guides the robot to make progress toward the task while preserving all possible ways of satisfying it. The efficacy of the proposed method is validated through simulations.

16:15-16:30, Paper FrC12.4	Add to My Program
Safety-Critical Control Via Recurrent Tracking Functions

Liu, Jixian	Johns Hopkins University
Mallada, Enrique	Johns Hopkins University
Keywords: Formal verification/synthesis, Closed-loop identification, Robotics Abstract: This paper addresses the challenge of synthesizing safety-critical controllers for high-order nonlinear systems, where constructing valid Control Barrier Functions (CBFs) remains computationally intractable. Leveraging layered control, we design CBFs in reduced-order models (RoMs) while regulating full-order models' (FoMs) dynamics at the same time. Traditional Lyapunov tracking functions are required to decrease monotonically, and systematic synthesis methods for such functions exist only for fully-actuated systems. To overcome this limitation, we introduce Recurrent Tracking Functions (RTFs), which replace the monotonic decay requirement with a weaker finite-time recurrence condition. This relaxation permits transient deviations of tracking errors while ensuring safety. By integrating CBFs for RoMs with RTFs, we construct recurrent CBFs (RCBFs) whose zero-superlevel set is control tau-recurrent, and guarantee safety for all initial states in such a set when RTFs are satisfied. We establish theoretical safety guarantees and validate the approach through numerical experiments, demonstrating RTFs' effectiveness and the safety of FoMs.

16:30-16:45, Paper FrC12.5	Add to My Program
Safe and Optimal Learning from Preferences Via Weighted Temporal Logic with Applications in Robotics and Formula 1

Karagulle, Ruya	Univ. of Michigan
Vasile, Cristian Ioan	Lehigh University
Ozay, Necmiye	Univ. of Michigan
Keywords: Formal verification/synthesis, Human-in-the-loop control, Machine learning Abstract: Autonomous systems increasingly rely on human feedback to align their behavior, expressed as pairwise comparisons, rankings, or demonstrations. While existing methods can adapt behaviors, they often fail to guarantee safety in safety-critical domains. We propose a safety-guaranteed, optimal, and efficient approach for solving the learning problem from preferences, rankings, or demonstrations using Weighted Signal Temporal Logic (WSTL). WSTL learning problems, when implemented naively, lead to multi-linear constraints in the weights to be learned. By introducing structural pruning and log-transform procedures, we reduce the problem size and recast it as a Mixed-Integer Linear Program while preserving safety guarantees. Experiments on robotic navigation and real-world Formula 1 data demonstrate that the method effectively captures nuanced preferences and models complex task objectives.

16:45-17:00, Paper FrC12.6	Add to My Program
Shielded Reinforcement Learning under Dynamic Temporal Logic Constraints

Yuksel, Sadik Bera	Northeastern University
Buyukkocak, Ali Tevfik	University of Minnesota
Aksaray, Derya	Northeastern University
Keywords: Reinforcement learning, Formal verification/synthesis, Autonomous robots Abstract: Reinforcement Learning (RL) has shown promise in various robotics applications, yet its deployment on real systems is still limited due to safety and operational constraints. The safe RL field has gained considerable attention in recent years, which focuses on imposing safety constraints throughout the learning process. However, real systems often require more complex constraints than just safety, such as periodic recharging or time-bounded visits to specific regions. Imposing such spatio-temporal tasks during learning still remains a challenge. Signal Temporal Logic (STL) is a formal language for specifying temporal properties of real-valued signals and provides a way to express such complex tasks. In this paper, we propose a framework that leverages sequential control barrier functions and model-free RL to ensure that the given STL tasks are satisfied throughout the learning process. Our method extends beyond traditional safety constraints by enforcing rich STL specifications, which can involve visits to dynamic targets with unknown trajectories. We also demonstrate the effectiveness of our framework through various simulations.


FrC13 Regular Session, Grand Salon 19	Add to My Program
Agent Based Systems II

Chair: Lin, Zongli	University of Virginia
Co-Chair: Selmic, Rastko	Concordia University

15:30-15:45, Paper FrC13.1	Add to My Program
Resilient Leader-Following Consensus Control Using Set-Membership Fuzzy Filtering

Rahimifard, Mahshid	Concordia University
Selmic, Rastko	Louisiana Tech University
Keywords: Agents-based systems, Cooperative control, Fault tolerant systems Abstract: This paper presents a resilient leader–following consensus control method for a class of nonlinear multi-agent systems under cyberattacks. A Takagi–Sugeno (T–S) fuzzy model is employed to approximate the system nonlinearities. A resilient leader-following consensus control law is developed to guarantee that the system states remain within a bounded ellipsoidal set of the leader state despite the simultaneous presence of Unknown But Bounded (UBB) noise, Denial of Service (DoS) attacks on control signals, and deception attacks on sensor measurements. Furthermore, a two-step resilient set-membership estimation method is proposed, together with a fuzzy set-membership filtering approach to provide reliable state estimates in the presence of UBB noise and cyberattacks. Numerical simulations demonstrate the effectiveness of the proposed method.

15:45-16:00, Paper FrC13.2	Add to My Program
Distributed Prescribed-Time Formation Control with Multiple Multi-Agent Systems for Smart Transportation Applications

Merzi, Mehmet Alp	University of Calabria
D'Alfonso, Luigi	Università Della Calabria
Fedele, Giuseppe	Università Della Calabria
Keywords: Agents-based systems, Distributed control, Cooperative control Abstract: In this paper, we present a novel approach for distributed prescribed-time formation control that addresses cooperative object transportation applications, where agents must converge to fixed positions around an object within a prescribed time, before actual transportation begins. The proposed method combines a diffeomorphism-based transformation with a swarm-inspired multi-agent framework to achieve four complementary objectives: (1) driving agents from their initial positions to fixed target locations, (2) guaranteeing convergence within a prescribed time, (3) ensuring collision avoidance throughout the process, and (4) achieving all these objectives with a simple formulation. At the core of the approach lies a time-varying affine transformation that maps agents between virtual and real reference frames, where a distributed control law based on attraction-repulsion dynamics is utilized. The virtual control law employs a time-varying gain that diverges as the time approaches the required prescribed time, enabling finite-time convergence through time normalization techniques. The method is first developed for scalar multi-agent systems and then extended to multidimensional cases by using additional diffeomorphism-based transformations.

16:00-16:15, Paper FrC13.3	Add to My Program
Temporal Graph-Theoretic Stability Analysis of Time-Varying Opinion Dynamics

Abedinzadeh, Mohammadhossein	BINGHAMTON UNIVERSITY Department of Electrical and Computer Engineering
Akyol, Emrah	SUNY Binghamton
Keywords: Agents-based systems, Large-scale systems, Network analysis and control Abstract: The time-varying Friedkin--Johnsen (TVFJ) model provides a principled foundation for studying realistic, time-varying social networks, as it captures both persistent individual biases and adaptive influences. Building on this model, we develop a graph-theoretic framework for stability analysis in opinion dynamics. We introduce emph{defected} and emph{weakly defected} temporal graphs (DTG/WDTG) as topological certificates that translate temporal connectivity and stubborn influence into contraction bounds on the state-transition matrix. Using these notions, we prove (i) emph{asymptotic stability} of TVFJ when DTGs recur infinitely often, and (ii) emph{exponential stability} under semi-periodic defected networks with explicit growth and decay rates. For emph{periodically switching} networks (PSFJ), we further provide a (p)-LTI decomposition that yields closed-form equilibria and establishes the tight bound (\|omega\|le p). Collectively, these results unify algebraic stability tests with an interpretable, topology-driven analysis, offering scalable tools for reasoning about opinion formation in evolving networks.

16:15-16:30, Paper FrC13.4	Add to My Program
Leader-Follower Consensus of Linear Multi-Agent Systems with Intermittent Communication and Disturbances Via Observer-Based Control

Zarei, Fatemeh	Northeastern University
Saeidi, Amirahmad	Amirkabir University of Technology
Shafai, Bahram	Northeastern Univ
Keywords: Distributed control, Robust control, Observers for Linear systems Abstract: In this paper, we investigate the leader–follower consensus problem for linear multi-agent systems (MASs) subject to intermittent communication and bounded exogenous disturbances. To address this challenge, we propose a distributed output-feedback control protocol that incorporates a Proportional-Integral Fading Observer (PIFO). The observer is designed to simultaneously estimate and compensate for unknown disturbances acting on each follower agent. Building on local relative information and disturbance estimates, we develop a distributed consensus control strategy by using Lyapunov stability theory and establish sufficient conditions that guarantee the asymptotic convergence of both consensus and estimation errors. Finally, simulation results are presented to confirm the theoretical findings and to demonstrate the effectiveness of the proposed method in rejecting disturbance and achieving consensus.

16:30-16:45, Paper FrC13.5	Add to My Program
Privacy-Preserving Dynamic Average Consensus by Masking Reference Signals

Maithripala, Mihitha	University of Virginia
Lin, Zongli	University of Virginia
Keywords: Distributed control, Agents-based systems Abstract: In multi-agent systems, dynamic average consensus (DAC) is a decentralized estimation strategy in which a set of agents tracks the average of time-varying reference signals. Because DAC requires exchanging state information with neighbors, attackers may gain access to these states and infer private information. In this paper, we develop a privacy-preserving method that protects each agent’s reference signal from external eavesdroppers and honest-but-curious agents while achieving the same convergence accuracy and convergence rate as conventional DAC. Our approach masks the reference signals by having each agent draw a random real number for each neighbor, exchanges that number over an encrypted channel at the initialization, and computes a masking value to form a masked reference. Then the agents run the conventional DAC algorithm using the masked references. Convergence and privacy analyses show that the proposed algorithm matches the convergence properties of conventional DAC while preserving the privacy of the reference signals. Numerical simulations validate the effectiveness of the proposed privacy-preserving DAC algorithm.

16:45-17:00, Paper FrC13.6	Add to My Program
Communication Efficient Consensus Via Lower-Dimensional Data

Rai, Ayush	Purdue University
Mou, Shaoshuai	Purdue University
Keywords: Agents-based systems, Autonomous systems, Linear systems Abstract: This paper studies consensus in multi-agent systems under limited information exchange, where each agent has access only to a low-dimensional projection of its neighbors' states. We propose a novel projection-based consensus algorithm in which agents update their states using these partial observations, governed by observation matrices that capture local interaction structures. To analyze the resulting dynamics, we introduce the concept of an extended adjacency matrix, which integrates both the network connectivity and the projection information. We provide a theoretical analysis showing that, for strongly connected directed networks and appropriately designed projections, the agents converge exponentially to a consensus within the intersection subspace of the communicated information. Numerical simulations illustrate and validate the theoretical findings, highlighting the effectiveness of the proposed method in achieving agreement under partial-state information.


FrC14 Regular Session, Grand Salon 21	Add to My Program
Machine Learning IV

Chair: Thein, May-Win	University of New Hampshire
Co-Chair: La, Hung	University of Nevada

15:30-15:45, Paper FrC14.1	Add to My Program
Uniting Reinforcement Learning and Model Predictive Control in Feedback Control of Nonlinear Processes

Cui, Xiaodong	University of California, Los Angeles
Khodaverdian, Arthur	University of California, Los Angeles
Christofides, Panagiotis D.	Univ. of California at Los Angeles
Keywords: Reinforcement learning, Lyapunov methods, Stability of nonlinear systems Abstract: This work explores the implementation of a reinforcement learning (RL) based approach to replace model predictive control (MPC) in cases where practical implementations of MPC are infeasible due to excessive computation times. Specifically, with the use of externally enforced stability guarantees, an RL-based controller that is trained to optimize the same cost function as MPC can serve as a potentially more appealing real-time option as opposed to using the same MPC with a shorter horizon. A benchmark nonlinear chemical process model is used to demonstrate the feasibility of this RL-based framework that simultaneously guarantees stability and enables improvements in computational efficiency and potential control quality of the closed-loop system.

15:45-16:00, Paper FrC14.2	Add to My Program
Learning Robust Regions of Attraction Using Rollout-Enhanced Physics-Informed Neural Networks with Policy Iteration

Wang, Junkai	Georgia Institute of Technology
Zhao, Yuxuan	Hong Kong University of Science and Technology
Zhou, Mi	Georgia Institute of Technology
Zhang, Fumin	Hong Kong University of Science and Technology
Keywords: Machine learning, Robust control, Lyapunov methods Abstract: The region of attraction is a key metric of the robustness of systems. This paper addresses the numerical solution of the generalized Zubov’s equation, which produces a special Lyapunov function characterizing the robust region of attraction for perturbed systems. To handle the highly nonlinear characteristic of the generalized Zubov's equation, we propose a physics-informed neural network framework that employs a policy iteration training scheme with rollout to approximate the viscosity solution. In addition to computing the optimal disturbance during the policy improvement process, we incorporate simulation-derived value estimates as anchor points to facilitate the training procedure to prevent singularities in both low- and high-dimensional systems. Numerical simulations validate the effectiveness of the proposed approach.

16:00-16:15, Paper FrC14.3	Add to My Program
LEAP-O: Learning to Predict Dynamic Obstacles for Safe Trajectory Planning

Nguyen, Binh	University of Central Florida
Nghiem, Truong X.	University of Central Florida
Nguyen, Linh	Federation University Australia
La, Hung	University of Nevada
Nguyen, Thang	Texas A&M University-Corpus Christi
Keywords: Machine learning, Autonomous robots, Optimization Abstract: Trajectory planning plays a crucial role in autonomous driving and navigation by enabling robots to generate safe paths while minimizing travel costs and avoiding collisions. This paper addresses the issue of predicting dynamic obstacles for safe trajectory planning when prior information is unavailable and detection range is limited. We propose a learning framework using Gaussian Processes (GP) for motion prediction and uncertainty estimation, further enhanced by Recurrent Neural Networks (RNN) for more accurate predictions. In addition, we develop a receding horizon planning method, formulated as a stochastic optimization problem, to ensure safe, collision-free paths with confidence probabilities. Together, these contributions provide a robust framework for adaptive and safe trajectory generation in dynamic environments. Simulations were performed to demonstrate the effectiveness of the proposed strategy, where our approach (combining GP and RNN) outperformed a baseline method that utilized only GP.

16:15-16:30, Paper FrC14.4	Add to My Program
Plug-And-Play Design of Machine Learning-Based Model Predictive Controllers for Nonlinear Systems

Ji, Yuxiao	National University of Singapore
Shi, Yao	National University of Singapore
Wu, Zhe	National University of Singapore
Keywords: Machine learning, Chemical process control, Predictive control for nonlinear systems Abstract: This work presents a plug-and-play (PnP) machine learning-based model predictive control (ML-MPC) framework for interconnected nonlinear systems. Assuming that each subsystem is input-to-state stable (ISS), the stability of the entire system is ensured via the small-gain theorem. Sufficient conditions are derived to guarantee stability when ML models replace existing controllers. The effectiveness of the framework is demonstrated through a nonlinear chemical process network.

16:30-16:45, Paper FrC14.5	Add to My Program
PIML-RHC As a Control Method: A Physics-Informed Transformer Policy for Real-Time Eco-Driving

Dos Santos, Arnon Bruno Ventrilho	Universidade Federal Do Parana
Zanata Alves, Marco Antonio	Universidade Federal Do Parana
Soares de Oliveira, Luiz Eduardo	Universidade Federal Do Parana
Keywords: Machine learning, Predictive control for nonlinear systems, Automotive control Abstract: We present PIML-RHC, a physics-informed receding-horizon controller in which a Transformer policy is trained end-to-end through a differentiable vehicle model to jointly decide powertrain actions (torque/brake/gear) while enforcing an rpm envelope over a preview horizon. The policy distills a long-horizon, reference-conditioned objective into a single, non-iterative forward pass. Nonlinear Model Predictive Control (NMPC) with mixed discrete-continuous decisions (gears, torque, brake) is a principled baseline but often relies on iterative solvers, which exhibit iteration-dependent compute and potential variability in worst-case execution time (WCET); by contrast, PIML-RHC has bounded per-step compute due to its fixed-depth network. Across 50 tracks (60 steps) from production-like distributions, PIML-RHC achieves competitive or superior efficiency and tracking to NMPC variants while consistently improving drivability (torque smoothness). A hardware-in-the-loop validation on an embedded Raspberry Pi 4B shows median end-to-end loop time 72.7 ms, compatible with supervisory eco-driving cycles. We also discuss WCET, the absence of optimization-time guarantees, and safety mechanisms (e.g., CBF supervisors) required for deployment.

16:45-17:00, Paper FrC14.6	Add to My Program
Reinforcement Learning-Based Path Planning and Obstacle Avoidance Using PPO and B-Splines in Unknown Environments

Shokouhi, Shahab	University of New Hampshire
Oruc, Oguzhan	Istanbul Technical University
Thein, May-Win	University of New Hampshire
Keywords: Reinforcement learning, Learning, Neural networks Abstract: This paper introduces Smart B-Splines (SmartBSP), an advanced reinforcement learning framework for real-time path planning and obstacle avoidance in autonomous robotics navigating through complex environments. The proposed system integrates Proximal Policy Optimization (PPO) and Actor-Critic architecture to process limited LiDAR inputs and compute spatial decision-making probabilities. The robotic vehicle's perceptual field is discretized into a grid format and then analyzed to produce a spatial probability distribution. During the training process a nuanced cost function is minimized that accounts for path curvature, endpoint proximity, and obstacle avoidance. Simulations results in different scenarios validate the algorithm's resilience and adaptability across diverse operational scenarios. Subsequently, Real-time experiments are carried out to assess the efficacy of the proposed algorithm. The experimental results show that the proposed algorithm is successful in different scenarios while maintaining path smoothness.


FrC15 Regular Session, Grand Salon 22	Add to My Program
Stability of Nonlinear Systems II

Chair: Lopez, Brett	University of California - Los Angeles
Co-Chair: Komaee, Arash	Southern Illinois University

15:30-15:45, Paper FrC15.1	Add to My Program
Global Asymptotic Stability Certificates for Discrete-Time Lur'e Systems under Incremental-Like Restrictions

Montana, Gioia	Sapienza Università Di Roma
Cristofaro, Andrea	Sapienza University of Rome
Mattioni, Mattia	Università Degli Studi Di Roma La Sapienza
Valmorbida, Giorgio	L2S, CentraleSupelec
Keywords: Stability of nonlinear systems, Lyapunov methods Abstract: In this paper, we provide Lyapunov-based conditions for assessing the global asymptotic stability of nonlinear discrete-time Lur'e systems under incremental-like restrictions. The proposed results invoke dissipation arguments for the linear and nonlinear components of the dynamics and rely on a Lyapunov function with a few parameters and integral terms. The proof of the main result does not require any approximations of the integral terms. The impact of the result is highlighted through a comparison with the literature.

15:45-16:00, Paper FrC15.2	Add to My Program
Accumulating Magnetic Nanorods in a Stable Magnetic Trap by Exploiting Their Rotational Dynamics

Cantrell, Denae	University of California, Berkeley
Komaee, Arash	Southern Illinois University
Keywords: Stability of nonlinear systems, MEMs and Nano systems, Biomedical Abstract: Magnetic drug delivery aims to accumulate a swarm of magnetized therapeutic nanoparticles at sites of disease, but it is impossible by the very nature of magnetic fields to create a stable equilibrium point inside them in order to trap these particles. When spherical nanoparticles are exposed to a magnetic field, they immediately align with that field, causing them to move towards the source of the field, rather than a targeted site of disease. Magnetic nanorods, on the other hand, exhibit a brief rotational period during which they could be repelled by the magnetic field towards the targeted site. Previous experimental work has successfully exploited the transient rotational dynamics of magnetic nanorods to concentrate them in the vicinity of a targeted point. To develop this purely experimental work into an operational drug delivery system, a mathematical model is necessary to quantitatively describe the experiment. This paper presents such a model, verified by computer simulations and experiment.

16:00-16:15, Paper FrC15.3	Add to My Program
Robust Data-Driven Invariant Sets for Nonlinear Systems

Kiani, Sahand	Pennsylvania State University
Lagoa, Constantino M.	Pennsylvania State Univ
Keywords: Stability of nonlinear systems, Optimization, Learning Abstract: The synthesis of robust invariant sets for nonlinear systems has traditionally been hindered by the inherent non-convexity and a strict reliance on exact analytical models. This paper presents a purely data-driven framework to compute robust polytopic contractive sets for unknown nonlinear systems operating under persistent bounded process noise and state-input constraints. Rather than attempting to identify a single, potentially nominal model, we utilize a finite data set to construct a polytopic consistency set—a rigorous geometric boundary encapsulating all possible system dynamics compatible with the noisy measurements. The core contribution of this work extends an established sufficient condition for λ-contractiveness into the data-driven setting. Crucially, we prove that enforcing this condition strictly over the vertices of the consistency set guarantees robust invariance.

16:15-16:30, Paper FrC15.4	Add to My Program
Analysis of the Geometric Heat Flow Equation: Computing Geodesics in Real-Time with Convergence Guarantees

Gessow, Samuel	University of California, Los Angeles
Lopez, Brett	University of California - Los Angeles
Keywords: Stability of nonlinear systems, Optimization, Numerical algorithms Abstract: We present an analysis on the convergence properties of the so-called geometric heat flow equation for computing geodesics (extremal curves) on Riemannian manifolds. Computing geodesics numerically in real time has become an important capability across several fields, including control and motion planning. The geometric heat flow equation involves solving a parabolic partial differential equation whose solution is a geodesic. In practice, solving this PDE numerically can be done efficiently, and tends to be more numerically stable and exhibit a better rate of convergence compared to numerical optimization. We prove that the geometric heat flow equation is exponentially stable in L_2 if the curvature of the Riemannian manifold does not exceed a positive bound and that asymptotic convergence in L_2 is always guaranteed. We also present a pseudospectral method that leverages Chebyshev polynomials to accurately compute geodesics in only a few milliseconds for non-contrived manifolds. Our analysis was verified with our custom pseudospectral method by computing geodesics on common non-Euclidean surfaces, and in feedback for a contraction-based controller with a non-flat metric for a nonlinear system.

16:30-16:45, Paper FrC15.5	Add to My Program
A Numerical Investigation of the Domain of Attraction of the Kapitza Pendulum Using Harmonic and Nonharmonic Base Excitation

Islam, Syed Aseem Ul	University of Michigan
Kouba, Omran	Higher Institute for Applied Sciences and Technology
Portella Delgado, Jhon Manuel	University of Michigan
Bernstein, Dennis S.	Univ. of Michigan
Keywords: Stability of nonlinear systems, Simulation Abstract: Harmonic, vertical base excitation can stabilize the inverted pendulum; this application of vibrational control is commonly known as the Kapitza pendulum. In addition to being visually surprising, this phenomenon is remarkable due to the fact that stabilization is achieved by open-loop control and thus without the need for sensing or real-time computing. The present paper contributes to the study of the Kapitza pendulum by exploring the limitations of vibrational control. In particular, for harmonic base excitation, we estimate the domain of attraction of the Kapitza pendulum over the range of stabilizing parameters. We then determine the ability of nonharmonic base excitation to extend the domain of attraction. Both investigations depend on numerical simulation and optimization.


FrC16 Regular Session, Grand Salon 24	Add to My Program
Iterative Learning Control

Chair: Rogers, Eric	University of Southampton
Co-Chair: Hashimoto, Kazumune	Osaka University

15:30-15:45, Paper FrC16.1	Add to My Program
MM-LMPC: Multi-Modal Learning Model Predictive Control Via Bandit-Based Mode Selection

Hashimoto, Wataru	Osaka University
Hashimoto, Kazumune	Osaka University
Keywords: Iterative learning control, Predictive control for nonlinear systems, Optimal control Abstract: Learning Model Predictive Control (LMPC) improves performance on iterative tasks by leveraging data from previous executions. However, in tasks with multiple possible solution routes, LMPC heavily depends on the initial trajectories: states with high cost-to-go are rarely selected as terminal candidates in later iterations, leaving parts of the state space unexplored and potentially missing better solutions. This challenge arises because LMPC constructs a sampled safe set from past trajectories and uses it as a terminal constraint, with a terminal cost given by the corresponding cost-to-go. To overcome this limitation, we propose Multi-Modal LMPC (MM-LMPC), which clusters past trajectories into modes and maintains mode-specific terminal sets and value functions. A bandit-based meta-controller with a Lower Confidence Bound (LCB) policy balances exploration and exploitation across modes, enabling systematic refinement across multiple modes. MM-LMPC preserves the standard LMPC properties of recursive feasibility and closed-loop stability, while additionally guaranteeing asymptotic convergence to the best mode and a logarithmic regret bound. Simulations on obstacle-avoidance tasks show the effectiveness of the proposed method.

15:45-16:00, Paper FrC16.2	Add to My Program
Accelerated Iterative Learning Control of Stochastic Discrete Linear Systems with Switching of the Dynamics and Reference Trajectory

Pakshin, Pavel	Arzamas Polytechnic Institute of R.E. Alekseev Nizhny Novgorod STU
Emelianova, Julia	Arzamas Polytechnic Institute of R.E. Alekseev NizhnyNovgorod State Technical University
Rogers, Eric	University of Southampton
Keywords: Iterative learning control, Stochastic systems, Stability of linear systems

16:00-16:15, Paper FrC16.3	Add to My Program
Event-Triggered Learning Robust MPC for Unknown Time-Varying ARX Systems Using Input-Output Data

Deng, Li	University of Alberta
Shu, Zhan	University of Alberta
Chen, Tongwen	University of Alberta
Keywords: Iterative learning control, Time-varying systems, Robust control Abstract: An event-triggered learning robust model predictive control (MPC) framework is proposed for unknown time-varying autoregressive exogenous (ARX) systems where all time-varying matrices are approximated by a polytope representation. An event-triggered learning scheme that combines model estimation with polytope learning is developed, which reduces the frequency of learning updates while ensuring convergence. Using the learned polytope, a robust MPC controller based on an augmented state-space model is designed to guarantee a mixed input-output constraint and minimize the upper bound of an infinite-horizon cost function. To capture the connection between consecutive learned polytopes, a matching error is introduced and thus input-to-state stability is analyzed. A numerical example is provided to demonstrate the effectiveness of the proposed approach.

16:15-16:30, Paper FrC16.4	Add to My Program
A Simple Structure Model-Free ILC Scheme for Nonlinear Batch Processes Designed within a Repetitive Process Setting

Paszke, Wojciech	University of Zielona Gora
Hao, Shoulin	Dalian University of Technology
Liu, Tao	Dalian University of Technology
Tao, Hongfeng	Jiangnan University
Rogers, Eric	University of Southampton
Keywords: Iterative learning control Abstract: This paper develops new results on data-driven iterative learning control for nonlinear batch processes. The dynamic linearization approach is used to obtain linearized local dynamical models utilizing only the collected process input and output data. As a result, no dynamic structure of the nonlinear model is required for the control design. Additionally, the design problem is formulated within the repetitive process framework, which simplifies the design procedure, facilitates the integrated synthesis of feedback and learning controllers, and aids in the adjustment of control parameters. The convergence of the new data-driven control method is demonstrated by the stability of the resulting repetitive process, which ensures that the tracking error decreases along both the time and iteration (batch) axes. Stability properties can be effectively checked using linear matrix inequality techniques. A numerical example is included to highlight the application of the new results.

16:30-16:45, Paper FrC16.5	Add to My Program
Koopman-Based Sliding Mode Control for Data-Driven Stabilization of Nonlinear Systems

Labbadi, Moussa	Bretagne INP
Zhong, Zhengang	University of Warwick
Keywords: Iterative learning control Abstract: This paper presents a data-driven method for the design of stabilizing controllers for nonlinear systems with exponential stability guarantees. The approach leverages Koopman operator theory to obtain a bilinear representation of the system dynamics directly from measurement data, while explicitly accounting for approximation errors arising from the use of finitely many data samples. On top of this lifted representation, a sliding mode control (SMC) strategy is developed to robustly stabilize the system and compensate for residual modeling error. The proposed framework focuses on bilinear single-input systems, for which rigorous stability guarantees are provided. Furthermore, the methodology naturally extends to multi-input systems, opening the way for broader applications in data-driven robust control.

16:45-17:00, Paper FrC16.6	Add to My Program
A Repetitive Learning Model Predictive Control Method for Nonlinear Systems with Application to Roll-To-Roll Manufacturing

Martin, Christopher	University of Texas at Austin
Li, Shihao	The University of Texas at Austin
Li, Jiachen	University of Texas at Austin
Li, Wei	University of Texas at Austin
Chen, Dongmei	The University of Texas at Austin
Keywords: Manufacturing systems, Iterative learning control, Predictive control for nonlinear systems Abstract: Roll-to-roll (R2R) mechanical dry transfer is a manufacturing process that enables high-throughput and environmentally friendly fabrication of advanced thin-film devices. Achieving high-quality transfer requires precise tension control, which is challenging due to nonlinear peeling dynamics and input constraints. Additionally, since devices are often transferred repetitively in continuous production lines, this repetition can be leveraged to improve control. To this end, this study presents a repetitive learning model predictive control (RLMPC) strategy. To account for repetitive disturbances and model mismatch, the method uses measurements from past periods and an internal linear time-varying (LTV) model to solve a receding horizon optimal control problem at each timestep. The LTV sensitivities are updated online using a nonlinear process model, enabling full utilization of the nonlinear model in a computationally efficient manner. A case study applying the RLMPC method to R2R mechanical dry transfer is presented, representing the first application of RLMPC to an R2R system. The approach significantly improved the web tension tracking performance across successive cycles, achieving a 64% reduction in root mean squared error compared to a baseline linear MPC. These results indicate that the proposed method has the potential to deliver superior performance in industrial applications.


FrC17 Regular Session, Churchill A1	Add to My Program
Stochastic Optimal Control II

Chair: Bakolas, Efstathios	The University of Texas at Austin
Co-Chair: Wan, Yan	University of Texas at Arlington

15:30-15:45, Paper FrC17.1	Add to My Program
Multi-Model Covariance Steering for Continuous-Time Stochastic Linear Systems

Bakolas, Efstathios	The University of Texas at Austin
Keywords: Stochastic optimal control, Stochastic systems, Uncertain systems Abstract: In this paper, we consider the finite-horizon covariance steering problem for continuous-time stochastic linear systems subject to white noise in the special case in which the (uncertain) system dynamics is unknown but belongs necessarily to a finite collection of known models. We refer to the latter problem as the multi-model covariance steering (MMCS) problem. Because a common controller that would steer the state covariance of each possible system realization to the same positive definite matrix may likely not exist in general, we consider a different problem formulation that is based on the a posteriori interpretation of the standard covariance steering problem as a Linear Quadratic Gaussian (LQG) control problem with a special terminal cost. This interpretation allows us to formulate the multi-model covariance steering problem as a minimax (robust) stochastic optimal control problem that can be addressed by means of the Robust Maximum Principle. In particular, we show that the feedback control law that solves the MMCS problem can be fully characterized by (1) the solution to an LQG-type problem for a system with an augmented state space and (2) a tractable, yet non-convex, optimization problem. Simulations results that illustrate some key ideas of this paper are also included.

15:45-16:00, Paper FrC17.2	Add to My Program
Fine-Tuning Diffusion Models Via Stochastic Control: Entropy Regularization and Beyond

Tang, Wenpin	Columbia University
Zhou, Fuzhong	Columbia University
Keywords: Stochastic optimal control, Reinforcement learning, Machine learning Abstract: This paper aims to develop and provide a rigorous treatment to entropy regularized fine-tuning in the context of continuous-time diffusion models, which was proposed by Uehara et al. (arXiv:2402.15194). The idea is to use stochastic control for sample generation, where the entropy regularizer is introduced to mitigate reward collapse. We also show how the analysis can be extended to fine-tuning with a general f -divergence regularizer. Numerical experiments on large-scale text-to-image models – Stable Diffusion v1.5 are conducted to validate our approach.

16:00-16:15, Paper FrC17.3	Add to My Program
Belief-Based Reinforcement Learning for Asymmetric Partially Observable Zero-Sum Games

Hosseini, Seyed Hamid	Northeastern University
Kazeminajafabadi, Armita	Northeastern University
Kamara, Amidu	U.S. Department of Homeland Security
Imani, Mahdi	Northeastern University
Keywords: Stochastic systems, Markov processes, Reinforcement learning Abstract: Adversarial decision-making under asymmetric partial observability arises across domains such as cybersecurity, autonomous maneuvering, and resource allocation. Existing approaches often rely on recurrent encoders or centralized critics, but these lack equilibrium guarantees and degrade under observation noise or information asymmetry. We develop a belief-space game-theoretic framework in which each player maintains a local belief vector, a posterior distribution over hidden states conditioned on their own observations. We show that these beliefs serve as sufficient statistics for optimal play, enabling partially observable interactions to be recast as a two-player zero-sum Markov game over belief pairs. We characterize Nash equilibria in this belief game and propose a belief-based actor–critic method that approximates these strategies at scale using decentralized belief-conditioned actors and a centralized critic active only during training. Experiments on network-based adversarial scenarios demonstrate significant gains over state-of-the-art baselines, with advantages increasing under higher noise and stronger asymmetry.

16:15-16:30, Paper FrC17.4	Add to My Program
Probabilistic Modeling of CGM-Derived Daily Minimum Glucose in Type 2 Diabetes: Hierarchical vs. Non-Hierarchical Bayesian Approaches

Ganji, Mohammadreza	University of Virginia
El Fathi, Anas	University of Virginia
Fabris, Chiara	University of Virginia
Keywords: Uncertain systems, Stochastic optimal control, Identification for control Abstract: Type 2 diabetes is a chronic metabolic disorder characterized by insulin resistance and progressive beta-cell dysfunction. After lifestyle modifications and non-insulin antidiabetic drugs, long-acting (basal) insulin is the therapy of choice. However, due to disease heterogeneity, treatment requires initial titration to escalate doses to the needed amount and frequent re-calibration due to the time-varying metabolic system. Historically, insulin dose titration and adaptation have relied on fasting plasma glucose measured by self-monitoring of blood glucose, but with continuous glucose monitoring (CGM) becoming standard of care, data streams with larger information content are available. In this study, CGM-derived daily minimum glucose (DMG) is examined as an informative metric for basal insulin titration and adaptation, motivated by its direct relationship with clinically defined hypoglycemia events. The goal is to introduce a probabilistic model of DMG conditioned on insulin input, and to compare two Bayesian modeling paradigms for DMG prediction: hierarchical and non-hierarchical. These paradigms reflect different assumptions about patient independence versus shared population structure, and their impact on parameter estimates and uncertainty quantification was systematically evaluated. On test data, the hierarchical model achieved a posterior predictive RMSE of 29.43 ± 0.84 and the non-hierarchical model 30.50 ± 1.37 (mean ± SD). The proposed framework yields uncertainty-aware models of DMG, enabling probabilistic inference and providing a foundation for stochastic control in basal insulin titration and adaptation.

16:30-16:45, Paper FrC17.5	Add to My Program
MPCM–Taguchi for Decision-Making in Uncertain Systems with Application to DC Microgrids

Zhou, Siyu	University of Texas at Arlington
Wan, Yan	University of Texas at Arlington
Koru, Ahmet Taha	University of Texas at Arlington
Keywords: Uncertain systems, Stochastic optimal control, Large-scale systems Abstract: Decision-making for systems with high-dimensional uncertain parameters is critical to maintaining system stability and reliable operation. This paper develops an efficient decision-making method for high-dimensional uncertain systems by leveraging the Multivariate Probabilistic Collocation Method (MPCM)-Taguchi method. Based on the MPCM-Taguchi framework, an integral reinforcement learning (IRL) control algorithm is developed. The proposed method is applied to power buffer systems in DC microgrids with uncertain loads. Comparative studies demonstrate the effectiveness and practical value of the approach for real-time decision-making in microgrid systems.

16:45-17:00, Paper FrC17.6	Add to My Program
Discrete-Time Linear Quadratic Stochastic Control with Equality-Constrained Inputs: Application to Energy Demand Response

Seugnet, Léo	Polytechnique Montréal
Gao, Shuang	Polytechnique Montreal
Keywords: Stochastic optimal control, Constrained control, Smart grid Abstract: We investigate the discrete-time stochastic linear quadratic control problem for a population of cooperative agents under the hard equality constraint on total control inputs, motivated by demand response in renewable energy systems. We establish the optimal solution that respects hard equality constraints for systems with additive noise in the dynamics. The optimal control law is derived using dynamic programming and Karush-Kuhn-Tucker (KKT) conditions, and the resulting control solution depends on a discrete-time Riccati-like recursive equation. Application examples of coordinating the charging of a network of residential batteries to absorb excess solar power generation are demonstrated, and the proposed control is shown to achieve exact power tracking while considering individual State-of-Charge (SoC) objectives.


FrC18 Regular Session, Churchill A2	Add to My Program
Fault Tolerant Systems II

Chair: Chung, Soon-Jo	California Institute of Technology
Co-Chair: Bodson, Marc	Univ. of Utah

15:30-15:45, Paper FrC18.1	Add to My Program
Control Structure Screening for Robust Isolation of Controller-Actuator False Data Injection Attacks

Gajjar, Aatam	University of California, Davis
El-Farra, Nael H.	University of California, Davis
Ellis, Matthew	University of California, Davis
Keywords: Chemical process control, Process Control, Fault diagnosis Abstract: In this work, we present a screening methodology that incorporates robust cyberattack isolation as a key criterion in the selection of control structures for dynamic processes. We focus on controller–actuator link attacks and on isolation schemes based on a bank of unknown-input observers with dedicated residuals. For this class of schemes, we characterize how the control system structure determines the ability to isolate attacks not only from one another, but also from persistent process disturbances. We introduce the robust attack isolation metric (RAIM), which measures the extent to which attack isolation can be guaranteed despite disturbances. Building on RAIM, we develop a systematic screening algorithm to guide the selection of control structures with the greatest potential for robust attack isolation. We demonstrate the effectiveness of the methodology by applying it to a chemical process to illustrate the attack isolation capabilities of a robustly cyberattack-aware control structure.

15:45-16:00, Paper FrC18.2	Add to My Program
Failure-Tolerant Data-Driven Control for Discrete-Time Linear Systems

Lu, Shuaizheng	Augusta University
Liao, Jiaping	Augusta University
Mo, Zihao	Augusta University
Xiang, Weiming	Augusta University
Keywords: Fault tolerant systems, Linear systems, Robust control Abstract: This paper presents a novel direct data-driven failure-tolerant control methodology for discrete-time linear time-invariant (LTI) systems that are subject to controller failure. Our approach leverages system data directly, obviating the need for an explicit system model. By applying time-varying piecewise quadratic Lyapunov function (TPQLF), the proposed methodology formulates the controller design as a semi-definite programming (SDP) problem, enabling exponential stabilization directly from collected system data. The effectiveness of the proposed approach is demonstrated through a numerical example involving a networked system experiencing packet drop-offs, showing its value for enhancing reliability and safety in variable conditions.

16:00-16:15, Paper FrC18.3	Add to My Program
Closing the Loop Inside Neural Networks: Causality-Guided Layer Adaptation for Fault Recovery Control

Taheri, Mahdi	California Institute of Technology (Caltech)
Chung, Soon-Jo	California Institute of Technology
Hadaegh, Fred Y.	California Inst. of Tech
Keywords: Fault tolerant systems, Neural networks, Adaptive control Abstract: This paper studies the problem of real-time fault recovery control for nonlinear control-affine systems subject to actuator loss of effectiveness faults and external disturbances. We develop a two-stage framework that combines causal inference with selective online adaptation to achieve an effective learning-based recovery control method. In the offline phase, we introduce a causal layer attribution technique based on the average causal effect (ACE) to evaluate the relative importance of each layer in a pretrained deep neural network (DNN) controller compensating for faults. This provides a principled approach to select the most causally influential layer for fault recovery control in the sense of ACE, and goes beyond the widely used last-layer adaptation approach. In the online phase, we deploy a Lyapunov-based gradient update to adapt only the ACE-selected layer to circumvent the need for full-network or last-layer only updates. The proposed adaptive controller guarantees uniform ultimate boundedness (UUB) with exponential convergence of the closed-loop system in the presence of actuator faults and external disturbances. Compared to conventional adaptive DNN controllers with full-network adaptation, our methodology has a reduced computational overhead in the online phase. To demonstrate the effectiveness of our proposed methodology, a case study is provided on a 3-axis attitude control system of a spacecraft with four reaction wheels.

16:15-16:30, Paper FrC18.4	Add to My Program
Fault Tolerant Torque/Speed Control of a Doubly-Fed Induction Machine

Hossain, Md Abid	University of Utah
Bodson, Marc	Univ. of Utah
Keywords: Electrical machine control, Fault tolerant systems, PID control Abstract: This paper proposes a control algorithm to perform torque control for the doubly-fed induction machine in the presence of inverter faults. With an open or short-circuited inverter switch, performing torque control using the remaining healthy phases of motors like permanent magnet (PM) and cage-rotor induction motors (IM) is impossible without additional switches or undesirable torque ripples. The objective of the paper is to demonstrate interesting fault-tolerance options available for doubly fed induction machines (DFIM) when operated using half-sized converters on the stator and rotor windings. In contrast with existing methods that reconfigure the DFIM as an induction machine by short-circuiting the stator or rotor windings or by including more complicated circuitry to detach the faulted part, this paper describes the operation without additional circuit components. With the proposed control method, the DFIM draws controlled current from the faulted inverter, leading to higher torque generation capability. The same strategy can be applied to single-phase open-circuit faults with minor modifications. The maximum torque per ampere feature is considered and integrated into the algorithm. Tests are performed on an experimental testbed to validate the proposed control algorithm.

16:30-16:45, Paper FrC18.5	Add to My Program
Fault-Tolerant Adaptive Switching Control of Deformable Mirrors under Actuator Failures and Saturation Limits

Xu, Binyan	Univeristy of Guelph
Aljanaideh, Khaled	American University of Sharjah
Goy, Matthias	Fraunhofer Institute for Applied Optics and Precision Engineering
Al Janaideh, Mohammad	University of Guelph
Keywords: Mechatronics, Mechanical systems/robotics Abstract: This paper presents an adaptive switching fault-tolerant control (FTC) framework for deformable mirrors to ensure reliable surface shape tracking under actuator faults and constraints. A finite-dimensional sensor–actuator model is established that operates directly in actuator–sensor coordinates while preserving both the resonant characteristics and static gain of the original mirror dynamics. The model is then decomposed into localized one-sensor, multiple-actuator subsystems, enabling decentralized control and reducing computational complexity. Each subsystem integrates an adaptive control law with an online switching mechanism that autonomously reconfigures actuator excitation subsets. Rather than relying on explicit fault detection, the switching is guided by a performance index that deactivates faulty or saturated actuators and reallocates effort to healthy ones. An adaptive disturbance compensation law further addresses modeling uncertainties and neglected actuator couplings. A Lyapunov-based analysis guarantees closed-loop stability for all subsystems and the overall DM system. Simulation results demonstrate that the proposed method achieves accurate wavefront shaping, efficient actuator utilization, and robust fault tolerance.


FrC19 Regular Session, Churchill B1	Add to My Program
Optimal Control V

Chair: Yel, Esen	Rensselaer Polytechnic Institute
Co-Chair: Bin Mohaya, Turki	University of Michigan

15:30-15:45, Paper FrC19.1	Add to My Program
Partial Attention in Deep Reinforcement Learning for Safe Multi-Agent Control

Bin Mohaya, Turki	University of Michigan, Ann Arbor
Seiler, Peter	University of Michigan, Ann Arbor
Keywords: Traffic control, Multivehicle systems, Robotics Abstract: Attention mechanisms excel at learning sequential patterns by discriminating data based on relevance and importance. This provides state-of-the-art performance in today’s advanced generative artificial intelligence models. This paper applies this concept of an attention mechanism for multi-agent safe control. We specifically consider the design of a neural network to control autonomous vehicles in a highway merging scenario. The environment is modeled as a Decentralized Partially Observable Markov Decision Process (Dec-POMDP). Within a QMIX framework, we include partial attention for each autonomous vehicle, thus allowing each ego vehicle to focus on the most relevant neighboring vehicles. Moreover, we propose a comprehensive reward signal that considers the environment’s global objectives (e.g., safety and vehicle flow) and the individual interests of each agent. Simulations are conducted in the Simulation of Urban Mobility (SUMO). The results show better performance compared to other driving algorithms in terms of safety, driving speed, and reward.

15:45-16:00, Paper FrC19.2	Add to My Program
Event-Based Control Via Sparsity-Promoting Regularization: A Rollout Approach with Performance Guarantees

Nishida, Shumpei	Ritsumeikan University
Okano, Kunihisa	Ritsumeikan University
Keywords: Optimal control, Optimization, Networked control systems Abstract: This paper presents a controller design framework aiming to balance control performance and actuation rate. Control performance is evaluated by an infinite-horizon average cost, and the number of control actions is penalized via sparsity-promoting regularization. Since the formulated optimal control problem has a combinatorial nature, we employ a rollout algorithm to obtain a tractable suboptimal solution. In the proposed scheme, actuation timings are determined through a multistage minimization procedure based on a receding-horizon approach, and the corresponding control inputs are computed online. We establish theoretical performance guarantees with respect to periodic control and prove the stability of the closed-loop system. The effectiveness of the proposed method is demonstrated through a numerical example.

16:00-16:15, Paper FrC19.3	Add to My Program
Planning Stealthy Backdoor Attacks in MDPs with Observation-Based Triggers

Wei, Xinyi	University of Florida
Han, Shuo	University of Illinois Chicago
Hemida, Ahmed (Ahmed H. Anwar)	ARL
Kamhoua, Charles	U.S. Army Research Laboratory
Fu, Jie	University of Florida
Keywords: Markov processes Abstract: This paper investigates backdoor attack planning in stochastic control systems modeled as Markov Decision Processes (MDPs). A backdoor attack involves an adversary deploying a policy that performs well in the original MDP to pass testing, but behaves maliciously at runtime when combined with a trigger that perturbs system dynamics. We consider a sophisticated attacker capable of jointly optimizing the backdoor policy and its trigger using only a blackbox simulator. During execution, the attacker has access only to partial observations of the system state and is restricted to introduce small perturbations to the system’s transition dynamics. We formulate the attack planning problem as a constrained Markov game with an augmented state space and two players: Player 0 learns a backdoor policy that maximizes attack rewards when the trigger is active. However, when the trigger is inactive, the backdoor policy behaves near-optimally in the original MDP; Player 1 designs a finite-memory, observation-based trigger to activate the attack. We propose a switching gradient-based optimization algorithm to jointly solve for the backdoor policy and trigger. Experiments on a case study demonstrate the effectiveness of our method in achieving stealthy and successful backdoor attacks. how the attack performance varies under different parameters related to the stealthiness of the backdoor attack.

16:15-16:30, Paper FrC19.4	Add to My Program
Safe Output Feedback Approximate Dynamic Programming for Nonlinear Control Affine Systems

Mahmud, S M Nahid	Purdue University
Rai, Ayush	Purdue University
Mou, Shaoshuai	Purdue University
Keywords: Nonlinear output feedback, Optimal control, Observers for nonlinear systems Abstract: In this work, we study the state-constrained optimal control problem for nonlinear control-affine systems under output feedback. While Approximate Dynamic Programming (ADP) enables online feedback via approximate Bellman solutions, most safe-ADP methods assume full-state availability, which is often unrealistic in practice. Simply replacing unmeasured states with their estimates can violate safety. We propose estimation-aware Lyapunov–Control Barrier Functions (EA-LCBFs) that modify barrier conditions using explicit bounds on estimation error, yielding an invasive safety filter around a nominal ADP policy. The resulting online scheme preserves ADP performance, certifies forward invariance of the actual safe set, and operates only using output measurements. We demonstrate reliable constraint satisfaction without compromising the performance of ADP-based optimization on a representative nonlinear system.

16:30-16:45, Paper FrC19.5	Add to My Program
Optimizing Task Completion Time Updates Using POMDPs

Eddy, Duncan	Stanford University
Yel, Esen	Rensselaer Polytechnic Institute
Passmore, Emma Elizabeth	Stanford University
Egan, Niles	Stanford University
Armour, Grayson	Stanford University
Asmar, Dylan	Stanford University
Kochenderfer, Mykel	Stanford University
Keywords: Control applications, Information technology systems, Uncertain systems Abstract: Managing announced task completion times is a fundamental control problem in project management. While extensive research exists on estimating task durations and task scheduling, the problem of when and how to update completion times communicated to stakeholders remains understudied. Organizations must balance announcement accuracy against the costs of frequent timeline updates, which can erode stakeholder trust and trigger costly replanning. Despite the prevalence of this problem, current approaches rely on static predictions or ad-hoc policies that fail to account for the sequential nature of announcement management. In this paper, we formulate the task announcement problem as a Partially Observable Markov Decision Process (POMDP) where the control policy must decide when to update announced completion times based on noisy observations of true task completion. Since most state variables (current time and previous announcements) are fully observable, we leverage the Mixed Observability MDP (MOMDP) framework to enable more efficient policy optimization. Our reward structure captures the dual costs of announcement errors and update frequency, enabling synthesis of optimal announcement control policies. Using off-the-shelf solvers, we generate policies that act as feedback controllers, adaptively managing announcements based on belief state evolution. Simulation results demonstrate significant improvements in both accuracy and announcement stability compared to baseline strategies, achieving up to 75% reduction in unnecessary updates while maintaining or improving prediction accuracy.

16:45-17:00, Paper FrC19.6	Add to My Program
Complex-Valued Optimal Control for Three-Phase Voltage Source Converter Model Using Policy Iteration

Nagappan, Manoj	SRM Institute of Science and Technology
Murugesan, Sathishkumar	SRM Institute of Science and Technology
Tran, Quang Huy	National Cheng Kung University
Liu, Yen-Chen	National Cheng Kung University
Keywords: Linear systems, Optimal control, Iterative learning control Abstract: We develop a unified model-based policy iteration (PI) framework for determining the optimal control policy of complex-valued (CV) linear continuous-time (CT) systems. Using the properties of CR-calculus and the value function for the CV system, we first derive the CV algebraic Riccati equation (CVARE) and the corresponding control policy. Afterwards, we develop a unified PI technique based on CV policy evaluation and policy update steps that iteratively generate a control policy, which can also be used for the real-valued case. We demonstrate convergence of the proposed algorithm toward the optimal control solution, and we illustrate its effectiveness through the computation of the optimal control policy for a three-phase voltage source converter incorporating an LC output filter in its CV representation.


FrC20 Regular Session, Churchill B2	Add to My Program
Model Predictive Control IV

Chair: Walker, Markus	Karlsruhe Institute of Technology (KIT)
Co-Chair: Nuculaj, Luke	Oakland University

15:30-15:45, Paper FrC20.1	Add to My Program
A Model Predictive Control Approach to Optimal Point Tracking of Unknown Trajectories for Human Binocular Eye Rotation

Athukorallage, Bhagya	Williams College
Ghosh, Bijoy	Texas Tech University
Finlay, Mikaela	MIT
Platonov, Liza	Middlebury College
Bagga, Akkshansh	Williams College
Tobin, Benjamin	Williams College
Keywords: Predictive control for nonlinear systems, Optimal control, Optimization Abstract: This paper presents an optimal control framework for modeling binocular eye movements while tracking unknown trajectories. Building upon a Riemannian geometric formulation of eye dynamics, we extend prior monocular and binocular models to derive equations of motion subject to Listing’s Law and binocular coupling constraints. We formulate a model predictive control (MPC) approach with two variants: a deterministic controller based on linear extrapolation and a stochastic controller leveraging Gaussian Process regression for uncertainty-aware trajectory prediction. Numerical simulations across sinusoidal, noisy sinusoidal, and random walk trajectories demonstrate that stochastic MPC significantly improves tracking robustness in uncertain or noisy conditions, achieving up to 94% reduction in tracking error compared to deterministic MPC.

15:45-16:00, Paper FrC20.2	Add to My Program
Sample-Efficient and Smooth Cross-Entropy Method Model Predictive Control Using Deterministic Samples

Walker, Markus	Karlsruhe Institute of Technology (KIT)
Frisch, Daniel	Karlsruhe Institute of Technology (KIT)
Hanebeck, Uwe D.	Karlsruhe Institute of Technology (KIT)
Keywords: Predictive control for nonlinear systems, Optimal control, Optimization algorithms Abstract: Cross-entropy method model predictive control (CEM—MPC) is a powerful gradient-free technique for non-linear optimal control, but its performance is often limited by the reliance on random sampling. This conventional approach can lead to inefficient exploration of the solution space and non-smooth control inputs, requiring a large number of samples to achieve satisfactory results. To address these limitations, we propose deterministic sampling CEM (dsCEM), a novel framework that replaces the random sampling step with deterministic samples derived from localized cumulative distributions (LCDs). Our approach introduces modular schemes to generate and adapt these sample sets, incorporating temporal correlations to ensure smooth control trajectories. This method can be used as a drop-in replacement for the sampling step in existing CEM-based controllers. Experimental evaluations on two nonlinear control tasks demonstrate that dsCEM consistently outperforms state-of-the-art iCEM in terms of cumulative cost and control input smoothness, particularly in the critical low-sample regime.

16:00-16:15, Paper FrC20.3	Add to My Program
Ensemble Kalman Inversion for Constrained Nonlinear MPC: An ADMM-Splitting Approach

Khalil, Ahmed	The University of Texas at Austin
Safwat, Mohamed	University of Washington, Seattle
Bakolas, Efstathios	The University of Texas at Austin
Keywords: Predictive control for nonlinear systems, Optimization algorithms, Optimal control Abstract: This work proposes a novel Alternating Direction Method of Multipliers (ADMM)-based Ensemble Kalman Inversion (EKI) algorithm for solving constrained nonlinear model predictive control (NMPC) problems. First, stage-wise nonlinear inequality constraints in the NMPC problem are embedded via an augmented Lagrangian with nonnegative slack variables. We then show that the resulting unconstrained augmented-Lagrangian primal subproblem admits a Bayesian interpretation: under independent Gaussian virtual observations, its minimizers coincide with MAP estimators, enabling solution via EKI. However, since the nonnegativity constraint on the slacks is a hard constraint not naturally encoded by a Gaussian model, our proposed algorithm yields a two-block ADMM scheme that alternates between (i) an inexact primal step that minimizes the augmented-Lagrangian objective (implemented via EKI rollouts), (ii) a nonnegativity projection for the slacks, and (iii) a dual ascent step. To balance exploration and convergence, an annealing schedule tempers sampling covariances while a penalty schedule increases constraint enforcement over outer iterations, encouraging global search early and precise constraint satisfaction later. We evaluate the proposed controller on a 6-DOF UR5e manipulation benchmark in MuJoCo, comparing it against DIAL-MPC (an iterative MPPI variant) as the arm traverses a cluttered tabletop environment.

16:15-16:30, Paper FrC20.4	Add to My Program
Personalized MPC for Autonomous Vehicle Lateral Motion Control Via Inverse Reinforcement Learning (I)

Zhou, Zhaodong	Oakland University
Tao, Mingyuan	Isuzu Technical Center of America, Inc
Qiu, Jiayi	Isuzu Technical Center of America, Inc
Zhang, Peng	Isuzu Technical Center of America, Inc
Xu, Meng	Isuzu Technical Center of America, Inc
Sun, Yong	Isuzu Technical Center of America, Inc
Chen, Jun	Oakland University
Keywords: Predictive control for nonlinear systems, Reinforcement learning, Automotive control Abstract: This paper presents a personalized lane change control framework that combines the maximum entropy inverse reinforcement learning (MaxEnt IRL) with model predictive control (MPC). Instead of manually tuning the MPC cost weight, the proposed method learns a cost function from expert driving demonstrations using interpretable trajectory features. The learned weights are incorporated into the MPC formulation to generate personalized lane change trajectories that mimic the expert driving style. Simulation results in CARLA show that the IRL-trained MPC controller can reproduce distinct driving styles and matches expert behaviors in heading, lateral motion, and steering. Furthermore, real world on-road testing on an ISUZU truck demonstrated that the proposed controller generalizes well to unseen initial conditions. Overall, the proposed approach reduces manual calibration efforts, eliminates the need for explicit path planning, improves controller interpretability, and enables personalized driver behavior replication.

16:30-16:45, Paper FrC20.5	Add to My Program
Model Predictive Control with High-Probability Safety Guarantee for Nonlinear Stochastic Systems

Liu, Zishun	Georgia Institute of Technology
Ma, Liqian	Georgia Institute of Technology
Chen, Yongxin	Georgia Institute of Technology
Keywords: Predictive control for nonlinear systems, Stochastic systems, Stochastic optimal control Abstract: We present a model predictive control (MPC) framework for nonlinear stochastic systems that ensures safety guarantee with high probability. Unlike most existing stochastic MPC schemes, our method adopts a set-erosion that converts the probabilistic safety constraint into a tractable deterministic safety constraint on a smaller safe set over deterministic dynamics. As a result, our method is compatible with any off-the-shelf deterministic MPC algorithm. The key to the effectiveness of our method is a tight bound on the stochastic fluctuation of a stochastic trajectory around its nominal version. Our method is scalable and can guarantee safety with high probability level (e.g., 99.99%), making it particularly suitable for safety-critical applications involving complex nonlinear dynamics. Rigorous analysis is conducted to establish a theoretical safety guarantee, and numerical experiments are provided to validate the effectiveness of the proposed MPC method.

16:45-17:00, Paper FrC20.6	Add to My Program
The Elastic Model Predictive Safety Filter

Auricchio, Alberto	University of Bristol
Zhang, Kaiqiang	University of Bristol
Richards, Arthur	University of Bristol
Keywords: Autonomous systems, Predictive control for nonlinear systems, Optimization Abstract: Autonomous systems often operate in multiple operational modes, each with its own set of safety constraints. Safely transitioning between modes, for example, from a high-speed survey mode to a close-inspection mode, presents a significant challenge. This paper introduces the `Elastic' Model Predictive Safety Filter (EPSF), a novel control framework that enables provably safe, dynamic transitions between different operational modes. The EPSF extends the Predictive Safety Filter by incorporating a soft-constrained multimodal Model Predictive Control formulation. In transition between modes, the controller's cost function penalises deviations from the constraints of a desired target mode whilst strictly enforcing the constraints of the currently operational mode. This objective steers the system state towards a region where a transition is feasible. The convergence of slack variables to zero serves as an online, verifiable certificate that this transition region has been reached, guaranteeing recursive feasibility and asymptotic convergence. A convex implementation is provided for linear systems, and simplifications of the sufficient conditions for convergence are discussed. The effectiveness of the EPSF is demonstrated through numerical simulation of an unstable linear system and a nonlinear kinematic car model.


FrC21 Regular Session, Churchill C1	Add to My Program
Sliding Mode Control

Chair: Tomsovic, Kevin	Clemson University
Co-Chair: Drakunov, Sergey V.	Embry-Riddle Aeronautical University

15:30-15:45, Paper FrC21.1	Add to My Program
Feedforward and Sliding Mode Control of a Novel Precision 2-DoF Reluctance Motion System

Pumphrey, Michael Joseph	University of Guelph
Al Saaideh, Mohammad	Memorial University of Newfoundland
Xu, Binyan	Univeristy of Guelph
Alatawneh, Natheer	University of Guelph
Al Janaideh, Mohammad	University of Guelph
Keywords: Mechatronics Abstract: This paper presents the design and implementation of a hybrid sliding mode control (SMC) strategy for a novel two-degree-of-freedom (2-DoF) motion system. The system, driven by a reluctance actuator (RA) for translational motion and two moving magnet actuators (MMAs) for rotational motion, is intended for high-precision applications such as fast-scan mirrors in extreme ultraviolet (EUV) lithography. The nonlinearities of the RA, along with system uncertainties, challenge the controller. To address the dynamics of each axis, a feedforward controller is implemented for the rotational axis, while a robust SMC is developed for the nonlinear translational axis. The SMC is specifically designed to compensate for the RA's strong force-current-position nonlinearity and parametric uncertainties. A smooth hyperbolic tangent function is used to mitigate chattering. The proposed controller is experimentally validated, demonstrating significant improvements in both transient and dynamic tracking performance and validating its suitability for advanced manufacturing and optical applications.

15:45-16:00, Paper FrC21.2	Add to My Program
Stability and Stabilization of Switched Systems with Random Intermittent Information Transmission

Taousser, Fatima	University of Tennessee
Yichao, Wang	University of Tennessee
Djouadi, Seddik, M.	University of Tennessee
Tomsovic, Kevin	Clemson University
Keywords: Stability of linear systems, Stochastic systems, Switched systems Abstract: This paper investigates the almost sure exponential stability of state-feedback control systems operating over stochastic, non-uniform time domains. The system is subject to intermittent signal transmission interruptions and employs a hold-on strategy, whereby the controller retains and applies the most recent control input during communication failures until transmission is restored. The resulting dynamics are modeled as a switched system evolving over stochastic, disjoint time intervals, interspersed with random discrete steps corresponding to held control inputs. Using a stochastic time-scale framework, we derive sufficient conditions that characterize a region of almost sure exponential stability. The proposed results are demonstrated through a leader-follower consensus problem in multi-agent systems with intermittent communication between the leader and followers.

16:00-16:15, Paper FrC21.3	Add to My Program
Follow-The-Ridge Approach Based Sliding-Mode Control for Local Minimax Optimisation

Zenati, Abdelhafid	City Univesity of London
Aouf, Nabil	City University of London
Keywords: Numerical algorithms, Machine learning, Optimization algorithms Abstract: Numerous machine learning tasks today are essentially about identifying equilibrium points of problems modeled as games played in steps. This is particularly true for two-player zero-sum games, which form the basis of engineering problems formulated as minimax optimisation. Traditional algorithms, such as gradient descent (GDA), often encounter difficulties in accurately locating local minimax points, frequently resulting in convergence to non-local-minimax solutions or trapped in limit cycles. This research introduces a groundbreaking algorithm integrating discrit sliding mode control (DSMC) theory-based strategy for minimax optimisation in machine learning that is shifting away from the conventional dependence on intricate second derivatives. The innovative algorithm introduces a refined nested loop structure that not only simplifies the optimisation process but also guarantees provable, uniform convergence to local minimax points, effectively avoiding the common trap of limit cycles.

16:15-16:30, Paper FrC21.4	Add to My Program
Sliding Mode Control with Disjoint Switching Manifolds and Distributed Digital Twin for a Steam Tracing System

D'Amico, Jay	Louisiana Steam Equipment Co
Chintalapati, Siddarth	Louisiana Steam Equipment Company
Fox, Kyle	Louisiana Steam Equipment Company
Kinzie, Ryan	Steam Solutions
O'Brien, Scarlett	Steam Solutions
Shafiyee, Alif Muhammad	Steam Solutions
Drakunov, Sergey V.	Embry-Riddle Aeronautical University
Keywords: Variable-structure/sliding-mode control, Process Control, Distributed control Abstract: This paper presents a novel sliding mode control with disjoint switching manifolds applied to a large-scale industrial steam-tracing system. The proposed control algorithm incorporates a digital twin based on a set of coupled partial differential equations that model heat transfer across multiple layers of piping infrastructure. The digital twin enables realtime data fusion from heterogeneous sensor sources, including thermocouples, fiber-optic temperature sensors, and pressure transducers. This information is used to maintain the product temperature within the desired range. The sliding mode controller governs an array of fast-switching steam valves, ensuring high precision and robustness in the face of ambient temperature fluctuations, adverse weather conditions, insulation degradation, valve malfunctions, and sensor faults.

16:30-16:45, Paper FrC21.5	Add to My Program
Necessary and Sufficient Conditions for Consensus of Binary Input Consensus Protocol

Li, Mingxi	Beihang University
Li, Dongyu	BEIHANG UNIVERSITY
Wang, Hanzhou	Beihang University
Keywords: Variable-structure/sliding-mode control, Robust control, Agents-based systems Abstract: The Binary Input Consensus Protocol (BICP) is a simple approach for multi-agent systems to reach finite-time consensus under disturbances. Existing research on BICP mainly focuses on discrete-time models or continuous-time models under assumptions that guarantee solution uniqueness. However, attacks, interference, or device failures may invalidate these assumptions, leaving it uncertain whether the system can achieve consensus. We establish the necessary and sufficient conditions for the BICP to achieve consensus under arbitrary bounded disturbances, laying a foundation for analyzing redundancy and failure resilience in such systems. Furthermore, we reformulate the necessary and sufficient conditions as a sign determination problem of the optimal value of an optimization problem and present a more intuitive and interpretable sufficient condition.

16:45-17:00, Paper FrC21.6	Add to My Program
Liquid Neural Network-Based Integral Sliding Mode Control of Multirotor UAV under Model Uncertainty and Disturbance

Akhtar, Zainab	University of Engineering & Technology
Ijaz, Salman	University of Nottingham Ningbo China
Ahmed, Qadeer	The Ohio State University
Bhatti, Sidra	OHIO State University
Keywords: Autonomous systems, Lyapunov methods, Uncertain systems Abstract: Reliable estimation of uncertainties and external disturbances is crucial for the autonomous operation of unmanned aerial vehicles (UAVs), as it directly impacts stable and robust flight control. To address this, this paper proposes an integrated Liquid Neural Network (LNN)-based integral sliding mode control approach to attain precise control of a multirotor UAV subject to model uncertainty and disturbance. The LNN is designed to estimate model uncertainties and disturbances, thereby providing robustness beyond conventional neural networks. These estimates are subsequently integrated into an integral sliding mode controller, ensuring accurate trajectory tracking under such effects. The simulation results demonstrate that the proposed LNN-based estimator achieves faster convergence, precise uncertainty and disturbance estimation, and improved trajectory tracking performance compared to baseline methods.


FrC22 Regular Session, Churchill C2	Add to My Program
Distributed Parameter Systems

Chair: Cui, Leilei	University of New Mexico
Co-Chair: Peet, Matthew M.	Arizona State University

15:30-15:45, Paper FrC22.1	Add to My Program
Deterministic Learning-Based Spatiotemporal Dynamics Identification and Estimation of Nonlinear Uncertain Parabolic PDE Systems

Zhang, Jingting	University of Electronic Science and Technology of China
Cheng, Hong	University of Electronic Science and Technology of China
Keywords: Nonlinear systems identification, Distributed parameter systems, Machine learning Abstract: This paper investigates the dynamics identification and estimation problems for a class of parabolic partial differential equation (PDE) systems with uncertain nonlinear spatiotemporal dynamics. A key novelty of the proposed approach is to accurately estimate and identify the small dynamics-change by distinguishing it from the dynamics-uncertainty in the time and space scales. This can facilitate the design of, e.g., precise control and fault diagnosis, for the highly-uncertain PDE systems. Specifically, a deterministic learning-based spatiotemporal dynamics identification scheme is first proposed using the radial basis function neural network (RBF NN) for the PDE system's uncertain dynamics. Notably, it exhibits a distinctive capability of guaranteeing the satisfaction of the partial persistent excitation condition, and the optimal convergence of the NN weights. This enables the learned knowledge to be obtained and stored with a convergent RBF NN model, to provide accurate approximation of the system's spatiotemporal dynamics without needing parameter adaptation. A knowledge-based spatiotemporal dynamics estimator is then designed with this convergent RBF NN model, which can provide accurate detection/estimation of the system small dynamics change by distinguishing it from the system uncertainty. Rigorous simulation study on a typical transport-reaction process is performed to validate the effectiveness of our approach, as well as the applicability to practical applications, e.g., fault diagnosis.

15:45-16:00, Paper FrC22.2	Add to My Program
Distributed Koopman Learning Using Partial Trajectories for Control

Hao, Wenjian	Purdue University
Lu, Zehui	Independent Researcher
Upadhyay, Devesh	Saab
Mou, Shaoshuai	Purdue University
Keywords: Distributed parameter systems, Learning Abstract: This paper proposes a distributed data-driven framework for dynamics learning, termed distributed deep Koopman learning using partial trajectories (DDKL-PT). In this framework, each agent in a multi-agent system is assigned a partial trajectory offline and locally approximates the unknown dynamics using a deep neural network within the Koopman operator framework. By exchanging local estimated dynamics rather than training data, agents achieve consensus on a global dynamics model without sharing their private training trajectories. Simulation studies on a surface vehicle demonstrate that DDKL-PT achieves consensus on the learned dynamics, and each agent attains reasonably small approximation errors on the testing dataset. Furthermore, a model predictive control scheme is developed by integrating the learned Koopman dynamics with known kinematic relations. Results on a reference-tracking task indicate that the distributedly learned dynamics are sufficiently accurate for model-based optimal control.

16:00-16:15, Paper FrC22.3	Add to My Program
Parameterization of Seed Functions for Equivalent Representations of Time-Varying Delay Systems

Kisole, Sengiyumva	Arizona State University
Chun, Jungbae	University of Michigan, Ann Arbor
Seiler, Peter	University of Michigan, Ann Arbor
Peet, Matthew M.	Arizona State University
Keywords: Delay systems, Distributed parameter systems, Linear parameter-varying systems Abstract: Abel's classic transformation shows that any well-posed system with time-varying delay is equivalent to a parameter-varying system with fixed delay. The existence of such a parameter-varying constant delay representation then simplifies the problems of stability analysis and optimal control. Unfortunately, the method for construction of such transformations has been ad-hoc -- requiring an iterative time-stepping approach to constructing the transformation beginning with a seed function subject to boundary-value constraints. Moreover, a poor choice of seed function often results in a constant delay representation with large time-variations in system parameters -- obviating the benefits of such a representation. In this paper, we show how the set of all feasible seed functions can be parameterized using a basis for L_2. This parameterization is then used to search for seed functions for which the corresponding time-transformation results in smaller parameter variation. The parameterization of admissible seed functions is illustrated with numerical examples that contrast how well-chosen and poorly chosen seed functions affect the boundedness of a time transformation.

16:15-16:30, Paper FrC22.4	Add to My Program
Data-Driven Operator Policy Iteration with an Application to 1-D Wave Equations

Yin, Zhun	New York University
Jiang, Zhong-Ping	New York University
Keywords: Distributed parameter systems, Optimal control, Reinforcement learning Abstract: This paper investigates data-driven policy iteration (PI) for a broad class of infinite-dimensional linear systems in which both the system coefficients and those of the input operator are unknown. By leveraging semigroup theory, the method is formulated in a general operator-theoretic setting. Within this framework, we introduce the operator persistent excitation (OPE) condition, under which a sharp convergence rate is established. Furthermore, we also derive a necessary condition for the OPE property. To illustrate the applicability of the approach, we consider a 1-d wave equation with constant viscous damping. Simulation results demonstrate that the proposed method can significantly improve control performance relying solely on an offline sampled dataset.

16:30-16:45, Paper FrC22.5	Add to My Program
A System Level Approach to LQR Control of the Diffusion Equation

McCurdy, Addie	University of Colorado Boulder
Gusty, Andrew	University of Colorado
Jensen, Emily	University of Colorado, Boulder
Keywords: Distributed parameter systems, Optimal control Abstract: The optimal controller design problem for a linear, first-order spatially-invariant distributed parameter system is considered. Through a case study of the Linear Quadratic Regulator (LQR) problem for the diffusion equation over the torus, it is illustrated that the optimal controller design problem can be equivalently formulated as an optimization problem over the system's closed-loop mappings, analogous to the System Level Synthesis framework. This reformulation is solved analytically to recover the LQR for the diffusion equation, and an internally stabilizing implementation of this controller is recovered from the optimal closed-loop mappings. It is further demonstrated that a class of spatio-temporal constraints on the closed-loop maps can be imposed on this closed-loop formulation while preserving convexity.

Technical Program for Friday May 29, 2026