| | |
Last updated on May 26, 2026. This conference program is tentative and subject to change
Technical Program for Friday May 29, 2026
| |
| FrAR01 RI Session, Grand Ballroom C |
Add to My Program |
| Adaptive Control |
|
| |
| Chair: Panagou, Dimitra | University of Michigan, Ann Arbor |
| Co-Chair: Yang, Yu | John A. Paulson School of Engineering and Applied Sciences, Harvard University |
| |
| 10:00-10:03, Paper FrAR01.1 | Add to My Program |
| Sequence Aware Soft Actor-Critic Control Agents for Series Electrified Powertrain |
|
| Jaleel, Wafeeq | The Ohio State University |
| Rownak, Md Ragib | The Ohio State University |
| Hanif, Athar | The Ohio State University |
| Bhatti, Sidra | The Ohio State University |
| Ahmed, Qadeer | The Ohio State University |
Keywords: Adaptive control, Automotive systems, Reinforcement learning
Abstract: As Hybrid Electric Vehicles (HEVs) gain traction in heavy-duty trucks, adaptive energy management is critical for minimizing fuel consumption and maintaining battery charge over long operations. To address this, we present a reinforcement learning (RL) framework for engine control in Series HEVs, based on Soft Actor-Critic (SAC) framed as a sequential decision-making problem. Specifically, standard Feedforward networks (FFNs) in the actor and critic are replaced with temporal models, Gated Recurrent Units (GRUs), and Decision Transformers (DTs), to better capture sequential dependencies. Through comprehensive ablation studies, we identify two high-performing and well-generalized sequence-aware agent configurations: a DT actor with a GRU critic (DT-GRU), and a GRU actor with a GRU critic (GRUGRU). Validation compared these SAC agents against Dynamic Programming (DP) across two training setups: one trained on a single standard cycle (HFET), and another using a wider range of initial SOCs, drive durations, and power demands. On the training cycle (HFET), the DT-GRU agent achieved fuel performance within 1.8% of DP, outperforming the GRU-GRU agent, and the FFN-FFN agent at 3.16% and 3.43%, respectively. The unseen highway test cycles, one with significantly higher power demand (US06), while another with extended operation time (HHDDT), compared to the training cycle, allowed us to assess robustness. Both sequence-aware agents showed better generalization than the FFN-FFN agent across these challenging conditions. Notably, agents trained with broader initial conditions not only maintained strong fuel economy but also satisfied critical final constraints.
|
| |
| 10:03-10:06, Paper FrAR01.2 | Add to My Program |
| Modeling Adaptive Tracking of Predictable Stimuli in Electric Fish |
|
| Yang, Yu | John A. Paulson School of Engineering and Applied Sciences, Harvard University |
| Oliveira, Andreas | Northeastern University |
| Whitcomb, Louis L. | Johns Hopkins University |
| Pait, Felipe | Univ. Sao Paulo |
| Sznaier, Mario | Northeastern University |
| Cowan, Noah J. | Johns Hopkins University |
|
|
| |
| 10:06-10:09, Paper FrAR01.3 | Add to My Program |
| Realizing Optimal Bidding and Budget Allocation with Constraints in Programmatic Advertising |
|
| Hafner, William | The Trade Desk |
| Schaefer, Alexander | The Trade Desk |
| Amin, Victor | The Trade Desk |
| Ahuja, Sunil | The Trade Desk |
Keywords: Adaptive control, Control applications, Constrained control
Abstract: Maximizing advertiser value under budget constraints is a central challenge in programmatic advertising. This paper presents a unified framework that addresses optimal bidding, budget allocation, and real-time spend control in campaigns with multiple ad groups, each subject to individual spend constraints. We first derive the structure of an optimal bidding policy that maximizes expected return on investment in first-price auctions. Building on this, we introduce an algorithm for allocating campaign budgets optimally across ad groups in real time, ensuring compliance with both campaign-level and ad group-level constraints. Finally, we propose a feedback-control system that dynamically adjusts bidding parameters to achieve and maintain optimal spend distribution under changing market conditions. Simulations demonstrate convergence to optimal allocations and robust behavior in both constrained and unconstrained scenarios. By designing the system with dependence only on practically estimable quantities, this work bridges the gap between theoretically optimal bidding strategies, and real-world real-time bidding applicability.
|
| |
| 10:09-10:12, Paper FrAR01.4 | Add to My Program |
| Computing Control Directions for Discrete-Time Systems with Disturbances |
|
| Mazenc, Frederic | Inria Saclay |
| Malisoff, Michael | Louisiana State University |
Keywords: Adaptive control, Discrete event systems, Stability of nonlinear systems
Abstract: We study a general class of nonlinear discrete-time systems containing disturbances in the dynamics and in the available output measurements, and unknown control directions. Under suitable conditions on the coefficient matrices in the linear portion of the dynamics and on a coefficient matrix in the output measurement, and under an observability condition, we provide a new method to identify the unknown control direction in finite time. Our method is amenable to systems whose linear portions contain uncertain coefficients. We illustrate our method using a discrete-time chain of integrators.
|
| |
| 10:12-10:15, Paper FrAR01.5 | Add to My Program |
| Adaptive Control Allocation for Underactuated Time-Scale Separated Non-Affine Systems |
|
| Cherenson, Daniel | University of Michigan |
| Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Adaptive control, Flight control
Abstract: Many robotic systems are underactuated, meaning not all degrees of freedom can be directly controlled due to lack of actuators, input constraints, or state-dependent actuation. This property, compounded by modeling uncertainties and disturbances, complicates the control design process for trajectory tracking. In this work, we propose an adaptive control architecture for uncertain, nonlinear, underactuated systems with input constraints. Leveraging time-scale separation, we construct a reduced-order model where fast dynamics provide virtual inputs to the slower subsystem and use dynamic control allocation to select the optimal control inputs given the non-affine dynamics. To handle uncertainty, we introduce a state predictor-based adaptive law, and through singular perturbation theory and Lyapunov analysis, we prove stability and bounded tracking of reference trajectories. The proposed method is validated on a vertical takeoff and landing (VTOL) quadplane with nonlinear, state-dependent actuation, demonstrating its utility as a unified controller across various flight regimes, including cruise, landing transition, and hover.
|
| |
| 10:15-10:18, Paper FrAR01.6 | Add to My Program |
| Nested Extremum Seeking Converges to Nash Equilibrium |
|
| Ratto, Brad | University of California San Diego |
| Williams, Alan | Los Alamos National Laboratory |
| Krstic, Miroslav | University of California, San Diego |
| Scheinker, Alexander | Los Alamos National Lab |
Keywords: Adaptive control, Game theory, Optimization
Abstract: The nested Extremum Seeking (nES) algorithm is a model-free optimization method for tuning system parameters across multiple objective functions using bounded extremum seeking. Using weak-limit averaging and singular perturbation techniques, we establish a rigorous stability result proving semiglobal practical convergence to a Nash equilibrium. These results suggest that nES provides a principled approach for feedback-based tuning of coupled objectives, including settings which admit a game-theoretic interpretation.
|
| |
| 10:18-10:21, Paper FrAR01.7 | Add to My Program |
| On Online Control of Opinion Dynamics |
|
| Paul, Sheryl | University of Southern California |
| Cruz Juarez, Leslie Paola | University of Southern California |
| Deshmukh, Jyotirmoy | University of Southern California |
| Savla, Ketan | University of Southern California |
Keywords: Adaptive control, Identification for control, Networked control systems
Abstract: Networked multi-agent dynamical systems have been used to model how individual opinions evolve over time due to the opinions of other agents in the network. Particularly, such a model has been used to study how a planning agent can be used to steer opinions in a desired direction through repeated, budgeted interventions. In this paper, we consider the problem where individuals’ susceptibilities to external influences are unknown. We propose an online algorithm that alternates between estimating this susceptibility parameter, and using the current estimate to drive the opinion to a desired target. We provide conditions that guarantee stability and convergence to the desired target opinion when the planning agent faces budgetary or temporal constraints. Our analysis shows that the key advantage of estimating the susceptibility parameter is that it helps achieve near-optimal convergence to the target opinion given a finite amount of intervention rounds, and, for a given intervention budget, quantifies how close the opinion can get to the desired target.
|
| |
| 10:21-10:24, Paper FrAR01.8 | Add to My Program |
| Concurrent Learning for System Identification and Control Using Lyapunov-Based Deep Neural Networks |
|
| Hart, Rebecca | University of Florida |
| Patil, Omkar Sudhir | University of Florida |
| Bell, Zachary I. | Air Force Research Laboratory |
| Dixon, Warren E. | University of Florida |
Keywords: Adaptive control, Identification for control, Neural networks
Abstract: This paper presents the first result that enables the use of a concurrent learning (CL) adaptation law for the weights of all the layers of the DNN-based controller applied to a class of second-order control-affine systems. The developed CL-based adaptation achieves convergence of the DNN’s parameter estimates to a neighborhood of their ideal values, provided the DNN’s Jacobian satisfies a finite excitation condition. A Lyapunov-based stability analysis is conducted to ensure convergence of the tracking error, weight estimation errors, and observer errors to a neighborhood of the origin.
|
| |
| 10:24-10:27, Paper FrAR01.9 | Add to My Program |
| Online Policy Iterations for Continuous-Time Optimal Tracking Problems Using Single Network Adaptive Critic |
|
| Engelhardt, Randal | California State University Northridge |
| Sardarmehni, Tohid | California State University Northridge |
Keywords: Adaptive control, Intelligent systems, Reinforcement learning
Abstract: This paper extends the single network adaptive critic structure, initially developed for near-optimal control of discrete-time systems with offline training, to an online continuous-time setting. A novel gradient-descent-based training law is proposed, and its exponential convergence is established under the persistency of excitation assumption. The stability of the closed-loop system is analyzed using Lyapunov’s direct method, and the uniformly ultimately bounded nature of the closed-loop tracking error is proved. Numerical simulations on both tracking and regulation tasks demonstrate the effectiveness of the proposed approach.
|
| |
| 10:27-10:30, Paper FrAR01.10 | Add to My Program |
| Bearing-Only Solution to the Fermat-Weber Location Problem for Euler-Lagrange Systems |
|
| Cheah, Hong Liang | UNSW |
| Deghat, Mohammad | University of New South Wales |
Keywords: Adaptive control, Optimization, Robust adaptive control
Abstract: This paper studies the bearing-only Fermat–Weber Location Problem (FWLP) for Euler–Lagrange systems. The objective is to design bearing-only control laws that guide an autonomous robot to the point that minimizes the weighted sum of Euclidean distances to a set of beacons, known as the Fermat–Weber point. In contrast to previous studies that focus on simplified models such as single- or double-integrator dynamics, this work considers Euler–Lagrange systems, which offer a more realistic representation of physical robot dynamics. The physical parameters of the robot are considered unknown. For stationary beacons, we develop a bearing-only control law using an adaptive backstepping method to steer the robot to the Fermat–Weber point. We then extend the analysis to a case in which the beacons move with a constant velocity, and the dynamics of the robot is affected by disturbances. To address this scenario, we propose a novel robust adaptive bearing-only control law that allows the robot to track the moving Fermat–Weber point. Both proposed control laws guarantee asymptotic convergence, and simulation results are provided to validate the theoretical findings.
|
| |
| 10:30-10:33, Paper FrAR01.11 | Add to My Program |
| Collaborative Indirect Influencing and Control on Graphs Using Graph Neural Networks |
|
| Gardenswartz, Max L. | University of Florida |
| Fallin, Brandon C. | University of Florida |
| Nino, Cristian F. | Florida Institute for Human and Machine Cognition |
| Dixon, Warren E. | University of Florida |
Keywords: Adaptive control, Neural networks, Stability of nonlinear systems
Abstract: This paper presents a novel approach to solving the indirect influence problem in networked systems, in which cooperative nodes must regulate a target node with uncertain dynamics to follow a desired trajectory. We leverage the message-passing structure of a graph neural network (GNN), allowing nodes to collectively learn the unknown target dynamics in real time. We develop a novel GNN-based backstepping control strategy with formal stability guarantees derived from a Lyapunov-based analysis. Numerical simulations are included to demonstrate the performance of the developed controller.
|
| |
| 10:33-10:36, Paper FrAR01.12 | Add to My Program |
| Event-Driven Safe and Resilient Control of Automated and Human-Driven Vehicles under EU-FDI Attacks |
|
| Zhang, Yi | University of Connecticut |
| Wang, Yichao | University of Connecticut |
| Xiao, Wei | WPI |
| Rajabinezhad, Mohamadamin | University of Connecticut (UCONN) |
| Zuo, Shan | University of Connecticut |
Keywords: Adaptive control, Optimization, Cooperative control
Abstract: This paper studies the safe and resilient control of Connected and Automated Vehicles (CAVs) operating in mixed traffic environments where they must interact with Human-Driven Vehicles (HDVs) under uncertain dynamics and exponentially unbounded false data injection (EU-FDI) attacks. These attacks pose serious threats to safety-critical applications. While resilient control strategies can mitigate adversarial effects, they often overlook collision avoidance requirements. Conversely, safety-critical approaches tend to assume nominal operating conditions and lack resilience to adversarial inputs. To address these challenges, we propose an event-driven safe and resilient (EDSR) control framework that integrates event-driven Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) with adaptive attack-resilient control. The framework further incorporates data-driven estimation of HDV behaviors to ensure safety and resilience against EU-FDI attacks. Specifically, we focus on the lane-changing maneuver of CAVs in the presence of unpredictable HDVs and EU-FDI attacks on acceleration inputs. The event-driven approach reduces computational load while maintaining real-time safety guarantees. Simulation results, including comparisons with conventional safety-critical control methods that lack resilience, validate the effectiveness and robustness of the proposed EDSR framework in achieving collision-free maneuvers, stable velocity regulation, and resilient operation under adversarial conditions.
|
| |
| 10:36-10:39, Paper FrAR01.13 | Add to My Program |
| Adaptive Control and Control Allocation of Uncertain Over-Actuated Systems with Nonlinear Reference Models |
|
| Bray, Andrew | Embry-Riddle Aeronautical University |
| Sarioglu, Eren | Embry-Riddle Aeronautical University |
| Dogan, K. Merve | Embry-Riddle Aeronautical University |
Keywords: Adaptive systems, Uncertain systems, Lyapunov methods
Abstract: The presence of system anomalies, such as modeling uncertainties and unknown actuator degradation, weakens the performance of controlled systems when not properly compensated. Thus, in this paper, for improved closed-loop performance, a novel high-performance model reference adaptive controller, along with an adaptive control allocation method, is designed for uncertain over-actuated nonlinear dynamical systems in the presence of unknown actuator degradation. A nonlinear reference model is used to track the considered dynamical system, since standard linear reference models limit the closed-loop performance of the nonlinear system. A Lyapunov stability analysis is then used to show the asymptotic convergence of the tracking error between the closed-loop system and the nonlinear reference model. Simulations are also conducted to show the benefits of the proposed method.
|
| |
| 10:39-10:42, Paper FrAR01.14 | Add to My Program |
| Sensor-Noise Mitigation in Extremum Seeking Control Using Adaptive Numerical Differentiation |
|
| Verma, Shashank | University of Michigan |
| Paredes Salazar, Juan Augusto | University of Maryland, Baltimore County |
| Portella Delgado, Jhon Manuel | University of Maryland Baltimore County |
| Goel, Ankit | University of Maryland Baltimore County |
| Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Adaptive control, Optimization algorithms, Estimation
Abstract: Extremum-seeking control (ESC) is widely used to optimize performance when the system dynamics are uncertain. However, sensitivity to sensor noise is a crucial issue in ESC implementation due to the use of high-pass filters or gradient estimators. To reduce the sensitivity of ESC to noise, this paper investigates the use of the recently developed adaptive input and state estimation (AISE) technique for numerical differentiation. In particular, this paper develops extremum-seeking control with adaptive input and state estimation (ESC/AISE), where AISE replaces the high-pass filter of ESC to improve performance under sensor noise. The effectiveness of ESC/AISE is illustrated via numerical examples.
|
| |
| 10:42-10:45, Paper FrAR01.15 | Add to My Program |
| Online Model Discrimination Using Dual Model Predictive Path Integral Control |
|
| Purohit, Vasudev Asheesh Kumar | Clemson University International Center for Automotive Research |
| Zhu, Qilun | Clemson University, CU-ICAR |
| Prucka, Robert | Clemson University - International Center ForAutomotiveResearch |
| Castanier, Matthew | US Army DEVCOM Ground Vehicle Systems Center |
| Figueroa-Santos, Miriam, A | GVSC |
| Barron, Morgan | Ground Vehicle System Center |
Keywords: Adaptive control, Stochastic optimal control, Automotive control
Abstract: Accurate predictive control requires not only reliable parameter estimates but also correct models of system dynamics. In many real world applications, multiple candidate models can describe the system equally well, and conventional controllers typically rely on passive adaptation that fails to actively reduce such model uncertainty. This paper presents a Dual Model Predictive Path Integral (dual-MPPI) controller for online model discrimination, enabling autonomous systems to both act and learn under competing model hypotheses. By combining sampling-based rollout evaluation with Bayesian updates of model probabilities and parameter beliefs, the proposed method balances exploration and exploitation during control. Effectiveness is demonstrated on a simulated off-road vehicle navigating terrains represented by two distinct tire-terrain interaction models. Compared against a certainty-equivalent MPPI baseline, dual-MPPI proactively issues exploratory actions that accelerate model discrimination while maintaining safety, resulting in faster convergence of model probabilities and improved vehicle stability in dynamic maneuvers. These result highlight the potential of dual-MPPI to address both parametric and model uncertainty in safety-critical autonomous systems.
|
| |
| 10:45-10:48, Paper FrAR01.16 | Add to My Program |
| A Switched Adaptive Control Approach to Reduce Sensing Needs in Trajectory Tracking Problems |
|
| Qureshi, Muzaffar | University of Florida |
| Ogri, Tochukwu Elijah | University of Florida |
| Ramos, J. Humberto | University of Florida |
| Makumi, Wanjiku A. | Air Force Research Laboratory |
| Bell, Zachary I. | Air Force Research Laboratory |
| Kamalapurkar, Rushikesh | University of Florida |
Keywords: Adaptive control, Switched systems, Observers for nonlinear systems
Abstract: This letter considers an autonomous agent that intermittently acquires state measurements to maintain trajectory tracking performance. The objective is to minimize sensing needs using extended periods of sensor-denied operation. A Lyapunov-based adaptive switched systems approach is developed, where the agent uses the intermittently acquired state measurements to learn the system model. The learned system models are then used during sensor-denied intervals to extend their length while maintaining tracking performance. The design uses a modeling error-dependent bound on the duration of the sensor-denied intervals to progressively reduce sensing needs as the modeling error decreases. A Lyapunov-based analysis is used to show the stability of the closed-loop system, and the effectiveness of the developed technique is verified through a simulation study.
|
| |
| 10:48-10:51, Paper FrAR01.17 | Add to My Program |
| Computing Control Directions of Continuous-Time Nonlinear Systems with Disturbances |
|
| Mazenc, Frederic | Inria Saclay |
| Malisoff, Michael | Louisiana State University |
Keywords: Adaptive control, Uncertain systems, Time-varying systems
Abstract: We provide a new method to compute signs of unknown control directions for a general class of time-varying nonlinear systems that contain unknown disturbances in the available output and in the dynamics. Our relation between the output and a matrix factor in the control effectiveness matrix guarantees that the sign can be computed in an arbitrarily small time. Our method yields bounds on the allowable uncertainty in the coefficient matrices. We illustrate our method using a nonlinear perturbed generalization of a chain of integrators.
|
| |
| 10:51-10:54, Paper FrAR01.18 | Add to My Program |
| Discrete-Time Model Reference Adaptive Control of Uncertain Fully-Actuated Systems in the Presence of Unknown Control Effectiveness |
|
| Sisson, Nathaniel B. | Embry-Riddle Aeronautical University |
| Vongkunghae, Thitiphun | Embry-Riddle Aeronautical University |
| Dogan, K. Merve | Embry-Riddle Aeronautical University |
Keywords: Adaptive systems, Direct adaptive control, Uncertain systems
Abstract: In this work, we propose a discrete-time adaptive control architecture for uncertain fully-actuated systems in the presence of unknown control effectiveness, modeled by a diagonal matrix that multiplies the control input vector in the dynamical model. To establish boundedness of the closed-loop system and convergence of the asymptotic tracking error, a Lyapunov stability analysis is conducted. Moreover, simulation results demonstrate the efficacy of the proposed method.
|
| |
| FrAR02 RI Session, Grand Ballroom D |
Add to My Program |
| Estimation |
|
| |
| Chair: Kumar, Manish | University of Cincinnati |
| Co-Chair: Khamvilai, Thanakorn | Texas Tech University |
| |
| 10:00-10:03, Paper FrAR02.1 | Add to My Program |
| Tobit Kalman Filter and Neuro-Dynamic Inversion Adaptive Control for GNSS/INS Navigation under Censored Measurements |
|
| Jetawatthana, Sarocha | Texas Tech University |
| Khamvilai, Thanakorn | Texas Tech University |
Keywords: Estimation, Adaptive control, Neural networks
Abstract: We present a novel formulation of the Tobit Kalman Filter (TKF) for an inertial navigation problem with censored GNSS measurements. Our formulation utilizes the IMU measurement for the propagation step, and the last known GNSS position before censorship as a measurement for the update step. To calculate the innovation and the Kalman gain under censored measurements, we derive new formulas for the predicted measurement and the covariance terms using likelihood functions based on a statistical Tobit model. Therefore, TKF allows for state estimation despite the stagnant measurements. Additionally, due to the coupling between the navigation and control problems, censored measurements contribute to the increased uncertainty of the overall system dynamics. From the control perspective, the uncertainty can be handled by an adaptive control technique. Hence, we integrate a dynamic inversion adaptive controller to complete the system autonomy. We use fixed-wing airplane dynamics with a neural network (NN) as an adaptive element. The NN weights are updated online using a law derived from the Lyapunov theorem. The proposed approach is evaluated using a UAV flight simulator. The result confirms that this approach makes navigation and control in a censored measurement environment feasible.
|
| |
| 10:03-10:06, Paper FrAR02.2 | Add to My Program |
| Constrained Variational Inference Via Safe Particle Flow |
|
| Yi, Yinzhuang | University of California, San Diego |
| Cortes, Jorge | UC San Diego |
| Atanasov, Nikolay | University of California, San Diego |
Keywords: Estimation, Constrained control, Variational methods
Abstract: We propose a control barrier function (CBF) formulation for enforcing equality and inequality constraints in variational inference. The key idea is to define a barrier functional on the space of probability density functions that encode the desired constraints imposed on the variational density. By leveraging the Liouville equation, we establish a connection between the time derivative of the variational density and the particle drift, which enables the systematic construction of corresponding CBFs associated to the particle drift. Enforcing these CBFs gives rise to the safe particle flow and ensures that the variational density satisfies the original constraints imposed by the barrier functional. This formulation provides a principled and computationally tractable solution to constrained variational inference, with theoretical guarantees of constraint satisfaction. The effectiveness of the method is demonstrated through numerical simulations.
|
| |
| 10:06-10:09, Paper FrAR02.3 | Add to My Program |
| Distributed and Consistent Multi-Robot Visual-Inertial-Ranging Odometry on Lie Groups |
|
| Kang, ZiWei | North China Electric Power University |
| Zhou, Yizhi | Geroge Mason University |
Keywords: Estimation, Distributed control, Autonomous robots
Abstract: Reliable localization in GPS-denied environments remains challenging for multi-robot systems. Although visual–inertial odometry (VIO) provides accurate motion estimation, it suffers from drift without global references. Ultra-Wideband (UWB) ranging offers complementary constraints, but existing UWB-aided VIO methods typically assume single-robot operation and known anchor locations. We propose a distributed collaborative visual–inertial–ranging odometry (DC-VIRO) framework that tightly fuses VIO and UWB across multiple robots. Anchor states are jointly estimated, and shared observations introduce additional geometric constraints through inter-robot communication. A right-invariant formulation preserves VIO unobservable directions, ensuring consistency. Simulations show improved localization accuracy, robustness, and distributed anchor self-calibration.
|
| |
| 10:09-10:12, Paper FrAR02.4 | Add to My Program |
| GRU-Based Estimation Model for Bearings-Only Tracking |
|
| Huang, Xu | University of Missouri |
| Feng, Yuan | University of Missouri |
| Zhang, Yang | University of Missouri |
| Xin, Ming | University of Missouri |
Keywords: Estimation, Filtering, Machine learning
Abstract: Target tracking using bearings-only information is a well-known challenge, primarily attributed to the system's poor observability and the highly nonlinear nature of the estimation process. In this paper, a deep learning framework based on the Gated Recurrent Unit (GRU) architecture is proposed to address this bearings-only tracking problem. Different from conventional nonlinear filters that rely on explicit system models, the proposed data-driven method learns to estimate the target state directly from sequential noisy bearing observations. A comprehensive comparative study is conducted to evaluate the proposed model against other prominent deep learning architectures, including the Transformer, Long Short-Term Memory (LSTM), and Convolution Neural Network (CNN), as well as traditional nonlinear filters. Simulation results demonstrate that the proposed GRU-based tracker provides the most accurate position estimates in a nominal scenario. In a challenging scenario with high uncertainty, the GRU model attains the best balance among position and velocity estimation accuracies and computation efficiency. This highlights the strong robustness and stability of the GRU, and its great potential for applications in modern passive tracking systems.
|
| |
| 10:12-10:15, Paper FrAR02.5 | Add to My Program |
| Delayed Norm-Constrained Augmented Kalman Filter: A Selector-Matrix Approach for Quaternion Norm Adherence |
|
| Mozaffari, Hamed | Southern Illinois University of Edwardsville |
| Dabiri, Arman | Southern Illinois University Edwardsville |
Keywords: Estimation, Kalman filtering, Delay systems
Abstract: This paper presents a Delayed Constrained Extended Kalman Filter (DC-EKF) that jointly addresses measurement delays and quaternion norm preservation. Existing constrained filters typically handle delays or norm constraints separately, degrading accuracy in multi-sensor fusion. The proposed method reformulates the delayed system as an augmented, delay-free model in which each quaternion state satisfies the unit-norm constraint. Multiple selector matrices are employed to decompose the multi-constraint optimization into independent components, yielding a closed-form Kalman gain via Lagrange multipliers. This approach avoids conventional partially constrained filters' partitioning and reassembly steps while maintaining computational efficiency. Simulation studies of rigid-body motion demonstrate that the DC-EKF achieves faster convergence, lower estimation variance, and stricter quaternion norm adherence compared to the EKF, pseudo-measurement EKF, and augmented constrained EKF. On average, settling times are reduced by 50--90% and steady-state errors by 70--90%, while quaternion norms remain within 10^{-6} of unity. These results highlight DC-EKF as an effective and practical solution for accurate pose estimation with delayed measurements.
|
| |
| 10:15-10:18, Paper FrAR02.6 | Add to My Program |
| Natural Gradient Gaussian Approximation Filter with Positive Definiteness Guarantee |
|
| Zhang, Tianyi | Tsinghua University |
| Cao, Wenhan | Tsinghua University |
| Li, Shengbo Eben | Tsinghua University |
Keywords: Estimation, Kalman filtering, Machine learning
Abstract: Popular Bayes filters often apply linearization techniques, such as Taylor expansion or stochastic linear regression, to enable the use of the Kalman filter structure, but this can lead to large errors in strongly nonlinear systems. The recently proposed NANO filter addresses this issue by interpreting the prediction and update steps of Bayesian filtering as two distinct optimization problems and solving them through moment matching and natural gradient descent, thereby avoiding model linearization errors. However, the natural gradient update in NANO can occasionally diverge because the posterior covariance in its iteration may lose positive definiteness. Our analysis shows that the posterior covariance is the sum of the inverse prior covariance and the expected Hessian of the log-likelihood function, and that the indefiniteness of the latter term is the root cause of update failure. To address this issue, we propose two remedies. The first approximates the log-likelihood Hessian using the Gauss–Newton method, representing it as the self-adjoint product of the Jacobian of the normalized measurement residual, which is guaranteed to be positive semi-definite. The second reformulates the covariance update as an exponential-form update of the Cholesky factor and reconstructs the covariance via its Gram matrix, which ensures positive definiteness. Experiments on three classical nonlinear systems demonstrate that the proposed NANO filter with guaranteed positive definiteness outperforms popular members of the Kalman filter family and original NANO filter.
|
| |
| 10:18-10:21, Paper FrAR02.7 | Add to My Program |
| Linear Quadratic Control for Discrete-Time Systems with Stochastic and Bounded Noises |
|
| Ma, Xuehui | Xi'an University of Technology |
| Zhang, Shiliang | University of Oslo |
| Zhang, Xiaohui | Xi’an University of Technology |
| Xin, Jing | Xi'an University of Technology |
| Garcia de Marina, Hector | Universidad De Granada |
Keywords: Estimation, Kalman filtering, Robust control
Abstract: This paper focuses on the linear quadratic control (LQC) design of systems corrupted by both stochastic noise and bounded noise simultaneously. When only one type of these noises is considered, the LQC strategy leads to either a stochastic or a robust controller. However, there is no LQC strategy that can simultaneously handle stochastic and bounded noises efficiently. This limits the scope where existing LQC strategies can be applied. In this work, we look into the LQC problem for discrete-time systems that have both stochastic and bounded noises in their dynamics. We develop a state estimation method for such systems by efficiently combining a Kalman filter and an ellipsoid set-membership filter. The developed estimator can recover the estimation optimality when the system is subject to both kinds of noise, the stochastic and the bounded. Leveraging the developed state estimation, we derive a robust state-feedback control law for the LQC problem. The control law derivation takes into account both stochastic and bounded-state estimation errors, so as to avoid over-conservativeness while sustaining stability in the control. In this way, the developed LQC strategy extends the range of scenarios where LQC can be applied, especially those of real-world control systems with diverse sensing which are subject to different kinds of noise. We present numerical simulations, and the results demonstrate the enhanced control performance with the proposed strategy.
|
| |
| 10:21-10:24, Paper FrAR02.8 | Add to My Program |
| Pose Estimation of a Thruster-Driven Bioinspired Multi-Link Robot |
|
| Andrews, Nicholas B. | University of Washington |
| Yang, Yanhao | Oregon State University |
| Akhetova, Sofya | University of Washington |
| Morgansen, Kristi A. | University of Washington |
| Hatton, Ross L. | Oregon State University |
Keywords: Estimation, Mechanical systems/robotics, Nonholonomic systems
Abstract: This work demonstrates simultaneous pose (position and orientation) and shape estimation for a free-floating, bioinspired multi-link robot with unactuated joints, link-mounted thrusters for control, and a single gyroscope per link, resulting in an underactuated, minimally sensed platform. Because the inter-link joint angles are constrained, translation and rotation of the multi-link system requires cyclic, reciprocating actuation of the thrusters, referred to as a gait. Through a proof-of-concept hardware experiment and offline analysis, we show that the robot's shape can be reliably estimated using an Unscented Kalman Filter augmented with Gaussian process residual models to compensate for non-zero-mean, non-Gaussian noise, while the pose exhibits drift expected from gyroscope integration in the absence of absolute position measurements. Experimental results demonstrate that a Gaussian process model trained on a multi-gait dataset (forward, backward, left, right, and turning) performs comparably to one trained exclusively on forward-gait data, revealing an overlap in the gait input space, which can be exploited to reduce per-gait training data requirements while enhancing the filter's generalizability across multiple gaits. Lastly, we introduce a heuristic derived from the observability Gramian to correlate joint angle estimate quality with gait periodicity and thruster inputs, highlighting how control affects estimation quality.
|
| |
| 10:24-10:27, Paper FrAR02.9 | Add to My Program |
| Conditional Graph Variational Autoencoder for Generation of Class-Conditioned Spatial Data |
|
| Shetty, Akanksh Prasad | Innovative Numerics LLC |
| David, Deepak Antony | University of Cincinnati |
| Busse, Luke | University of Cincinnati |
| Kumar, Manish | University of Cincinnati |
Keywords: Estimation, Neural networks, Pattern recognition and classification
Abstract: This paper tackles the problem of class conditional generation for spatial data represented as unordered sets of graph-structured nodes. Each node encodes spatial coordinates and activation intensities, and the goal is to model the conditional distribution over such sets, given a class label. We propose a Conditional Graph Variational Autoencoder (CGVAE) that learns to estimate class-aware latent representations and reconstruct node sets in a statistically grounded probabilistic framework. Our approach conditions both the encoder and decoder networks on class labels using Feature-wise Linear Modulation (FiLM) layers, enabling accurate posterior estimation while avoiding the need for per class decoders. To ensure stable and meaningful learning, we introduce a regularization strategy based on controlled information flow in the latent space. We employ a customized KL divergence loss which ensures that the model’s internal predictions remain close to a target distribution and a permutation-invariant reconstruction loss that compares both the node positions and intensities. This helps the generated data to closely match real data in a reliable way. Results on the MNIST superpixels demonstrate that the presented model is capable of generating diverse and label-consistent samples, and can support manipulations of geometric attributes such as slant, width, and height through latent space optimization. Further analysis reveals that the class identity is controlled by the FiLM layer, while geometric variation within each class is captured in the latent space, thus highlighting the model’s ability to separately estimate structured components of the data.
|
| |
| 10:27-10:30, Paper FrAR02.10 | Add to My Program |
| An Adaptive Fuzzy Logic Based Uncertainty Estimator for the Modeling Uncertainties of Super Coiled Polymer Actuators |
|
| Hindistan, Cagri | Ege University |
| Yilmaz, Bayram Melih | Ege University |
| Selim, Erman | Ege University |
| Tatlicioglu, Enver | Ege University |
| Zergeroglu, Erkan | Gebze Technical University |
Keywords: Fuzzy systems, Estimation
Abstract: This work presents an observer based estimation method for the model uncertainties of super coiled polymer (SCP) actuators using an adaptive fuzzy logic (AFL) based approach. Specifically, an observer formulation based on fuzzy logic estimator is proposed for the uncertainties present in the nonlinear model of the SCP actuator. The proposed observer utilizes the known parts of the system model while employing AFL to estimate the unknown parameters and uncertainties. The design incorporates a self-learning fuzzy logic based term, where the control representative value matrix, as well as the means and variances of the membership functions, are dynamically updated. The stability and convergence of the overall system is ensured rigorously via Lyapunov type arguments. Simulation studies are presented to illustrate the performance and viability of the proposed method.
|
| |
| 10:30-10:33, Paper FrAR02.11 | Add to My Program |
| Data-Driven Estimation of Quadrotor Motor Efficiency Via Residual Minimization |
|
| Cheng, Sheng-Wen | The University of Texas at Austin |
| Cheng, Teng-Hu | National Chiao Tung University |
Keywords: Estimation, Optimization, Robotics
Abstract: A data-driven framework is proposed for online estimation of quadrotor motor efficiency via residual minimization. The problem is formulated as a constrained nonlinear optimization that minimizes trajectory residuals between measured flight data and predictions generated by a quadrotor dynamics model. A sliding-window strategy enables online estimation, and the optimization is efficiently solved using an iteratively reweighted least squares (IRLS) scheme combined with a primal-dual interior-point method, with inequality constraints enforced through a logarithmic barrier function. Robust z-score weighting is employed to reject outliers, which is particularly effective in motor clipping scenarios where the proposed estimator exhibits smaller spikes than an EKF baseline. Compared to traditional filter-based approaches, the batch-mode formulation allows selective inclusion of data segments via IRLS reweighting and hard-rejection. This structure is well-suited for online estimation and supports applications such as fault detection and isolation (FDI), health monitoring, and predictive maintenance in aerial robotic systems. Simulation results under various degradation scenarios demonstrate the accuracy and robustness of the proposed estimator.
|
| |
| 10:33-10:36, Paper FrAR02.12 | Add to My Program |
| A Generalized Synthetic Control Method for Baseline Estimation in Demand Response Services |
|
| Sievers, Jonas | Karlsruhe Institute of Technology |
| Roozbehani, Mardavij | Massachusetts Institute of Technology |
Keywords: Estimation, Smart grid, Neural networks
Abstract: Baseline estimation is critical to Demand Response (DR) settlement in electricity markets, yet existing machine learning methods remain limited in predictive performance, while methodologies from causal inference and counterfactual prediction are still underutilized in this domain. We introduce a Generalized Synthetic Control Method that builds on the classical Synthetic Control Method (SCM) from econometrics. While SCM provides a powerful framework for counterfactual estimation, classical SCM remains a static estimator: it fits the treated unit as a combination of contemporaneous donor units and therefore ignores predictable temporal structure in the residual error. We develop a generalized SCM framework that transforms baseline estimation into a dynamic counterfactual prediction problem by augmenting the donor representation with exogenous features, lagged treated load, and selected lagged donor signals. This enriched representation allows the estimator to capture autoregressive dependence, delayed donor-response patterns, and error-correction effects beyond the scope of standard SCM. The framework further accommodates nonlinear predictors when linear weighting is inadequate, with the greatest benefit arising in limited-data settings. Experiments on the Ausgrid smart-meter dataset show consistent improvements over classical SCM and strong benchmark methods, with the dominant performance gains driven by dynamic augmentation.
|
| |
| 10:36-10:39, Paper FrAR02.13 | Add to My Program |
| Particle Filter Based Real-Time Multi-Modal Human Intention Prediction |
|
| Al Alsheikh, Abdullatif | Massachusetts Institute of Technology (MIT) |
| Zhang, Xiaotong | Massachusetts Institute of Technology (MIT) |
| Youcef-Toumi, Kamal | Massachusetts Inst. of Tech |
Keywords: Robotics, Intelligent systems, Sensor fusion
Abstract: This paper presents a novel particle filter-based approach for real-time human intention prediction in human-robot interaction (HRI) scenarios. Unlike existing methods that rely on offline training or single-modality inputs, our framework fuses three complementary information sources—head orientation, hand orientation based on object affordances, and hand velocity vectors—to predict which object a human intends to interact with. The particle filter naturally accommodates dynamic scenes where objects can be added or removed without requiring retraining, addressing a critical limitation of current learning-based approaches. We validate our method using a custom dataset of 48 grasping trajectories across 16 experiments, achieving approximately 90% accuracy in intention prediction with processing times of 2.7ms per frame. Comparative evaluation against a Bayesian Network approach demonstrates comparable performance (80.6% vs. 83.1% accuracy) despite requiring no offline training. Furthermore, we demonstrate the system’s adaptability through simulated object addition experiments, where the particle filter maintains stable performance as the state space dynamically expands. Our approach offers a practical solution for real-time human intention prediction in collaborative robotics applications, eliminating the need for task-specific training while maintaining computational efficiency suitable for real-time control.
|
| |
| 10:39-10:42, Paper FrAR02.14 | Add to My Program |
| Robust Sensor Placement for Poisson Arrivals with False-Alarm–Aware Spatiotemporal Sensing |
|
| Kim, Mingyu | Georgia Southern University |
| Sarker, Pronoy | Georgia Southern University |
| Kim, Seungmo | Virginia Tech |
| Stilwell, Daniel J. | Virginia Tech |
| Jimenez, Jorge | Virginia Tech |
Keywords: Sensor networks, Optimization algorithms, Filtering
Abstract: This paper studies sensor placement when detection performance varies stochastically due to environmental factors over space and time and false alarms are present, but a filter is used to attenuate the effect. We introduce a unified model that couples detection and false alarms through an availability function, which captures how false alarms reduce effective sensing and filtering responses to the disturbance. Building on this model, we give a sufficient condition under which filtering improves detection. In addition, we derive a coverage-based lower bound on the void probability. Furthermore, we prove robustness guarantees showing that performance remains stable when detection probabilities are learned from limited data. We validate the approach with numerical studies using AIS vessel-traffic data and synthetic maritime scenarios. Together, these results provide theory and practical guidance for deploying sensors in dynamic, uncertain environments.
|
| |
| 10:42-10:45, Paper FrAR02.15 | Add to My Program |
| Robust Cislunar Navigation Via LFT-Based mathcal{H}_infty Filtering with Bearing-Only Measurements |
|
| Bhattacharya, Raktim | Texas A&M |
Keywords: Observers for nonlinear systems, H-infinity control, Spacecraft control
Abstract: This paper develops a robust estimation framework for cislunar navigation that embeds the Circular Restricted Three-Body Problem (CR3BP) dynamics and bearing-only optical measurements within a Linear Fractional Transformation (LFT) representation. A full-order mathcal{H}_infty observer is synthesized with explicit mathcal{L}_2 performance bounds. The formulation yields a nonlinear estimator that operates directly on the governing equations and avoids reliance on local linearizations. Dominant nonlinearities are expressed as structured real uncertainties, while measurement fidelity is represented through range-dependent weighting with Earth-Moon distances reconstructed from line-of-sight geometry. The sensing architecture assumes passive star-tracker-class optical instruments, eliminating the need for time-of-flight ranging or precision clocks. Simulations demonstrate bounded estimation errors and smooth position tracking over multiple orbital periods, with the largest deviations observed in the out-of-plane states, consistent with the stiffness of the vertical dynamics and the limitations of angle-only observability. Application to a Near Rectilinear Halo Orbit (NRHO) illustrates that the framework can achieve robust onboard navigation with bounded estimation errors using flight-representative sensors.
|
| |
| 10:45-10:48, Paper FrAR02.16 | Add to My Program |
| Learning Safety–Compatible Observers for Unknown Systems |
|
| Bae, Juho | Korea Advanced Institute of Science and Technology |
| Roh, Daegyeong | Korea Advanced Institute of Science and Technology |
| Choi, Han-Lim | KAIST |
Keywords: Observers for nonlinear systems, Robust control, Learning
Abstract: This paper presents a data–driven approach for jointly learning a robust full–state observer and its robustness certificate for systems with unknown dynamics. Leveraging incremental input–to–state stability (δISS) notions, we jointly learn a δISS Lyapunov function that serves as the robustness certificate and prove practical convergence of the estimation error under standard fidelity assumptions on the learned mod- els. This renders the observer safety–compatible: they can be consumed by certificate–based safe controllers so that, when the controller tolerates bounded estimation error, the controller’s certificate remains valid under output feedback. We further extend the approach to interconnected systems via the small– gain theorem, yielding a distributed observer design framework. We validate the approach on a variety of nonlinear systems.
|
| |
| FrB03 Tutorial Session, Grand Salon 3 |
Add to My Program |
Foundations in AI and Quantum Systems: From Control and Game-Theory to
Stochastic Deception |
|
| |
| Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
| Co-Chair: Tang, Michael | University of California, San Diego |
| Organizer: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
| Organizer: Poveda, Jorge I. | University of California, San Diego |
| |
| 13:30-15:00, Paper FrB03.1 | Add to My Program |
| A Tutorial for Foundations in AI and Quantum Systems: From Control and Game-Theory to Stochastic Deception (I) |
|
| Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
| Poveda, Jorge I. | University of California, San Diego |
| Anderson, Sean | University of California Santa Barbara |
| Hespanha, Joao P. | Univ. of California, Santa Barbara |
| Krstic, Miroslav | University of California, San Diego |
| Petersen, Ian R. | Australian National University |
| Tang, Michael | University of California, San Diego |
| Wadi, Ali | Georgia Institute of Technology |
| Zhu, Quanyan | New York University |
Keywords: Quantum information and control, Game theory
Abstract: In this tutorial, we explore the intersection of quantum control, game theory, and artificial intelligence deception, emphasizing their combined potential to enhance secure and strategic decision-making in complex systems. It examines how deceptive behavior can be understood and influenced within strategic frameworks, and outlines principles for designing resilient systems that leverage advantages in both quantum and learning-driven environments. While the discussion centers on specific examples, the foundational ideas are broadly applicable across a range of domains.
|
| |
| FrB04 Invited Session, Grand Salon 4 |
Add to My Program |
| Safety Verification and Optimization Protocols for Spacecraft |
|
| |
| Chair: Lippay, Zachary | Verus Research |
| Co-Chair: Soderlund, Alexander | The Ohio State University |
| Organizer: Petersen, Chris | University of Florida |
| Organizer: Phillips, Sean | Air Force Research Laboratory |
| Organizer: Soderlund, Alexander | The Ohio State University |
| |
| 13:30-13:45, Paper FrB04.1 | Add to My Program |
| Safe Landing on Small Celestial Bodies with Gravitational Uncertainty Using Disturbance Estimation and Control Barrier Functions (I) |
|
| Arenas Uribe, Felipe | University of Kentucky |
| Seigler, Thomas Michael | University of Kentucky |
| Hoagg, Jesse B. | University of Kentucky |
Keywords: Spacecraft control, Optimal control, Constrained control
Abstract: Soft landing on small celestial bodies (SCBs) poses unique challenges, as uncertainties in gravitational models and poorly characterized, dynamic environments require a high level of autonomy. Existing control approaches lack formal guarantees for safety constraint satisfaction, necessary to ensure the safe execution of the maneuvers. This paper introduces a control that addresses this limitation by integrating trajectory tracking, disturbance estimation, and safety enforcement. An extended high-gain observer is employed to estimate disturbances resulting from gravitational model uncertainties. We then apply a feedback-linearizing and disturbance-canceling controller that achieves exponential tracking of reference trajectories. Finally, we use a control barrier function based minimum-intervention controller to enforce state and input constraints through out the maneuver execution. This control combines trajectory tracking of offline generated reference trajectories with formal guarantees of safety, which follows common guidance and control architectures for spacecraft and allows aggressive maneuvers to be executed without compromising safety. Numerical simulations using fuel-optimal trajectories demonstrate the effectiveness of the controller in achieving precise and safe soft-landing, highlighting its potential for autonomous SCB missions.
|
| |
| 13:45-14:00, Paper FrB04.2 | Add to My Program |
| Shrinking Horizon MPC for Safe Multi-Spacecraft Capture of a Non-Cooperative Target (I) |
|
| Kim, Taehyeun | University of Michigan |
| Girard, Anouck | University of Michigan, Ann Arbor |
| Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Aerospace, Constrained control, Optimal control
Abstract: This paper describes the application of a Shrinking Horizon Model Predictive Controller (SHMPC) to a cooperative spacecraft capture. SHMPC is used to coordinate multiple capturing spacecraft to specified non-equilibrium states corresponding to the capture of a rotating non-cooperative target subject to state and control constraints. Robustness to model mismatch, thrust uncertainty, and inexact optimization is illustrated through numerical simulations demonstrating the benefits of SHMPC over the following open-loop trajectories using conventional feedback. Enhancements to the baseline SHMPC formulation are proposed to reduce the computational effort.
|
| |
| 14:00-14:15, Paper FrB04.3 | Add to My Program |
| Perception-Integrated Safety Critical Control Via Analytic Collision Cone Barrier Functions on 3D Gaussian Splatting (I) |
|
| Tscholl, Dario | Georgia Institute of Technology |
| Nakka, Yashwanth Kumar | Georgia Institute of Technology |
| Gunter, Brian | Georgia Institute of Technology |
Keywords: Predictive control for linear systems, Emerging control applications, Autonomous vehicles
Abstract: This letter presents a perception-driven safety filter that converts each 3D Gaussian Splat (3DGS) into a closed-form forward collision cone. The resulting constraint admits a relative-degree-one control barrier function (CBF) that is enforced online via a quadratic program (QP). By leveraging the analytic geometry of Gaussian splats, the proposed formulation yields a continuous, closed-form representation of collision avoidance constraints that is both simple and computationally efficient. In contrast to distance-based CBFs, which typically become active only when the robot is already near an obstacle, the collision cone CBF activates proactively, enabling earlier corrective action and thus smoother avoidance behavior at lower computational cost. The method is validated on a large synthetic scene consisting of approximately 170k Gaussian splats. Compared to a state-of-the-art 3DGS planner, the proposed filter reduces computation time by a factor of 3 and notably decreases trajectory jerk while maintaining safety. The approach is entirely analytic, requires no high-order CBF (HOCBF), and extends naturally to robots with physical extent via principled Minkowski-sum inflation of the splats. These properties make the method well suited for real-time navigation in cluttered, perception-derived environments on Earth and in space.
|
| |
| 14:15-14:30, Paper FrB04.4 | Add to My Program |
| Analysis and Flight Validation of Momentum-Biased Attitude Stability During Open-Loop Deployment on NISAR |
|
| Rallapalli, Aditya | UR Rao Satellite Centre |
| Kakula, Ashok Kumar | U R Rao Satellite Centre(ursc) , Isro |
| Bates, David | Jet Propulsion Laboratory |
Keywords: Spacecraft control, Stability of linear systems, Control applications
Abstract: The deployment of large unfurlable reflectors presents significant challenges for spacecraft attitude stability, particularly when performed in open-loop (no control) mode due to uncertainties in deployment dynamics and limitations on active control authority. For the NASA–ISRO Synthetic Aperture Radar (NISAR) mission, a 12 m reflector was deployed in orbit using an open-loop sequence. To minimize drift from the Sun and ensure uninterrupted power generation, a momentumbiased configuration was implemented using reaction wheels. Unlike conventional spin-stabilization, which is unsuitable for reflector missions due to telemetry and telecommand pointing constraints, the reaction-wheel momentum bias provided controlled passive stability while maintaining spacecraft orientation. The bias value was derived through analytical modeling of spacecraft dynamics with reaction wheels under a smallangle approximation about the Earth-pointing reference frame, explicitly accounting for gravity-gradient torques and inertia properties. This analysis established stability margins and guided the selection of the commanded momentum bias. The approach also maximized the allowable open-loop duration, providing sufficient time for deployment even under contingency scenarios. Pre-flight predictions showed stable behavior with minimized solar and nadir drift, which was subsequently confirmed through on-orbit telemetry. Flight data demonstrated close agreement with analytical results, providing a flightproven validation of the momentum-biased attitude stability approach for large reflector deployment.
|
| |
| 14:30-14:45, Paper FrB04.5 | Add to My Program |
| Gradient-Based Constrained Spacecraft Trajectory Optimization with Convergence Guarantees |
|
| Zheng, Yifei | University of California San Diego |
| Dower, Peter M. | University of Melbourne |
| McEneaney, William M. | Univ. California San Diego |
Keywords: Optimization algorithms, Computational methods, Aerospace
Abstract: A trajectory optimization method is devised for a class of problems with state constraints. The method is conceptually a projected gradient descent method, where the convergence of the control process is proved without reference to any discretization, and is obtained directly from the convergence of the cost function under mild regularity conditions. A corresponding algorithm for spacecraft trajectory optimization is proposed, along with two numerical examples where terminal states are prescribed. The algorithm is able to generate trajectories that require less fuel than stationary-action trajectories.
|
| |
| 14:45-15:00, Paper FrB04.6 | Add to My Program |
| Convex Sum-Of-Squares Controller Design for Relative Orbital Motion about Circular Chief Orbits |
|
| Hoyos, Jose Daniel | Purdue University |
| Sikka, Sidhdharth | Purdue |
| Rai, Ayush | Purdue University |
| Mou, Shaoshuai | Purdue University |
Keywords: Spacecraft control, Formal verification/synthesis, Optimization
Abstract: A key challenge in relative orbital motion is the design of nonlinear controllers that provide formal stability guarantees for the nonlinear dynamics while systematically incorporating operational constraints such as actuator saturation and sensor field-of-view requirements. In this paper, we present a computational framework for designing certified nonlinear controllers for orbital control under circular chief orbit. Our approach provides a unified solution leveraging three key steps: polynomial lifting to express the non-polynomial orbital dynamics in a polynomial form, the use of a dual Lyapunov theory for stability certification, and sum-of-squares optimization for tractable controller synthesis. Numerical simulations confirm effective constraint handling, including input saturation and geometric keep-in regions, and enhanced robustness to significant state measurement noise in comparison to LQR.
|
| |
| FrB05 Tutorial Session, Grand Salon 6 |
Add to My Program |
| Control Via Incentives and Information in Socio-Technical Systems |
|
| |
| Chair: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
| Co-Chair: Savla, Ketan | University of Southern California |
| Organizer: Chiu, Chih-Yuan | Georgia Institute of Technology |
| Organizer: Ferguson, Bryce L. | Dartmouth College |
| Organizer: Brown, Philip N. | University of Colorado Colorado Springs |
| Organizer: Savla, Ketan | University of Southern California |
| Organizer: Wu, Manxi | University of California Berkeley |
| Organizer: Bose, Subhonmesh | University of Illinois at Urbana Champaign |
| Organizer: Velicheti, Raj Kiriti | University of Illinois at Urbana Champaign |
| Organizer: Basar, Tamer | Univ of Illinois, Urbana-Champaign |
| |
| 13:30-15:00, Paper FrB05.1 | Add to My Program |
| Control Via Incentives and Information in Socio-Technical Systems: A Tutorial (I) |
|
| Basar, Tamer | Univ of Illinois, Urbana-Champaign |
| Bose, Subhonmesh | University of Illinois at Urbana Champaign |
| Brown, Philip N. | University of Colorado Colorado Springs |
| Chiu, Chih-Yuan | Georgia Institute of Technology |
| Ferguson, Bryce L. | Dartmouth College |
| Savla, Ketan | University of Southern California |
| Velicheti, Raj Kiriti | University of Illinois at Urbana Champaign |
| Wu, Manxi | University of California Berkeley |
| Maheshwari, Chinmay | Johns Hopkins University |
Keywords: Game theory, Agents-based systems, Emerging control applications
Abstract: Many shared public amenities such as transporta- tion networks and power grids suffer from congestion and depletion effects resulting from uncoordinated overconsumption by self-interested users. In response, researchers have long studied methods of encouraging pro-social consumption of shared societal resources. In this tutorial, we survey recent work in the area of behavior-influencing mechanisms in modern infrastructure-scale socio-technical systems. Two basic frame- works are considered. First, an informed social planner may use their informational advantage to influence behavior: our paper demonstrates a variety of situations in which interesting and counterintuitive principles emerge in this context. For instance, it is often the case that agents are better off if the planner reveals noisy information rather than acting with full transparency. We present a variety of distinct approaches and problem formulations for this information design setting, ranging from convergence results to analysis under nonstandard informational assumptions. Second, an infrastructure operator may leverage their monopoly power to use pricing policies to influence societal behavior. Here, we ask a robustness question and present recent work on methods of characterizing the sensitivity of social outcomes to variations in pricing policy.
|
| |
| FrB06 Invited Session, Grand Salon 7 |
Add to My Program |
| Networked Systems: Analysis and Control |
|
| |
| Chair: Gracy, Sebin | South Dakota School of Mines and Technology |
| Co-Chair: Pare, Philip E. | Purdue University |
| Organizer: Gracy, Sebin | South Dakota School of Mines and Technology |
| Organizer: Pare, Philip E. | Purdue University |
| Organizer: Ishii, Hideaki | University of Tokyo |
| |
| 13:30-13:45, Paper FrB06.1 | Add to My Program |
| Generating Differentially Private Networks with a Modified Erdős–Rényi Model (I) |
|
| Rao, Huaiyuan | Georgia Institute of Technology |
| Hawkins, Calvin | Georgia Institute of Technology |
| Benvenuti, Alexander | Georgia Institute of Technology |
| Hale, Matthew | Georgia Institute of Technology |
Keywords: Network analysis and control
Abstract: Differential privacy has been used to privately calculate numerous network properties, but existing approaches often require the development of a new privacy mechanism for each property of interest. Therefore, we present a framework for generating entire networks in a differentially private way. Differential privacy is immune to post-processing, which allows for any network property to be computed and analyzed for a private output network, without weakening its protections. We consider undirected networks and develop a differential privacy mechanism that takes in a sensitive network and outputs a private network by randomizing its edge set. We prove that this mechanism does provide differential privacy to a network's edge set, though it induces a complex distribution over the space of output graphs. We then develop an equivalent privacy implementation using a modified Erdős–Rényi model that constructs an output graph edge by edge, and it is efficient and easily implementable, even on large complex networks. Experiments implement epsilon-differential privacy with epsilon of 2.5 when computing graph Laplacian spectra, and these results show the proposed mechanism incurs 49.34% less error than the current state of the art.
|
| |
| 13:45-14:00, Paper FrB06.2 | Add to My Program |
| The Tragedy of the Commons in Multi-Population Resource Games (I) |
|
| Vahmian, Yamin | University of Colorado Colorado Springs |
| Paarporn, Keith | University of Colorado, Colorado Springs |
Keywords: Game theory, Network analysis and control, Optimization
Abstract: Self-optimizing behaviors can lead to outcomes where collective benefits are ultimately destroyed, a well-known phenomenon known as the ``tragedy of the commons". These scenarios are widely studied using game-theoretic approaches to analyze strategic agent decision-making. In this paper, we examine this phenomenon in a bi-level decision-making hierarchy, where low-level agents belong to multiple distinct populations, and high-level agents make decisions that impact the choices of the local populations they represent. We study strategic interactions in a context where the populations benefit from a common environmental resource that degrades with higher extractive efforts made by high-level agents. We characterize a unique symmetric Nash equilibrium in the high-level game, and investigate its consequences on the common resource. While the equilibrium resource level degrades as the number of populations grows large, there are instances where it does not become depleted. We identify such regions, as well as the regions where the resource does deplete.
|
| |
| 14:00-14:15, Paper FrB06.3 | Add to My Program |
| A Passivity-Agnostic Framework for Distributed Adaptive Synchronization under Unknown Leader Dynamics (I) |
|
| Wafi, Moh. Kamalul | Northeastern University |
| Montazeri Hedesh, Hamidreza | Northeastern University |
| Siami, Milad | Northeastern University |
Keywords: Adaptive control, Networked control systems, Distributed control
Abstract: We present a passivity-agnostic framework for distributed adaptive synchronization under position-only communication, bounded disturbances, and unknown leader dynamics. By passivity-agnostic we mean the design does not require the closed loop to be SPR a priori: it certifies SPR when present and recovers it by frequency shaping when absent. Followers are heterogeneous second-order systems with unknown (possibly unstable) dynamics. In the SPR regime, a structured reparameterization yields gradient-based adaptive error dynamics; Lyapunov analysis guarantees global asymptotic synchronization in the disturbance-free case, exact rejection of constant disturbances, and bounded responses to time-varying disturbances, with parameter convergence under persistent excitation. In the non-SPR regime, frequency shaping recovers effective passivity of the unshaped transfer function, enabling the same stability guarantees via standard passivity/Lyapunov arguments using Meyer-Kalman-Yakubovich (MKY) Lemma. Simulations across star, cyclic, path, and arbitrary graphs demonstrate scalable synchronization, robust tracking, and parameter adaptation under multiple disturbance profiles, confirming that the frequency-shaped non-SPR designs match the performance of the SPR case.
|
| |
| 14:15-14:30, Paper FrB06.4 | Add to My Program |
| Regulation of Rumor Propagation Via (Multi-Leader) Stackelberg Graphon Games (I) |
|
| Liu, Huaning | University of Illinois at Urbana-Champaign |
| Dayanikli, Gokce | University of Illinois Urbana-Champaign |
Keywords: Mean field games, Game theory, Control applications
Abstract: We study the control of rumor propagation in large networked populations by using Stackelberg graphon games. We first introduce a principal who wants to incentivize the spread of her preferred news and discourage the spread of non-preferred news. We define the Stackelberg graphon game equilibrium (SGGE), characterize the graphon game Nash equilibrium (GGNE) with a forward-backward differential equation system, and establish existence results. We further formulate a multi-leader model with two competing principals, each incentivizing her own preferred news. Finally, we propose a bi-level algorithm for computing (multi-leader) Stackelberg graphon game equilibria and conclude with numerical experiments where we show that existence competing principals will result in strong opinion divisions in the population.
|
| |
| 14:30-14:45, Paper FrB06.5 | Add to My Program |
| Finding Super-Spreaders in SIS Epidemics (I) |
|
| Sridhar, Anirudh | New Jersey Institute of Technology |
| Ghosh, Arnob | New Jersey Institute of Technology |
Keywords: Network analysis and control, Markov processes, Estimation
Abstract: In network epidemic models, controlling the spread of a disease often requires targeted interventions such as vaccinating high-risk individuals based on network structure. However, typical approaches assume complete knowledge of the underlying contact network, which is often unavailable. While network structure can be learned from observed epidemic dynamics, existing methods require long observation windows that may delay critical interventions. In this work, we show that full network reconstruction may not be necessary: control-relevant features, such as high-degree vertices (super-spreaders), can be learned far more efficiently than the complete structure. Specifically, we develop an algorithm to identify such vertices from the dynamics of a Susceptible-Infected-Susceptible (SIS) process. We prove that in an n-vertex graph, vertices of degree at least n^alpha can be identified over an observation window of size Omega (1/alpha), for any alpha in (0,1). In contrast, existing methods for exact network reconstruction requires an observation window that grows linearly with n. Simulations demonstrate that our approach accurately identifies super-spreaders and enables effective epidemic control.
|
| |
| 14:45-15:00, Paper FrB06.6 | Add to My Program |
| Bounds of Validity for Bifurcations of Equilibria in a Class of Networked Dynamical Systems (I) |
|
| Gupta, Pranav | Indian Institute of Technology Bombay |
| Banavar, Ravi N. | Indian Institute of Technology Bombay |
| Bizyaeva, Anastasia | Cornell University |
Keywords: Networked control systems, Neural networks, Stability of nonlinear systems
Abstract: Local bifurcation analysis plays a central role in understanding qualitative transitions in networked nonlinear dynamical systems, including dynamic neural network and opinion dynamics models. In this article we establish explicit bounds of validity for the classification of bifurcation diagrams in two classes of continuous-time networked dynamical systems, analogous in structure to the Hopfield and the Firing Rate dynamic neural network models. Our approach leverages recent advances in computing the bounds for the validity of Lyapunov–Schmidt reduction, a reduction method widely employed in nonlinear systems analysis. Using these bounds we rigorously characterize neighbourhoods around bifurcation points where predictions from reduced-order bifurcation equations remain reliable. We further demonstrate how these bounds can be applied to an illustrative family of nonlinear opinion dynamics on k−regular graphs, which emerges as a special case of the general framework. These results provide new analytical tools for quantifying the robustness of bifurcation phenomena in dynamics over networked systems and highlight the interplay between network structure and nonlinear dynamical behaviour.
|
| |
| FrB07 Regular Session, Grand Salon 9 |
Add to My Program |
| Automotive Systems |
|
| |
| Chair: Tang, Jian | Robert Bosch LLC |
| Co-Chair: Rajamani, Rajesh | Univ. of Minnesota |
| |
| 13:30-13:45, Paper FrB07.1 | Add to My Program |
| Design of a Sensorless Trailer Anti-Lock Braking System Based on Trailer Swing Detection |
|
| Jeong, Semin | Korea Advanced Institute of Science and Technology |
| Oh, Jiyeol | Korea Advanced Institute of Science and Technology |
| Choi, Seibum Ben | Korea Advanced Institute of Science and Technology |
Keywords: Automotive systems, Automotive control, Control applications
Abstract: This study proposes a trailer anti-lock braking system (ABS) for commercial travel trailers, addressing the practical concern of unknown trailer parameters and the absence of trailer sensors. In current commercial trailer systems, a control unit in the towing vehicle allows the driver to manually adjust the trailer brake gain. However, improper adjustment can cause excessive braking at the trailer wheels, leading to wheel lock-up and a loss of lateral stability, commonly referred to as the “trailer swing” phenomenon. Previous studies are based on trailer wheel speed or hitch angle sensors, which are not typically available in commercial trailers, thus limiting their applicability. Aftermarket trailer ABS also exists, but its high cost and complex installation requirements have hindered widespread adoption. To overcome this limitation, this study introduces a novel trailer ABS that does not rely on direct wheel slip measurements. Instead, trailer swing is detected by the trailer swing index, an indirect indicator of wheel lock-up, enabling braking control without additional trailer-mounted sensors. The proposed method is evaluated through Carsim/Simulink. Through the proposed method, trailer swing is detected and controlled effectively without trailer information.
|
| |
| 13:45-14:00, Paper FrB07.2 | Add to My Program |
| On the Importance of Slip Angle Estimation for Accurate Target Vehicle Tracking |
|
| Kyong, Hongjoon | University of Minnesota |
| Alai, Hamidreza | University of Minnesota |
| Kazemi Tameh, Ehsan | University of Minnesota |
| Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Automotive systems, Mechatronics, Estimation
Abstract: With an increasing demand for Level 3 autonomous driving systems, the need for accurate tracking of the trajectories of nearby vehicles using just low-cost radar has become critical. While well-designed estimation algorithms for target vehicle tracking are essential, the accuracy of the underlying dynamic model plays a pivotal role as well. Conventional vehicle kinematic models are formulated in the inertial (global) frame and do not explicitly account for the ego-vehicle’s motion. The new relative motion model in this paper considers the effects of ego-vehicle motion, leading to improved target tracking performance when the ego-vehicle executes non-zero yaw motions. This model requires the use of the ego-vehicle’s dynamic states, including velocity, yaw rate, and slip angle. Among these, slip angle estimation poses a particular challenge due to limitations in cost-effective sensing technologies. This paper highlights the importance of slip angle estimation for accurate vehicle tracking using a new relative motion model. The ego-vehicle’s slip angles are employed for target vehicle tracking. The proposed method is validated across a range of driving scenarios, including sharp cornering maneuvers and even U-turns. Experimental results demonstrate that the proposed method consistently provides reliable vehicle tracking results.
|
| |
| 14:00-14:15, Paper FrB07.3 | Add to My Program |
| Observer Design for Vector Nonlinear System with Application to Vehicle Tracking |
|
| Sharma, Gaurav | University of Minnesota |
| Zemouche, Ali | CRAN UMR CNRS 7039 & Université De Lorraine |
| Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Automotive systems, Multivehicle systems, Estimation
Abstract: This paper introduces an observer design framework for a system with multivariable vector arguments in the nonlinear process dynamics. When the partial derivatives of the nonlinear functions are bounded and monotonic, exponential convergence of the observer is guaranteed by design using a set of linear matrix inequalities (LMIs). For non-monotonic cases, a switched-gain observer is proposed, complemented by a state-scaling technique that enhances numerical stability and extends the feasibility of the LMI conditions. The framework is validated in an autonomous driving application, where the observer estimates the trajectories of surrounding vehicles. Experiments on a full-scale autonomous Chrysler Pacifica vehicle at the University of Minnesota demonstrate robust performance even in the presence of non-monotonic vector nonlinear dynamics.
|
| |
| 14:15-14:30, Paper FrB07.4 | Add to My Program |
| A Regenerative Braking Framework for Intent-Based Platooning in Real-World Traffic Situations |
|
| Sharma, Sachin | Toyota Motor North America |
| Moradipari, Ahmadreza | Toyota Motor North America |
| Avedisov, Sergei S. | Toyota North America R&D InfoTech Labs |
| Mishra, Shatadal | Toyota Motor North America |
| Nour, Mariam | Toyota InfoTech Labs |
Keywords: Cooperative control, Automotive systems, Multivehicle systems
Abstract: Traditional platooning algorithms, such as Cooperative Adaptive Cruise Control (CACC), rely on connected automated vehicles (CAVs) sharing their current position, speed, and acceleration. In contrast, intent-based platooning is an emerging technology where CAVs share their planned trajectories. In this work, both of these algorithms are adapted for Battery Electric Vehicles (BEVs) and Plug-in Hybrid Electric Vehicles (PHEVs) to assess the impact of platooning on energy consumption. The regenerative braking system common in BEVs and PHEVs is modeled and incorporated into the optimization framework of intent-based platooning. The modified framework is then compared with the traditional CACC and intent-based platooning frameworks through simulations based on real vehicle speed profiles collected in free-flow and congested highway driving. The results show that the intent-based platooning framework adapted for BEVs and PHEVs achieves an 7-16% improvement in energy efficiency compared to the traditional intent-based platooning framework, and an 14-33% improvement compared to CACC.
|
| |
| 14:30-14:45, Paper FrB07.5 | Add to My Program |
| Bayesian Optimization for ADAS Development: Balancing Safety and Comfort in an AEB Use Case (I) |
|
| Rigotti Caiano, Solano | IPEN |
| Galbier Lopes, Ygor | Bosch |
| Mafort, Leimar | Compnay |
| Tang, Jian | Robert Bosch LLC |
Keywords: Optimization algorithms, Simulation, Automotive systems
Abstract: This work integrated a hybrid Bayesian–Optimisation and Genetic–Algorithm (BO–GA) workflow in the IPG CarMaker’s software-in-the-loop (SIL) environment to assist Advanced Driver Assistance System (ADAS) development. By replacing exhaustive grid searches with a Gaussian-process surrogate model and an expected-improvement acquisition function, the approach delivers substantial cost savings and delineates operating boundaries for subsequent software development. An Autonomous Emergency Braking (AEB) use case shows how the approach concentrates evaluations in the most critical regions of the scenario space, specifically where the timeto-collision (TTC) satisfies < 1.0s or the jerk exceeds 10m/s3 —thereby meeting both safety and comfort requirements. Relative to conventional parameter sweeps, the method cuts the total efforts by approximately 75% while still producing interpretable safety–comfort trade-off curves that guide further development.
|
| |
| 14:45-15:00, Paper FrB07.6 | Add to My Program |
| Lateral String Stability for Vehicle Platoons |
|
| Li, Sixu | Texas A&M Univ |
| Darbha, Swaroop | Texas A&M Univ |
| Zhou, Yang | Texas A&M Univ |
Keywords: Multivehicle systems, Automotive control, Control applications
Abstract: Connected and automated vehicle (CAV) platooning promises gains in energy efficiency and traffic throughput and, most critically, in safety. These safety benefits hinge on string stability, which determines how disturbances propagate along a platoon. While longitudinal string stability is well studied, lateral string stability, which governs the propagation of path-tracking errors that can lead to unsafe deviations from the intended path, remains underexplored. Its importance is increasing as autonomous vehicles rely more heavily on onboard sensing and map‑free navigation, where sensor occlusion and dense formations amplify safety risks. This paper presents a new framework for lateral string stability that directly addresses safety‑critical path‑relative tracking errors and enables consistent comparison across vehicles following the same road geometry. Central to this framework is an arc‑length (Eulerian) viewpoint, a departure from traditional analyses, that clarifies how tracking errors at a given point on the path propagate from one vehicle to the next. A formal definition of lateral string stability is introduced along with two control strategies: an onboard‑sensing‑only controller and a novel learn‑from‑predecessor approach utilizing vehicle-to-vehicle (V2V) communication. We show that onboard sensing alone cannot guarantee attenuation of path-tracking errors, imposing a fundamental safety limitation, whereas V2V communication enables true error attenuation.
|
| |
| FrB08 Regular Session, Grand Salon 10-13 |
Add to My Program |
| Cooperative Control |
|
| |
| Chair: Gaspar, Peter | SZTAKI |
| Co-Chair: Zelazo, Daniel | Technion - Israel Institute of Technology |
| |
| 13:30-13:45, Paper FrB08.1 | Add to My Program |
| Aligning LLMs with Human Comfort Preferences in Human-Robot Collaboration Via Comfort-Guided Supervised Fine-Tuning |
|
| Yan, Yuchen | Clemson University |
| Jia, Yunyi | Clemson Universtiy |
Keywords: Cooperative control, Agents-based systems, Optimization
Abstract: Aligning large language models (LLMs) with human preferences has been a critical challenge in artificial intelligence research. While Reinforcement Learning from Human Feedback (RLHF) is a popular approach to this alignment, it often relies on explicit human-provided rewards, which are costly, inconsistent, and difficult to scale. In this paper, we propose a comfort-guided supervised fine-tuning framework that replaces direct human feedback with reward signals computed from pre-built human comfort models. These models capture spatial, cognitive, and interactional preferences, enabling LLMs to generate decisions that align with human comfort expectations in collaborative scenarios. By fine-tuning LLMs using comfort-based and efficiency-based reward functions derived from these models, our method provides a scalable and systematic alternative to traditional feedback-driven learning. We apply this framework to a human-robot collaboration task, where an LLM generates optimal action plans by balancing efficiency and comfort. Our results demonstrate that fine-tuning with an extremely small dataset (1MB) and model-generated reward signals significantly improves alignment with human expectations, yielding decisions that are both effective and intuitively preferred by human collaborators. This study introduces a lightweight and efficient strategy to embed human comfort into AI-driven decision-making, reducing reliance on large-scale human annotation while maintaining high-quality preference alignment.
|
| |
| 13:45-14:00, Paper FrB08.2 | Add to My Program |
| Energy-Based Scheduling for Collaborative Robot Mobility |
|
| Nguyen, Alexander A. | University of California, Irvine |
| Jabbari, Faryar | Univ. of California at Irvine |
| Egerstedt, Magnus | University of California, Irvine |
Keywords: Cooperative control, Agents-based systems, Optimization
Abstract: This paper presents an energy-based, pairwise collaborative scheduling framework for two robots with different mobility characteristics traversing a landscape. The collaborative scheduling problem is formulated as a mixed integer linear program (MILP), whose solution determines when the two robots should work together to accomplish their respective tasks with energy remaining in their batteries. The MILP is framed in two ways, considering net energy expenditure and final stored energy (i.e., energy available at the end of the task) as performance measures: (I) optimizing the performance of the overall team (e.g., minimizing the combined net energy expenditure of the two robots) and (II) optimizing the performance of individuals (e.g., minimizing the maximum net energy expenditure for the individual robots). Additionally, we introduce the notion of the ``cost of fairness'' and demonstrate the proposed framework on scenarios in which collaboration between two heterogeneous robots is necessary.
|
| |
| 14:00-14:15, Paper FrB08.3 | Add to My Program |
| A Generalized Voronoi Graph Based Coverage Control Approach for Non-Convex Environment |
|
| Guo, Zuyi | Zhejiang University |
| Zheng, Ronghao | Zhejiang University, ZJU |
| Liu, Meiqin | Zhejiang University |
| Zhang, Senlin | Zhejiang University |
Keywords: Distributed control, Cooperative control, Decentralized control
Abstract: To address the challenge of efficient coverage by multi-robot systems in non-convex regions with multiple obstacles, this paper proposes a coverage control method based on the Generalized Voronoi Graph (GVG). This method partitions the non-convex region into multiple sub-regions using GVG, overcoming the limitations of traditional Voronoi partitioning, which fails to handle obstacles and non-convex boundaries. Besides, a weighted load-balancing algorithm is developed, which considers the quality differences among sub-regions. By iteratively optimizing the robot allocation ratio, the number of robots in each sub-region is matched with the sub-region quality to achieve load balance. The convergence of the method is proved and its performance is evaluated through simulations.
|
| |
| 14:15-14:30, Paper FrB08.4 | Add to My Program |
| Formation Control Via Rotation Symmetry Constraints |
|
| Martinez, Zamir | Technion - Israel Institute of Technology |
| Zelazo, Daniel | Technion - Israel Institute of Technology |
Keywords: Cooperative control, Distributed control, Control of networks
Abstract: This work introduces a distributed formation control strategy for multi-agent systems based solely on rotation symmetry constraints. We propose a potential function that enforces inter-agent rotational symmetries, whose gradient defines a control law that drives the agents toward a desired planar symmetric configuration. We show that only n-1 edges (the minimal connectivity requirement) are sufficient to implement the strategy, where n is the number of agents. We further augment the design to address the maneuvering problem, enabling the formation to undergo coordinated translations, rotations, and scaling along a predefined virtual trajectory. Simulation examples are provided to validate the effectiveness of the proposed method.
|
| |
| 14:30-14:45, Paper FrB08.5 | Add to My Program |
| An Integrated Design of Robust Hinf Control with Proximal Policy Optimization for the Cooperation of Robot Vehicles |
|
| Lelkó, Attila | SZTAKI Institute for Computer Science and Control |
| Nemeth, Balazs | SZTAKI |
| Gaspar, Peter | SZTAKI |
Keywords: Cooperative control, Reinforcement learning, Robust control
Abstract: The aim of this study is to propose a reinforcement-learning-based synthesis method for the integrated design of a robust H-infinity control and a deep neural network. This integration improves closed-loop performance through Proximal Policy Optimization (PPO), i.e. the Hinf controller and the deep neural network are trained within the same optimization process; this joint training constitutes the main novelty of the paper. The resulting control system provides performance guarantees against uncertainties which are taken into account both in the Hinf design process and in the training environment. The effectiveness of the integrated design method is illustrated through the coordinated control of simultaneously moving robot vehicles, where reduced mission time and collision avoidance are achieved even under significant uncertainties.
|
| |
| 14:45-15:00, Paper FrB08.6 | Add to My Program |
| Multi-Robot Allocation for Information Gathering in Non-Uniform Spatiotemporal Environments |
|
| Naveed, Kaleb Ben | University of Michigan, |
| Lee, Haejoon | University of Michigan |
| Panagou, Dimitra | University of Michigan |
Keywords: Cooperative control, Autonomous robots, Sensor networks
Abstract: Autonomous robots are increasingly deployed to estimate spatiotemporal fields (e.g., wind, temperature, gas concentration) that vary across space and time. We consider environments divided into non-overlapping regions with distinct spatial and temporal dynamics, termed non-uniform spatiotemporal environments. Gaussian Processes (GPs) are commonly used to estimate such fields, where the kernel encodes spatial and temporal correlations through its lengthscales. When these lengthscales are misspecified, the resulting uncertainty estimates can be highly inaccurate. Existing GP methods often assume a single global lengthscale or update only periodically; some allow spatial variation but ignore temporal changes. To address these limitations, we propose a two-phase framework for multi-robot field estimation: Phase 1 uses a variogram-driven planner to learn region-specific spatial lengthscales, while Phase 2 reallocates robots based on current uncertainty and updates sampling as temporal lengthscales are refined. To encode uncertainty, we use clarity, an information metric from our earlier work. We evaluate the proposed method across diverse environments and provide convergence analysis for spatial lengthscale estimation, along with dynamic regret bounds quantifying the gap to the oracle allocation sequence.
|
| |
| FrB09 Invited Session, Grand Salon 12 |
Add to My Program |
| Estimation for Energy Storage Systems |
|
| |
| Chair: Zhang, Dong | University of Oklahoma |
| Co-Chair: Soudbakhsh, Damoon | Temple University |
| Organizer: Docimo, Donald | Texas Tech University |
| Organizer: Soudbakhsh, Damoon | Temple University |
| Organizer: Zhang, Dong | University of Oklahoma |
| Organizer: Song, Ziyou | University of Michigan, Ann Arbor |
| Organizer: Araujo Xavier, Marcelo | Amazon Leo |
| Organizer: Moura, Scott | University of California, Berkeley |
| Organizer: Lin, Xinfan | University of California, Davis |
| Organizer: Cui, Xiaofan | University of California, Los Angeles |
| Organizer: Filgueira da Silva, Samuel | The Ohio State University |
| Organizer: Tang, Shuxia | Texas Tech University |
| Organizer: Dey, Satadru | The Pennsylvania State University |
| |
| 13:30-13:45, Paper FrB09.1 | Add to My Program |
| Combined State and Parameter Estimation in Parallel-Connected Lithium-Ion Battery Cells (I) |
|
| Espin, Jorge | University of Oklahoma |
| Zhang, Dong | University of Oklahoma |
Keywords: Estimation, Energy systems, Observers for nonlinear systems
Abstract: Lithium-ion batteries in parallel connection present significant challenges for monitoring and control due to the coupled dynamics of individual cells and the lack of direct cell-level sensing. The inability to monitor cell-level state-of-charge (SOC), individual branch currents, and key parameters limits the effectiveness of battery management strategies and compromises system safety. This paper proposes an adaptive observer framework that addresses these limitations through a descriptor system formulation capturing the coupled dynamics of parallel-connected cells while enforcing Kirchhoff's laws as algebraic constraints. The method enables simultaneous estimation of cell-level states and key parameters, particularly charge capacity, using exclusively total voltage and current measurements. Numerical simulations under a dynamic current profile demonstrate accurate reconstruction of SOC, cell-level currents, and charge capacities of individual cells, even with large initial condition mismatches and parameter heterogeneity. These results establish the potential of this observer-based solution for battery management systems seeking comprehensive cell-level awareness without additional sensing infrastructure.
|
| |
| 13:45-14:00, Paper FrB09.2 | Add to My Program |
| Machine Learning Detection of Lithium Plating in Lithium-Ion Cells: A Gaussian Process Approach (I) |
|
| Patnaik, Ayush | UC Davis |
| Zufall, Adam | University of California, Davis |
| Robinson, Stephen Kern | University of California Davis |
| Lin, Xinfan | University of California, Davis |
| Fogelquist, Jackson | University of California, Davis |
| Ji, Yiwei | Washington University in St. Louis |
| Bai, Peng | Washington University in St. Louis |
Keywords: Machine learning, Aerospace, Energy systems
Abstract: Lithium plating during fast charging is a critical degradation mechanism that accelerates capacity fade and can trigger catastrophic safety failures. Recent work has shown that plating onset can manifest in incremental-capacity analysis as an additional high-voltage feature above 4.0 V, often appearing as a secondary peak or shoulder distinct from the main intercalation peak complex; however, conventional methods for computing dQ/dV rely on finite differencing with filtering, which amplifies sensor noise and introduces bias in feature location. In this paper, we propose a Gaussian Process (GP) framework for lithium plating detection by directly modeling the charge-voltage relationship Q(V) as a stochastic process with calibrated uncertainty. Leveraging the property that derivatives of GPs remain GPs, we infer dQ/dV analytically and probabilistically from the posterior, enabling robust detection without ad hoc smoothing. The framework provides three key benefits: (i) noise-aware inference with hyperparameters learned from data, (ii) closed-form derivatives with credible intervals for uncertainty quantification, and (iii) scalability to online variants suitable for embedded BMS. Experimental validation on Li-ion coin cells across a range of C-rates (0.2C-1C) and temperatures (0-40^circC) demonstrates that the GP-based method reliably resolves distinct high-voltage secondary peak features under low-temperature, high-rate charging, while correctly reporting no features in non-plating cases. The concurrence of GP-identified differential features, reduced charge throughput, capacity fade measured via reference performance tests, and post-mortem microscopy confirmation supports the interpretation of these signatures as plating-related, establishing a practical pathway for real-time lithium plating detection.
|
| |
| 14:00-14:15, Paper FrB09.3 | Add to My Program |
| Pulse-Based Continual Learning (PCL) for Second-Life Battery SOH Estimation without Catastrophic Forgetting under Data Heterogeneity and Scarcity (I) |
|
| Tao, Shengyu | Chalmers University of Technology |
| Wang, Yezhen | Tsinghua University |
| Jiang, Shida | Univeristy of California, Berkeley |
| Lee, Jaewoong | University of California, Berkeley |
| Moura, Scott | University of California, Berkeley |
| Zhang, Xuan | Tsinghua-Berkeley Shenzhen Institute |
| Zou, Changfu | Chalmers University of Technology |
Keywords: Energy systems, Estimation, Machine learning
Abstract: Second-life batteries are critical to sustainable energy transi-tions, but their diverse historical usage history and limited field data pose major challenges for their reliable health as-sessment. To address this problem, we propose a pulse-based continual learning (PCL) framework with replay-buffer strategy that preserves prior knowledge while adapting to heterogeneous and data-scarce test conditions. To ensure the rapidness of data curation of second-life batteries, rapid pulse current injection are performed at varying state of charge (SOC) levels, widths, amplitudes, and polarizations. The extracted features from those response voltage signals are utilized as the input for PCL model training. The proposed PCL method treats these pulse current test conditions as do-main-incremental tasks and applies a replay-buffer strategy to balance model adaptation and knowledge retention. We performed extensive physical experiments on a broad set of second-life cells, encompassing multiple chemistries (NMC, LMO, LFP), capacities from 2.1 Ah to 35 Ah, and diverse physical forms. The results show that the average SOH esti-mation accuracy across experimental conditions is over 90%, while the forgetting measure for historical tasks is near-zero. The PCL effectively learns new tasks while retaining estab-lished knowledge, delivering accurate and consistent SOH estimates with evolving second-life conditions. Therefore, the proposed PCL method is robust and scalable, providing a pathway toward efficient second-life battery estimation, screening, regrouping with implications for sustainable re-using and recycling.
|
| |
| 14:15-14:30, Paper FrB09.4 | Add to My Program |
| Multi-Goal Mission Planning in Windy Environments Using Boundary Expansion Estimates of Reachable Regions (I) |
|
| Hayes, Kaelea | The Pennsylvania State University |
| Pangborn, Herschel | The Pennsylvania State University |
| Brennan, Sean | The Pennsylvania State University |
Keywords: Aerospace, Flight control, Optimization algorithms
Abstract: Unmanned aerial vehicles frequently operate in outdoor environments subject to wind disturbances. In areas where these wind disturbances are large, for example where wind speeds approach or surpass the maximum vehicle speed, the most energy efficient route from a start to a goal is often not a straight path. Improvements could potentially be obtained by working with the wind, rather than against it. This paper proposes a grid-free method to quickly solve multi-goal path planning problems in strong wind fields. By propagating an approximation of the reachable set boundary forward in discrete time, the method provides a way to determine trajectories between locations in strong wind fields. It also provides an approximation of the region reachable by the vehicle at after a given time interval, meaning that goals outside of this region that must be reached within that interval can be eliminated from consideration. Additionally, a time-based cost for travel between two locations can be determined by the number of set expansions required to connect them. Under the assumption that time of travel and energy consumption are inherently linked, these properties are applied to solve an energy-optimizing version of the traveling salesman problem (TSP) where the solution is the energy-optimizing path between all feasible goals within a strong wind field, namely one whose localized maximum speeds are faster than the vehicle's flight speed.
|
| |
| 14:30-14:45, Paper FrB09.5 | Add to My Program |
| Sparse-Frequency EIS Measurements and Impedance Spectrum Reconstruction for Lithium-Ion Cell Modeling |
|
| Morgan, Timothy | University of Alabama in Huntsville |
| Sahoo, Avimanyu | University of Alabama in Huntsville |
Keywords: Nonlinear systems identification, Energy systems, Automotive systems
Abstract: In this paper, we propose a sparse-frequency EIS measurement and reconstruction framework that (i) acquires impedance data at a limited number of frequencies and (ii) reconstructs the full spectrum using a Chebyshev-based pseudospectral formulation. The frequencies are chosen according to Chebyshev–Lobatto nodes mapped onto a logarithmic frequency axis. The coefficients of the Chebyshev basis are efficiently obtained through a Type-I discrete cosine transform (DCT-I) with end-point half-weighting. Based on this reconstruction, the parameters of the corresponding ECM are then derived analytically, eliminating the need for iterative Nonlinear Least Squares (NLLS) fitting. Furthermore, we provide an analytic characterization of the reconstruction error and establish its dependence on both the frequency range and the number of sampled points. Finally, we experimentally validate the approach on an 18650 lithium-ion cell, showing that sampling at only six frequencies between 0.1 Hz and 1000 Hz is sufficient to reproduce the impedance spectra with high fidelity. In addition, fewer measurements show a 70% reduction in measurement time. Across the full frequency band, the synthesized ECM faithfully overlays the measured Nyquist curves, achieving sub-milliohm absolute error.
|
| |
| 14:45-15:00, Paper FrB09.6 | Add to My Program |
| A Physics-Informed Neural-Network Model for Rechargeable Thick-Electrode Lithium-Metal Battery Cells That Is Accurate at High C-Rate |
|
| Hileman, Wesley Allen | University of Colorado Colorado Springs |
| Lee, Byeong Kil | University of Colorado Colorado Springs |
| Trimboli, Michael | University of Colorado, Colorado Springs |
| Plett, Gregory L. | University of Colorado Colorado Springs |
Keywords: Modeling, Grey-box modeling, Energy systems
Abstract: We construct a physics-informed machine learning (PIML) model for rechargeable thick-electrode lithium-metal battery (LMB) cells by combining an enhanced single-particle model (SPMe) with a feedforward neural network (FNN). The hybrid model incorporates an additional pole necessary for modeling variation in electrochemical variables across the thickness of the electrode and guarantees bounded-input bounded-output stability by design. We train the model against the predictions of a Newman model simulated with the PyBaMM package. The trained PIML model predicts the voltage of a cell with a 200µm electrode at 1.8C with 93% less error than a plain SPMe model while remaining tractable for embedded applications. We provide Python code to simulate the Newman model, simulate the SPMe, and train the PIML model.
|
| |
| FrB10 Regular Session, Grand Salon 15 |
Add to My Program |
| Power Systems |
|
| |
| Chair: Yebra, Luis José | CIEMAT-Plataforma Solar De Almería |
| Co-Chair: You, Pengcheng | Peking University |
| |
| 13:30-13:45, Paper FrB10.1 | Add to My Program |
| DC Shipboard Microgrid Control Using Online Multilayer Neural Network Lifelong Learning |
|
| Shahed, Md Tanvir | Missouri University of Science and Technology |
| Farzanegan, Behzad | Missouri University of Science and Technology |
| Jagannathan, Sarangapani | Missouri Univ of Science & Tech |
Keywords: Power systems, Control applications, Reinforcement learning
Abstract: This work proposes an adaptive optimal control scheme for DC shipboard power systems (SPS) with multiple distributed generators (DGs), leveraging a multilayer neural network (MNN) to enhance the performance of a supplementary energy storage system (ESS) under pulsed power load (PPL) conditions. PPLs impose large, short-duration power demands that challenge system stability due to nonlinear dynamics and operational constraints. The proposed reinforcement learning (RL) framework with lifelong hybrid learning (LHL) ad- dresses these challenges by pursuing three objectives: rapid ESS charging, DC bus voltage regulation, and proportional load current sharing among DG converters. An actor–critic MNN approximates the value function and control policy, where the critic network is updated using a hybrid scheme that refines weights both at and within sampling instants for faster convergence. A weight consolidation mechanism further enables lifelong learning, mitigating catastrophic forgetting and reducing overall cost, while the actor adapts using control input errors. Experimental validation on the OPAL-RT OP4512 hardware-in-the-loop (HIL) platform demonstrates that the proposed controller effectively achieves the control objectives under dynamic load conditions.
|
| |
| 13:45-14:00, Paper FrB10.2 | Add to My Program |
| Learning-Augmented Primal-Dual Control Design for Secondary Frequency Regulation |
|
| Yu, Yixuan | Peking University |
| Bansal, Rajni Kant | Indian Institute of Management, Ahmedabad |
| Jiang, Yan | The Chinese University of Hong Kong, Shenzhen |
| You, Pengcheng | Peking University |
Keywords: Power systems, Smart grid, Machine learning
Abstract: Frequency stability is fundamental to the secure operation of power systems. With growing uncertainty and volatility introduced by renewable generation, secondary frequency regulation - responsible for restoring the nominal system frequency - must now deliver enhanced performance not only in the steady state but also during transients. This paper presents a systematic framework to embed learning in the design of a primal–dual controller that provides provable (potentially exponential) stability and steady-state optimality, while simultaneously improving key transient metrics, including frequency nadir and control effort, in a data-driven manner. In particular, we employ the primal-dual dynamics of an optimization problem that encodes steady-state objectives to realize secondary frequency control with asymptotic stability guarantee. To augment transient performance of the controller via learning, a change of variables on control inputs, which will be deployed by neural networks, is proposed such that under mild conditions, stability and steady-state optimality are preserved. It further allows us to define a learning goal that accounts for the exponential convergence rate, frequency nadir and accumulated control effort, and use sample trajectories to enhance these metrics. Simulation results validate the theories and demonstrate superior transient performance of the learning-augmented primal-dual controller.
|
| |
| 14:00-14:15, Paper FrB10.3 | Add to My Program |
| Improved Voltage Regulation with Optimal Design of Decentralized Volt-VAr Control |
|
| Russell, Daniel | University of Vermont |
| Hamilton, Dakota | The University of Vermont |
| Almassalkhi, Mads | University of Vermont |
| Ossareh, Hamid | University of Vermont |
Keywords: Power systems, Stability of linear systems, Decentralized control
Abstract: Integration of distributed energy resources has created a need for autonomous, dynamic voltage regulation. Decentralized Volt-VAr Control (VVC) of grid-connected inverters presents a unique opportunity for voltage management but, if designed poorly, can lead to unstable behavior when in feedback with the grid. We model the grid-VVC closed-loop dynamics with a linearized power flow approach, leveraging historical data, which shows improvement over the commonly used LinDistFlow model. This model is used to design VVC slopes by minimizing steady-state voltage deviation from the nominal value, subject to a non-convex spectral radius stability constraint, which has not been previously implemented within this context. We compare this constraint to existing convex restrictions and demonstrate, through simulations on a realistic feeder, that using the spectral radius results in more effective voltage regulation.
|
| |
| 14:15-14:30, Paper FrB10.4 | Add to My Program |
| A Spatio-Temporal Attention and Transformer Framework for Hybrid Attack Detection in Cyber-Physical Systems |
|
| Lau, Clement Kai Xuen | Shanghai Jiao Tong University |
| Wu, Jing | Shanghai Jiao Tong University |
| Long, Chengnian | Shanghai Jiao Tong University |
Keywords: Power systems
Abstract: In this paper, we present a Spatio-Temporal Graph with Deep Embedded Clustering framework (STaG-DEC) that jointly detects, localizes, and classifies hybrid threats. It employs a Graph Convolutional Attention Network (GCAT) to extract spatial dependencies and fuses the resulting embeddings with temporally differenced measurements via concatenation and linear projection before the Transformer encoder, which suppresses slow operational drift, highlights abrupt attack-induced transients, and preserves long-range dependencies. To address class imbalance and overlapping patterns, standard crossentropy with a DEC-based joint loss is proposed that promotes compact and better-separated latent clusters and stabilizes minority-class decision boundaries, thereby enabling precise node-level localization and classification across varying attack distributions. Experiments on IEEE 14-bus system demonstrate our superior detection performance with notably improved replay detection and robust behavior under imbalanced conditions.
|
| |
| 14:30-14:45, Paper FrB10.5 | Add to My Program |
| Microgrids Optimal Radial Reconfiguration and Islanding Via FORWARD Algorithm |
|
| Vendrell Gallart, Joan | University of California Irvine |
| Kia, Solmaz S. | University of California Irvine (UCI) |
| Bent, Russell | Los Alamos National Laboratory |
Keywords: Smart grid, Power systems, Optimization algorithms
Abstract: Designing optimal radial configurations for microgrids involves Mixed-Integer Non-Linear Programming (MINLP) formulations that are often computationally prohibitive for large-scale systems. To overcome this, we reformulate the distribution optimization problem—where generator locations and capacities are predetermined—into a set-valued optimization framework. This allows us to leverage the recently proposed FORWARD algorithm to efficiently explore the radial configuration space while strictly adhering to physical power flow constraints. The resulting feasible configurations serve as high-quality suboptimal solutions or as effective warm-start points for conventional MINLP solvers. The key innovation lies in abstracting the microgrid network structure to enable rapid topological search while maintaining coupling with physical power flow requirements. Our approach introduces several strategic modifications: a phase-expanded node representation for multi-phase AC systems, physics-informed weight functions incorporating impedance characteristics, and constraint integration strategies that embed power system limitations directly within the sampling process. Numerical evaluations on IEEE benchmarks and synthetic networks of up to 400 nodes demonstrate polynomial-time performance, whereas traditional MINLP solvers often fail beyond 70 buses. The algorithm achieves solutions in seconds with competitive quality, enabling real-time microgrid management and large-scale distribution~planning.
|
| |
| 14:45-15:00, Paper FrB10.6 | Add to My Program |
| Dynamic Modeling and Control of a Parabolic Trough Solar Collection Line for Power Generation |
|
| Yebra, Luis José | CIEMAT-Plataforma Solar De Almería |
| Rhinehart, R. Russell | Oklahoma State Univ. - Retired |
Keywords: Modeling, Distributed parameter systems, Process Control
Abstract: This is an application paper that provides a simulator and a control algorithm for a thermal collection line in a parabolic trough collector for solar generation of electricity. The simulation is designed to match the TCP-100 facility of the Plataforma Solar de Almeria, a full-scale CIEMAT research center, in Spain. Of interest to the research community would be the reveal of control issues and a solution that includes two nonlinear algorithms, explanation of a simulator that can serve as a benchmark for control algorithm testing, and comprehensive issues that need to be included in evaluating control algorithms. Of interest to the practitioner community would be the use of simple nonlinear control techniques embedded in a traditional cascade structure to solve nonlinear and deadtime issues. The simulator includes nominal direct normal irradiance (DNI) that changes over the day; random variations in DNI, optical efficiency, inlet temperature, ambient losses, and flow friction factor; and error on all measurements (calibration and noise).
|
| |
| FrB11 Regular Session, Grand Salon 16 |
Add to My Program |
| Robotics II |
|
| |
| Chair: Stiffler, Nicholas | University of Dayton |
| Co-Chair: Han, Feng | New York Institute of Technology |
| |
| 13:30-13:45, Paper FrB11.1 | Add to My Program |
| Learning Human Gait with Muscle Control and Metabolic Cost Integration |
|
| Drewing, Nadine | Technical University of Darmstadt |
| Firas, Al-Hafez | TU Darmstadt |
| Guoping, Zhao | Southeast University |
| Peters, Jan | Max-Planck Institute |
| Seyfarth, Andre | TU Darmstadt |
| Findeisen, Rolf | TU Darmstadt |
| Ahmad Sharbafi, Maziar | TU Darmstadt |
Keywords: Robotics, Biomedical, Reinforcement learning
Abstract: Human locomotion involves complex coordination of over-actuated muscle systems and joints, making simulation and control design highly challenging. While recent reinforcement and imitation learning methods can replicate human-like kinematics, they often fail to produce physiologically realistic force patterns, largely due to the limited availability and consideration of reference force plate or electromyography (EMG) data. This paper presents a hybrid imitation learning framework that integrates muscle-driven simulations with reinforcement learning to address over-actuation and to account for the fact that the available data reflects closed-loop actions involving muscle control. A key contribution is the incorporation of metabolic cost into the reward function, shaping energetically efficient and physiologically plausible controllers. Simulation results demonstrate that the learned policies generate muscle activations and ground reaction forces that align more closely with OpenSim references and experimental data than standard imitation learning. The method provides a scalable tool for developing and validating closed-loop control strategies for assistive systems such as exoskeletons and prostheses, while highlighting broader implications for learning-based control of over-actuated biomechanical systems.
|
| |
| 13:45-14:00, Paper FrB11.2 | Add to My Program |
| Enhanced-Balance Stabilization of Underactuated Robots with Learned Equilibrium Manifold |
|
| Chen, Lixuan | New York Institute of Technology |
| Han, Feng | New York Institute of Technology |
Keywords: Robotics, Control applications, Machine learning
Abstract: Existing control designs for underactuated balance robots often rely on static equilibrium, restricting system autonomy. While dynamic equilibrium resolves this, it is computationally demanding to obtain, especially under unknown dynamics. This paper proposes a machine learning-based dynamic equilibrium prediction framework that captures inherent dynamic coupling directly from motion data, requiring no explicit equilibrium data for training. Leveraging this predicted equilibrium improves control design, enabling simultaneous task execution of the actuated subsystem and robust balance stabilization of the unactuated subsystem. Simulations validate the framework's effectiveness and enhanced performance.
|
| |
| 14:00-14:15, Paper FrB11.3 | Add to My Program |
| Modeling and Control of Multirotor Aerial Vehicles with Telescoping Extension Arm Morphology |
|
| Dauchert, Samuel | University of South Carolina |
| Zhang, Xin | University of Southern Maine |
| Stiffler, Nicholas | University of Dayton |
| Wang, Xiaofeng | University of South Carolina |
Keywords: Robotics, Modeling, Predictive control for nonlinear systems
Abstract: This paper presents a novel multirotor design featuring telescoping extension arm morphology (TEAM), together with a generalized control allocation framework. TEAM enables continuous and asymmetric arm extension, offering a level of adaptability not achievable with prior morphing designs, which were constrained to symmetric or discrete arm configurations. To manage the nonlinear actuator interactions introduced by TEAM, we introduce a joint mapping loop based on model predictive control (MPC). This formulation simultaneously determines propeller speeds and arm lengths, providing optimal control allocation in real time. The framework integrates seamlessly with standard cascaded flight control architectures: conventional position and attitude controllers remain unchanged, while MPC operates as a third loop with minimal structural modification. This combined design–control approach represents the first generalized framework for morphing multirotors with fully telescoping arms. The simulation results demonstrate that the proposed system achieves accurate trajectory tracking under aggressive maneuvers and center-of-mass offsets caused by payloads, highlighting its potential for robust and efficient aerial logistics.
|
| |
| 14:15-14:30, Paper FrB11.4 | Add to My Program |
| Synergy-Driven Prosthetic Hand Control for Dexterous In-Hand Manipulation |
|
| Wang, Chu | University of Melbourne |
| Mohammadi, Alireza | University of Melbourne |
| Eden, Jonathan Paul | The University of Melbourne |
| Tan, Ying | The University of Melbourne |
| Choong, Peter | The University of Melbourne |
| Oetomo, Denny | The University of Melbourne |
Keywords: Robotics, Human-in-the-loop control, Simulation
Abstract: Dexterous in-hand manipulation (DIM) remains a major challenge for prosthetic hand control, as users must generate complex multi-joint motions from low-dimensional inputs. Standard principal component analysis (PCA) captures static postural synergies but ignores continuous movement over time, limiting smooth finger coordination and practical prosthetic control. We apply multivariate functional PCA (mfPCA) to model joint trajectories as multivariate functional data, capturing both the temporal dynamics and the inter-joint coordination. Simulations with a high-degree-of-freedom robotic hand show that, with sufficient components, mfPCA-reconstructed trajectories closely reproduce the original object manipulation at joint and task levels. To enable practical low-dimensional control, we introduce a clustering-based strategy that maps synergy weights onto a low-dimensional plane, supporting stable task reproduction and adjustable variation. These results provide a framework for natural, low-burden prosthetic hand control in DIM tasks.
|
| |
| 14:30-14:45, Paper FrB11.5 | Add to My Program |
| Control Separation and Coordination of Trajectory Tracking and Balance Stabilization for Underactuated Balance Robots |
|
| Han, Feng | New York Institute of Technology |
Keywords: Robotics, Predictive control for nonlinear systems, Control applications
Abstract: Control input for the actuated subsystem and unactuated subsystem in underactuated robots’ control is separated into two groups. A coordinated scheme is proposed to design control for each subsystem to achieve trajectory tracking and stabilization, including a computed torque control for actuated subsystem tracking and an MPC-based control for unactuated subsystem balance. The MPC-based scheme in a subspace simultaneously predicts equilibrium and solves control design by embedding the unactuated dynamics as a nonlinear algebraic constraint. The effectiveness of the coordinated control is verified by extensive validations and comparisons.
|
| |
| 14:45-15:00, Paper FrB11.6 | Add to My Program |
| Force-Augmented LIPM–MPC for Stable Humanoid Loco–Manipulation |
|
| Kim, Baekseok | University of Nevada, Las Vegas |
| Oh, Paul | University of Nevada Las Vegas |
Keywords: Robotics, Control applications, Optimal control
Abstract: Humanoid robots must maintain stability under external forces during loco-manipulation tasks such as cart pushing or ladder carrying. This paper proposes a model predictive control (MPC) framework that extends the Linear Inverted Pendulum Model (LIPM) by incorporating external wrenches as additive offsets in the zero-moment point (ZMP) formulation. Through an equivalent ZMP transformation, the approach preserves the linear and convex structure of the MPC problem, enabling real-time feasibility while retaining physical interpretability. The method was validated in MuJoCo simulations with a full-size humanoid robot under various manipulation forces. Results demonstrate stable locomotion during lateral disturbances, ladder transportation, wheelbarrow maneuvering, and heavy-object pushing. These findings show that the proposed framework bridges the gap between efficient LIPM-based walking and robust multi-contact manipulation, advancing humanoid capability in real-world tasks.
|
| |
| FrB12 Regular Session, Grand Salon 18 |
Add to My Program |
| Discrete Event Systems |
|
| |
| Chair: Liu, Ruotian | Polytechnic University of Bari |
| Co-Chair: Zenati, Abdelhafid | City Univesity of London |
| |
| 13:30-13:45, Paper FrB12.1 | Add to My Program |
| Efficient Robustness Verification for Non-Blocking Discrete Event Systems |
|
| Dony, Md Nur-A-Adam | Tennessee Technological University |
| Rizvi, Syed Ali Asad | Tennessee Technological University |
Keywords: Discrete event systems, Automata, Supervisory control
Abstract: This paper introduces a mathematical framework that helps us understand when a discrete event system (DES) begins to experience blocking due to transition deletions. We have developed a string-based analysis to determine if non-blocking conditions persist following these deletions. Our contributions can be summarized in three key points: a state-specific, necessary-and-sufficient characterization of blocking caused by transition deletions, a definition of robust deletions that maintain non-blocking conditions, and an algorithm that identifies critical transitions through minimal blocking sets. This algorithm streamlines computation by focusing on structured candidates instead of considering all possible deletions. We present case studies in manufacturing and autonomous vehicles that effectively identify critical transitions and blocking scenarios while significantly reducing complexity.
|
| |
| 13:45-14:00, Paper FrB12.2 | Add to My Program |
| Multiple Global Secrets Protection in Discrete Event Systems Via Integer Linear Programming |
|
| Liu, Ruotian | Polytechnic University of Bari |
| Duan, Wei | Xidian Universisty |
| Hu, Shaopeng | Xidian University |
| Mangini, Agostino Marcello | Politecnico Di Bari |
| Fanti, Maria Pia | Polytechnic of Bari |
Keywords: Discrete event systems, Optimization, Automata
Abstract: This work addresses the problem of protecting multiple global secrets in discrete event systems modeled by nondeterministic finite automata. Specifically, a global secret in a system is assumed to be composed of one or more states, each assigned a specific security level. We say that a state is protected if any sequence of events leading to it from the initial state contains a quantity of protected events that is equal to or greater than the required security level. Correspondingly, a global secret is said to be protected if the cumulative weight of its containing protected states (or all containing states are considered in the worst-case scenario) satisfies a user-defined protection threshold. Our objective is to develop an event protection policy that is capable of protecting global secrets. To do so, we build an augmented automaton which eliminates the difficulty of analyzing the infinite sequences when verifying protection. Then, an optimal protection policy with minimum cost can be obtained by solving an integer linear programming problem. Examples are given to illustrate the effectiveness of our proposed protection strategy.
|
| |
| 14:00-14:15, Paper FrB12.3 | Add to My Program |
| Liveness, Reachability, and Reversibility of Signal Interpreted Petri Nets |
|
| Köhler, Andreas | University of Duisburg-Essen |
| Zhang, Ping | University of Duisburg-Essen |
Keywords: Discrete event systems, Petri nets, Process Control
Abstract: This paper proposes a set of novel sufficient conditions that guarantee three behavioral properties in signal interpreted Petri nets (SIPNs), i.e., liveness, reachability, and reversibility. SIPNs provide a modeling formalism for representing the control algorithm of discrete manufacturing systems. The liveness, reachability, and reversibility properties ensure that the desired control actions remain perpetually executable, the system states are reachable, and that the system can always return to its initial state, respectively. The sufficient conditions are derived based on the Petri net state equation and the enabling rules of the transitions in SIPNs. Moreover, it is shown how the reachability and reversibility can be computationally verified based on an integer linear programming problem. The computational complexity for verifying the properties is polynomial with respect to the number of markings in the SIPN when the reachable set is already available.
|
| |
| 14:15-14:30, Paper FrB12.4 | Add to My Program |
| Modular Control of Critical Observability and Weak Opacity in Discrete Event System |
|
| Miao, Shaowen | The Hong Kong University of Science and Technology (Guangzhou) |
| Komenda, Jan | Czech Academy of Sciences |
| Ji, Yiding | Hong Kong University of Science and Technology (Guangzhou) |
| Yin, Xiang | Shanghai Jiao Tong University |
Keywords: Discrete event systems, Supervisory control, Automata
Abstract: Large-scale systems found in technological frameworks often consist of interacting modules, where ensuring security is critical. In discrete-event systems, normality ensures that, based on an system observations, one can unambiguously determine whether the corresponding plant behavior is safe. In this paper, we first investigate the computation of an earlier form of normality, known as (L,P)-normality, in modular discrete-event systems. We show that, under the condition that all shared events are observable, the supremal (L,P)-normal sublanguages can be computed in a modular fashion. We then apply these results to the enforcement of two related notions that share similar physical interpretations with (L,P)-normality---namely, critical observability and weak opacity. By integrating our contributions with existing results on controllability and normality, we develop a modular supervisory control approach to enforce these properties and provide examples to illustrate the enforcement framework.
|
| |
| 14:30-14:45, Paper FrB12.5 | Add to My Program |
| Well-Posedness and Stability Analysis of Positive System Networks under Link-Dependent Delays |
|
| Bell, Madeleine | City, University of London |
| Zenati, Abdelhafid | City Univesity of London |
| Aouf, Nabil | City University of London |
Keywords: Networked control systems, Stability of nonlinear systems, Delay systems
Abstract: Nonlinear positive system networks with heterogeneous time-delay communications exhibit strong nonlinear effects that are highly sensitive to delays, which complicates their analysis and limits the generalisation of existing results. This paper investigates the stability and well-posedness of such networks. We first establish two fundamental properties of the considered dynamics. Well-posedness guarantees that communication delays do not introduce singularities into the solution and ensures that, for any admissible initial conditions, the system admits a unique trajectory evolving continuously over time. Positivity ensures that all trajectories remain non-negative for admissible initial conditions. Building upon these properties, we develop a new analysis framework that employs the theory of sequences of functions to construct a less-conservative linear Lyapunov function candidate. We show that stability hinges on the definiteness of a newly derived matrix-valued function guided by Barbalat's theorem. The theoretical results are validated through numerical simulations, demonstrating the effectiveness of the obtained results.
|
| |
| FrB13 Regular Session, Grand Salon 19 |
Add to My Program |
| Agent Based Systems I |
|
| |
| Chair: Lian, Bosen | Auburn University |
| Co-Chair: Xue, Wenqian | University of Florida |
| |
| 13:30-13:45, Paper FrB13.1 | Add to My Program |
| Fully Distributed GNE Algorithms for Multi-Robot Placement without Consensus on Multipliers |
|
| Yin, Shao-An | University of Minnesota |
| Hong, Mingyi | Iowa State University |
| Elia, Nicola | University of Minnesota |
Keywords: Agents-based systems, Autonomous robots, Autonomous systems
Abstract: Recent machine learning research has increasingly focused on equilibrium analysis in non-cooperative games rather than solely on optimal solutions. Many such problems involve shared constraints and can be formulated as Generalized Nash Equilibrium Problems (GNEPs). For strongly monotone games, existing methods compute consensus-based variational GNEs (v-GNEs) by exchanging Lagrange multipliers. We propose a fully distributed continuous-time algorithm for shared linear equality constraints that converges without multiplier exchange and reaches any GNE, reducing communication overhead and improving privacy. Discrete-time schemes are also provided, and the method is validated on a multi-robot placement task.
|
| |
| 13:45-14:00, Paper FrB13.2 | Add to My Program |
| Feature-Based Perception-Aware Multi-UAV Trajectory Planning |
|
| Yang, Teaya | University of California, Berkeley |
| Brommer, Christian | Oak Ridge Associated Universities |
| Mueller, Mark W. | University of California, Berkeley |
Keywords: Agents-based systems, Autonomous systems, Cooperative control
Abstract: Multi-agent UAV systems are well-suited for large-scale data collection and transportation, though navigation in unstructured, GPS-denied environments remains challenging. Vision-based navigation enables operation without external infrastructure, but the uncertainty in state estimation limits its reliability in multi-agent settings. We propose a trajectory planning framework that incorporates estimator uncertainty by exploiting visual feature observations between agents. The framework maintains a coherent shared map through multi-agent frame alignment to prevent independent vision drift, and employs a perception-aware reward that favors trajectories with stronger feature visibility and cross-agent redundancy. Flight data from a controlled two-UAV experiment demonstrate that our alignment module can effectively reduce relative distance error, validating its role in maintaining inter-agent consistency. Simulations show that perception-aware rewards improve feature visibility and coordination while maintaining goal-reaching performance.
|
| |
| 14:00-14:15, Paper FrB13.3 | Add to My Program |
| Dynamic Coalitions in Games on Graphs with Preferences Over Temporal Goals |
|
| Yilmaz, And Kaan Ata | University of Texas at Austin |
| Kulkarni, Abhishek | Vijil, Inc |
| Topcu, Ufuk | The University of Texas at Austin |
Keywords: Agents-based systems, Autonomous systems, Discrete event systems
Abstract: In multiplayer games with sequential decision-making, self-interested agents may form and dissolve temporary coalitions to maximize satisfaction of temporal goals beyond individual capabilities. We study a class of multiplayer games played on graphs where, in each round, a designated leader declares a coalition it joins and prescribes a joint action. Coalition members must comply unless their assigned action is inadmissible. The remaining agents act independently using admissible strategies. Each agent has temporal goals prioritized by qualitative preferences. We present a unified framework that integrates coalition formation with rational strategy synthesis under partial-order preferences. First, we generalize classical admissibility by introducing maximal sure-winning strategies to track goal satisfaction across evolving coalitions. Second, we define a multi-dimensional value function that captures preference-aware guarantees for all agents and enables efficient coalition rationality analysis. Third, we develop a polynomial-time algorithm to synthesize admissible strategies for all agents and proves their existence for this class of games. Experiments in a blocks-world domain demonstrate the advantages of dynamic over fixed coalitions, revealing counter-intuitive behaviors: agents with aligned preferences may fail to cooperate, while those with conflicting goals may form beneficial coalitions.
|
| |
| 14:15-14:30, Paper FrB13.4 | Add to My Program |
| Control of Heterogeneous Multi-Agent Systems with Active Leader |
|
| Xu, Yicheng | University of California Irvine |
| Jabbari, Faryar | Univ. of California at Irvine |
Keywords: Agents-based systems, LMIs, Distributed control
Abstract: This paper investigates the distributed output feedback control problem for heterogeneous multi-agent systems under a leader-following architecture with guaranteed L_2 performance. A scalable synthesis framework is developed, where the design of the leader state observer and the local dynamic output feedback controller are decoupled, thereby significantly reducing the computational complexity for large-scale networks. The proposed approach enables each agent to implement the controller using only local information. Linear matrix inequality conditions are derived to guarantee desirable tracking and disturbance attenuation. The dimension of the LMIs to be solved does not increase with the number of agents. The feasibility condition is established, while the effectiveness is demonstrated through a simulation example.
|
| |
| 14:30-14:45, Paper FrB13.5 | Add to My Program |
| Distributed Model Predictive Control with Delay for Formation of Multi-Agent Systems |
|
| Chiang, Ming-Li | National Taiwan University |
| Yang, Qi-Hong | National Taiwan Ocean University |
Keywords: Agents-based systems, Cooperative control, Distributed control
Abstract: In this paper, we propose a distributed model predictive control (DMPC) framework for cooperative formation of multi-agent systems that composed of unmanned aerial vehicles (UAVs) and an unmanned ground vehicle (UGV). The designed system considers formation maintenance, collision avoidance under information delays, uncertain deployments, and obstacle environments. A linear recursive estimation with error-shift compensation reconstructs the delayed neighbor trajectories, and the risk-aware re-optimization mechanism refines control when inter-agent distances fall below a threshold. Simulations results along with real-world experiments are provided to demonstrate robust formation tracking and collision avoidance.
|
| |
| 14:45-15:00, Paper FrB13.6 | Add to My Program |
| Data-Driven Min-Max Strategy for Multiplayer Multiagent Systems |
|
| Zhang, Yizhong | Auburn University |
| Colon, Christopher | Auburn University |
| Lian, Bosen | Auburn University |
Keywords: Agents-based systems, Reinforcement learning, Optimal control
Abstract: The paper designs data-driven distributed optimal consensus control for networked multiple dynamic systems, with each system (representing an agent) having multiple control inputs (representing players). We formulate differential graphical games that incorporate the evaluation of continuous-time multiplayer multiagent systems (MMSs), the cost functions of all players in all agents, and the communication network. The min-max strategy is designed for each player to prepare the optimal response to the worst-case reactions from the other players in the same and different agents. This strategy provides each player with distributed algebraic Riccati equations (AREs) and designs the distributed optimal control based on local feedback, further certified by ensured solutions in AREs, the achievement of min-max strategy, and the asymptotic stability of local error systems. The model-free, data-driven reinforcement learning (RL) algorithm is further developed to solve the min-max solutions using online sampled local trajectories without knowing system dynamics. Simulations verify the strategy and algorithm.
|
| |
| FrB14 Regular Session, Grand Salon 21 |
Add to My Program |
| Machine Learning III |
|
| |
| Chair: Tembine, Hamidou | UQTR and Timadide |
| Co-Chair: Baheri, Ali | Rochester Institute of Technology |
| |
| 13:30-13:45, Paper FrB14.1 | Add to My Program |
| Diffusion-Based Decentralized Federated Multi-Task Representation Learning |
|
| Kang, Donghwa | Iowa State University |
| Moothedath, Shana | Iowa State University |
Keywords: Statistical learning, Decentralized control, Agents-based systems
Abstract: We develop Dif-AltGDmin, a diffusion-based decentralized federated algorithm for multi-task representation learning (MTRL). We consider multi-task linear regression where multiple models share a common low-dimensional representation, and propose an alternating projected gradient descent and minimization approach to recover the shared low-rank feature matrix without a central coordinator. We provide provable guarantees on sample and iteration complexity to achieve ε-accuracy, and show that Dif-AltGDmin achieves significantly reduced communication complexity compared to existing decentralized approaches. Numerical simulations confirm that our algorithm matches the convergence of its centralized counterpart while remaining communication-efficient.
|
| |
| 13:45-14:00, Paper FrB14.2 | Add to My Program |
| Sliced Distribution Matching Based on Cumulative Distribution Functions with Applications to Control |
|
| Tzikas, Alexandros | Stanford University |
| Jamgochian, Arec | Stanford University |
| Ure, Nazim Kemal | Stanford University |
| Kochenderfer, Mykel | Stanford University |
| Boyd, Stephen | Stanford University |
Keywords: Statistical learning, Optimization, Stochastic optimal control
Abstract: Computing the similarity between two probability distributions is a recurring theme across control. We introduce a unified family of distances between the probability distributions of two random variables that is based on the discrepancy between the cumulative distribution functions of random linear one-dimensional projections of the random variables. Our proposed distance is interpretable, computationally simple, and admits a differentiable approximation. We establish asymptotic theoretical guarantees for sample-based estimators of the distance. We empirically study the use of the distance in a two-sample test and demonstrate its ability to distinguish different distributions. Finally, we show that the distance allows for simple gradient-based solutions in control by studying distribution steering and ergodic control.
|
| |
| 14:00-14:15, Paper FrB14.3 | Add to My Program |
| Logarithmic Regret and Polynomial Scaling in Online Multi-Step-Ahead Prediction |
|
| Qian, Jiachen | University of California San Diego |
| Zheng, Yang | University of California San Diego |
Keywords: Statistical learning, Stochastic systems, Kalman filtering
Abstract: This paper studies the problem of online multi-step-ahead prediction for unknown linear stochastic systems. Using conditional distribution theory, we derive an optimal parameterization of the prediction policy as a linear function of future inputs, past inputs, and past outputs. Based on this characterization, we propose an online least-squares algorithm to learn the policy and analyze its regret relative to the optimal model-based predictor. We show that the online algorithm achieves logarithmic regret with respect to the optimal Kalman filter in the multi-step setting. Furthermore, with new proof techniques, we establish an almost-sure regret bound that does not rely on fixed failure probabilities for sufficiently large horizons N. Finally, our analysis also reveals that, while the regret remains logarithmic in N, its constant factor grows polynomially with the prediction horizon H, with the polynomial order set by the largest Jordan block of eigenvalue 1 in the system matrix.
|
| |
| 14:15-14:30, Paper FrB14.4 | Add to My Program |
| CHMAS: A Coupled Hierarchical Framework for Multi-Agent Reinforcement Learning |
|
| Wang, Dongming | University of California, Riverside |
| Xu, Jie | University of California, Riverside |
| Zhang, Yanyu | University of California, Riverside |
| Ren, Wei | University of California, Riverside |
Keywords: Reinforcement learning, Hierarchical control, Agents-based systems
Abstract: Multi-agent reinforcement learning (MARL) systems face fundamental challenges in balancing global coordination with local execution across different temporal scales. This paper introduces the Coupled Hierarchical Multi-Agent System (CHMAS), a novel framework that decomposes multi-agent decision-making into centralized strategic planning and distributed tactical execution with bidirectional information flow. The strategic layer integrates all agents' states with an exclusive global environmental state to generate guidance actions every T timesteps, while tactical agents execute distributed policies augmented by strategic guidance and local neighborhood observations. Unlike existing hierarchical approaches with unidirectional control, CHMAS establishes a feedback mechanism where accumulated tactical rewards influence strategic objectives through a coupling coefficient lambda, ensuring strategic plans remain grounded in tactical feasibility. To address the non-stationarity inherent in hierarchical learning, we propose an asynchronous update protocol where strategic parameters update every N_f tactical episodes, allowing tactical policies to converge to quasi-stationary points between strategic changes. We present both a general bi-level formulation capturing full system dynamics and a tractable additive approximation enabling rigorous analysis. Theoretical analysis proves that this asynchronous scheme achieves mathcal{O}(log K/sqrt{K}) convergence for the strategic layer after K strategic updates under standard assumptions. Experimental validation in a multi-agent foraging domain demonstrates successful learning of spatially partitioned exploration strategies, with both layers converging stably despite hierarchical coupling.
|
| |
| 14:30-14:45, Paper FrB14.5 | Add to My Program |
| Density-Ratio Weighted Behavioral Cloning: Learning Control Policies from Corrupted Datasets |
|
| Karpoora Sundara Pandian, Shriram | Rochester Institute of Technology |
| Baheri, Ali | Rochester Institute of Technology |
Keywords: Reinforcement learning, Learning, Machine learning
Abstract: Offline reinforcement learning (RL) enables policy optimization from fixed datasets, making it suitable for safety-critical applications where online exploration is infeasible. However, these datasets are often contaminated by adversarial poisoning, system errors, or low-quality samples, leading to degraded policy performance in standard behavioral cloning (BC) and offline RL methods. This paper introduces Density-Ratio Weighted Behavioral Cloning (Weighted BC), a robust imitation learning approach that uses a small, verified clean reference set to estimate trajectory-level density ratios via a binary discriminator. These ratios are clipped and used as weights in the BC objective to prioritize clean expert behavior while down-weighting or discarding corrupted data, without requiring knowledge of the contamination mechanism. We provide theoretical guarantees on convergence to the clean policy, finite-sample bounds, and robustness under varying contamination levels. A comprehensive evaluation framework is established, incorporating diverse poisoning protocols (reward, state, transition, and action) on D4RL continuous control benchmarks. Experiments demonstrate that Weighted BC maintains near-optimal performance even at high contamination ratios outperforming baselines such as traditional BC, BCQ, and BRAC.
|
| |
| 14:45-15:00, Paper FrB14.6 | Add to My Program |
| Debiased GPT in Mean-Field-Type Games |
|
| Tembine, Hamidou | UQTR and Timadie |
Keywords: Neural networks, Game theory
Abstract: Bias pervades classical approximation methods in machine learning, including neural networks, kernel-based estimators, and density estimators, as well as advanced techniques such as multi-level Monte Carlo when applied to density-dependent functionals. Although some approaches can be rendered asymptotically unbiased under infinite computational budgets, such conditions are unattainable in practice. This limitation is particularly acute in mean-field-type games with Nemytskii-type state coefficients and payoffs, which arises in generative machine intelligence in audio, video, and image domains. Existing methodologies for these problems such as score function estimation inherit and propagate structural bias through all stages of computation. Here we introduce a debiasing framework for mean-field-type transformers that eliminates bias across these settings.
|
| |
| FrB15 Regular Session, Grand Salon 22 |
Add to My Program |
| Stability of Nonlinear Systems I |
|
| |
| Chair: Rogers, Jonathan | Naval Surface Warfare Center, Philadelphia Division |
| Co-Chair: Liu, Jun | University of Waterloo |
| |
| 13:30-13:45, Paper FrB15.1 | Add to My Program |
| A Lyapunov-Based Small-Gain Theorem for Fixed-Time Stability |
|
| Tang, Michael | University of California, San Diego |
| Krstic, Miroslav | University of California, San Diego |
| Poveda, Jorge I. | University of California, San Diego |
Keywords: Stability of nonlinear systems, Lyapunov methods
Abstract: This paper introduces a novel Lyapunov-based small-gain methodology for establishing fixed-time stability (FxTS) guarantees in interconnected dynamical systems. Specifically, we consider interconnections in which each subsystem admits an individual fixed-time input-to-state stability (ISS) Lyapunov function that certifies FxT-ISS. We then show that if a nonlinear small-gain condition is satisfied, then the entire interconnected system is FxTS. Our results are analogous to existing Lyapunov-based small-gain theorems developed for asymptotic and finite-time stability, thereby filling an important gap in the stability analysis of interconnected dynamical systems. The proposed theoretical tools are further illustrated through analytical and numerical examples, including the first result on fixed-time feedback optimization of dynamical systems without time-scale separation between the plant and the controller.
|
| |
| 13:45-14:00, Paper FrB15.2 | Add to My Program |
| Stability Analysis of Fast Extremum Seeking Control for Wiener Systems Using Online Complex Curve Fitting |
|
| Palacios Roman, Juan Javier | Eindhoven University of Technology |
| van Berkel, Matthijs | Dutch Institute for Fundamental Energy Research |
| Heemels, W.P.M.H. (Maurice) | Eindhoven University of Technology |
| van Keulen, Thijs | Eindhoven University of Technology |
Keywords: Stability of nonlinear systems, Adaptive control, Optimization
Abstract: In this paper, we show uniform semi-global practical asymptotic stability of fast extremum seeking control (ESC) for single-input single-output Wiener systems. While classic ESC requires a time-scale separation between plant and dither, the fast ESC method circumvents this time-scale separation by exploiting limited knowledge of the frequency response of the linear part of the Wiener system, thereby achieving faster convergence. The assumptions under which the fast ESC method works are relaxed compared to existing work and explicit bounds on the design parameters of the fast ESC scheme are provided. A numerical case study illustrates the enhanced convergence and the robustness of the fast ESC method.
|
| |
| 14:00-14:15, Paper FrB15.3 | Add to My Program |
| Period-Aware Asymptotic Gain with Application to a Periodically Forced Synchronization Circuit |
|
| Ponomarev, Anton | Karlsruhe Institute of Technology |
| Gröll, Lutz | Karlsruhe Institute of Technology |
| Hagenmeyer, Veit | Karlsruhe Institute of Technology (KIT) |
Keywords: Stability of nonlinear systems, Computational methods, Power electronics
Abstract: The classical asymptotic gain (AG) is a concept known from the input-to-state stability theory. Given a uniform input bound, AG estimates the asymptotic bound of the output. Sometimes, however, more information is known about the input than just a bound. In this paper we consider the case of a periodic input. Under the assumption that the system converges to a periodic solution, we introduce a new gain, called period-aware asymptotic gain (PAG), which employs periodicity to enable a sharper asymptotic estimation of the output. Since the PAG can distinguish between short-period ("high-frequency") and long-period ("low-frequency") signals, it is able to rigorously quantify such properties as bandwidth, resonant behavior, and high-frequency damping. We discuss how the PAG can be computed and illustrate it with a numerical example from the field of power electronics.
|
| |
| 14:15-14:30, Paper FrB15.4 | Add to My Program |
| On the Instability of Nesterov's ODE under Non-Conservative Vector Fields |
|
| Ochoa, Daniel E. | University of California Santa Cruz |
| Abdelgalil, Mahmoud | University of California, San Diego |
| Poveda, Jorge I. | University of California, San Diego |
Keywords: Stability of nonlinear systems, Hybrid systems, Optimization
Abstract: We study the instability of Nesterov's ODE in non-conservative settings, where the driving term is not necessarily the gradient of a potential function. While convergence properties under Nesterov's ODE are well-characterized for settings with gradient-based driving terms, we show that the presence of emph{arbitrarily small} non-conservative terms can lead to instability. To resolve the instability issue, we study a regularization mechanism based on restarting. The resulting system is a hybrid dynamical system that mirrors Nesterov's ODE during intervals of flow, and implements restarts of the momentum state through discrete periodic jumps. For this system, we establish novel explicit bounds on the resetting period that ensure the decrease of a suitable Lyapunov function, guaranteeing stability as well as "accelerated" convergence rates under suitable smoothness and monotonicity properties on the driving term. Numerical simulations support our results.
|
| |
| 14:30-14:45, Paper FrB15.5 | Add to My Program |
| A Converse Control Lyapunov Theorem for Joint Safety and Stability |
|
| Quartz, Thanin | University of Waterloo |
| Fitzsimmons, Maxwell | University of Waterloo |
| Liu, Jun | University of Waterloo |
Keywords: Stability of nonlinear systems, Lyapunov methods, Constrained control
Abstract: We show that the existence of a strictly compatible pair of control Lyapunov and control barrier functions is equivalent to the existence of a single smooth Lyapunov function that certifies both asymptotic stability and safety. This characterization complements existing literature on converse Lyapunov functions by establishing a partial differential equation (PDE) characterization with prescribed boundary conditions on the safe set, ensuring that the safe set is exactly certified by this Lyapunov function. The result also implies that if a safety and stability specification cannot be certified by a single Lyapunov function, then any pair of control Lyapunov and control barrier functions necessarily leads to a conflict and cannot be satisfied simultaneously in a robust sense.
|
| |
| 14:45-15:00, Paper FrB15.6 | Add to My Program |
| Control Oriented Real-Time Nonlinear Dynamic Pseudo Inverse for Non-Minimum Phase Systems |
|
| Sempertegui, Miguel | Ohio University |
| Zhu, J. Jim | Ohio University |
| Lawrence, Douglas A. | Ohio University |
Keywords: Nonlinear output feedback, Control applications, Stability of nonlinear systems
Abstract: This paper presents a novel algorithm for the dynamic inversion of nonlinear systems within the output tracking control framework, where the inverse serves as an open-loop nominal controller to facilitate feedback stabilization of the tracking error dynamics along a nominal trajectory. The desired dynamic response for the open-loop controlled system is treated as a pseudo identity whereby the causal and stable pseudo inverse is defined. For non-minimum phase systems, the Fliess canonical form is used to enable a practical pseudo inverse design without internal dynamics stabilization or reverse-time previewing. Unlike exact inversion, which does not have the desired dynamics, and can be unstable or anti-causal, the proposed pseudo inverse not only provides the desired dynamics of the open-loop controlled systems but is also stable and implementable in real time. A nontrivial application example illustrates the algorithm.
|
| |
| FrB16 Regular Session, Grand Salon 24 |
Add to My Program |
| Neural Networks |
|
| |
| Chair: Xu, Xiangru | University of Wisconsin-Madison |
| Co-Chair: Ruths, Justin | University of Texas at Dallas |
| |
| 13:30-13:45, Paper FrB16.1 | Add to My Program |
| Forward and Backward Reachability Analysis of Closed-Loop Recurrent Neural Networks Via Hybrid Zonotopes |
|
| Zhang, Yuhao | University of Wisconsin-Madison |
| Xu, Xiangru | University of Wisconsin-Madison |
Keywords: Neural networks, Machine learning, Formal verification/synthesis
Abstract: Recurrent neural networks (RNNs) are widely employed to model complex dynamical systems due to their hidden-state structure, which inherently captures temporal dependencies. This work presents a hybrid zonotope–based approach for computing exact forward and backward reachable sets of closed-loop RNN systems with ReLU activation functions. The method formulates state-pair sets to compute reachable sets as hybrid zonotopes without requiring unrolling. To improve scalability, a tunable relaxation scheme is proposed that ranks unstable ReLU units across all layers using a triangle-area score and selectively applies convex relaxations within a fixed binary limit in the hybrid zonotopes. This scheme enables an explicit trade-off between computational complexity and approximation accuracy, with exact reachability as a special case.
|
| |
| 13:45-14:00, Paper FrB16.2 | Add to My Program |
| State-Constrained Online Adaptive Control for Robotic Manipulators |
|
| Dey, Ritirupa | University of South Carolina |
| Sahoo, Avimanyu | University of Alabama in Huntsville |
| Narayanan, Vignesh | University of South Carolina |
Keywords: Neural networks, Constrained control, Data driven control
Abstract: In this letter, we present a neural network (NN)-based online adaptive controller for an n-link robot manipulator. The proposed controller utilizes a control barrier function (CBF) to enforce joint position and velocity constraints while simultaneously guaranteeing accurate trajectory tracking of the robot. In particular, we first map the constraints from the state space into the filtered-tracking error space. We demonstrate that maintaining forward invariance of the constrained error set guarantees constraint satisfaction in the state space. Additionally, we analytically establish that this transformation creates a boundary-to-boundary relationship between the two sets. The resulting NN-driven control torque ensures accurate trajectory tracking of the closed-loop robot manipulator while adhering to the prescribed state constraints even in the presence of uncertain dynamics. Finally, we validate the effectiveness of the proposed control scheme both analytically and numerically.
|
| |
| 14:00-14:15, Paper FrB16.3 | Add to My Program |
| Safety-Critical Adaptive Spiking Multilayer Neural Network Control of Nonlinear Systems |
|
| Ganie, Irfan Ahmad | Wilkes University |
| Jagannathan, Sarangapani | Missouri University of Science & Technology |
Keywords: Neural networks, Adaptive control, Robotics
Abstract: This paper introduces a novel safety-critical adaptive spiking neural network (SNN) control framework for uncertain nonlinear systems. On the safety side, we formulate the Augmented Barrier States (ABS) methodology into a unified safety-embedded tracking framework, establishing rigorous safety equivalence results that embed safety constraints directly within the closed-loop dynamics. On the learning side, a deep SNN architecture is developed with online adaptation enabled through direct error-driven weight update laws applied at every layer. Unlike conventional gradient approaches that struggle with instability due to the discontinuous nature of spike generation, the proposed method incorporates singular value decomposition (SVD) to regularize gradient flow, enhancing stability and convergence during online learning. The resulting controller guarantees uniformly ultimately bounded tracking while strictly preserving safety constraints for nonlinear systems, ensuring both adaptability and formal safety guarantees. Simulation studies on two-link manipulator validate the framework, demonstrating robust safety preservation, efficient real-time adaptation, and 80% computational energy savings compared to existing ANN-based approaches.
|
| |
| 14:15-14:30, Paper FrB16.4 | Add to My Program |
| TRASE-NODEs: Trajectory Sensitivity-Aware Neural Ordinary Differential Equations for Efficient Dynamic Modeling |
|
| Al-Janahi, Fatima | The University of Texas at Austin |
| Ko, Min-Seung | The University of Texas at Austin |
| Zhu, Hao | The University of Texas at Austin |
Keywords: Neural networks, Nonlinear systems identification, Power systems
Abstract: Modeling dynamical systems is crucial across the science and engineering fields for accurate prediction, con- trol, and decision-making. Recently, machine learning (ML) approaches, particularly neural ordinary differential equations (NODEs), have emerged as a powerful tool for data-driven modeling of continuous-time dynamics. Nevertheless, standard NODEs require a large number of data samples to remain consistent under varying control inputs, posing challenges to generate sufficient simulated data and ensure the safety of control design. To address this gap, we propose trajectory-sensitivity- aware (TRASE-)NODEs, which construct an augmented system for both state and sensitivity, enabling simultaneous learning of their dynamics. This formulation allows the adjoint method to update gradients in a memory-efficient manner and ensures that time-invariant control set-point effects are captured in the learned dynamics. We evaluate TRASE-NODEs using damped oscillator and inverter-based resources (IBRs). The results show that TRASE-NODEs generalize better from the limited training data, yielding lower prediction errors than standard NODEs for both examples. The proposed framework offers a data-efficient, control-oriented modeling approach suitable for dynamic systems that require accurate trajectory sensitivity prediction.
|
| |
| 14:30-14:45, Paper FrB16.5 | Add to My Program |
| Exact Minimal Perturbations to Quantify Robustness of Neural Networks |
|
| Chen, Justin | University of Texas at Dallas |
| Ruths, Justin | University of Texas at Dallas |
Keywords: Neural networks, Optimization algorithms, Emerging control applications
Abstract: Certification of neural networks supports their integration into commercial and safety-critical applications with formal guarantees. The Lipschitz constant, often used to certify neural networks, is a problematic proxy for robustness. Recent work on adversarial attacks has made it clear that the output of neural networks can change dramatically under minor perturbations. We expand the use of the hybrid zonotope representation of ReLU feedforward neural networks to calculate the minimum perturbation required to change a network's classification.
|
| |
| 14:45-15:00, Paper FrB16.6 | Add to My Program |
| Neural Hybrid Equations: Models, Basic Properties, and Approximation Results |
|
| Ochoa, Daniel E. | University of California Santa Cruz |
| Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Hybrid systems, Neural networks
Abstract: This paper establishes approximation results for neural network approximators of hybrid dynamical systems. We propose neural hybrid equations, which are hybrid systems that use neural networks as the flow map and the jump map while preserving the hybrid structure. Our main result proves that solutions to neural hybrid equations, with Lipschitz non-affine activation functions, can approximate solutions of nominal hybrid systems with Lipschitz vector fields with arbitrary precision on compact time domains. For the subclass of neural hybrid equations with rectifying linear unit (ReLU) activation functions, we establish O(1/N) convergence rates where N is the number of neurons. We illustrate our results with a numerical example.
|
| |
| FrB17 Regular Session, Churchill A1 |
Add to My Program |
| Stochastic Optimal Control I |
|
| |
| Chair: Leahy, Kevin | Worcester Polytechnic Institute |
| Co-Chair: Ma, Tong | Northeastern University |
| |
| 13:30-13:45, Paper FrB17.1 | Add to My Program |
| Belief Space Control of Safety-Critical Systems under State-Dependent Measurement Noise |
|
| Walia, Rohan | Worcester Polytechnic Institute |
| Black, Mitchell | MIT Lincoln Laboratory |
| Schoer, Andrew | MIT Lincoln Laboratory |
| Leahy, Kevin | Worcester Polytechnic Institute |
Keywords: Stochastic optimal control, Emerging control applications, Autonomous systems
Abstract: Safety-critical control is imperative for deploying autonomous systems in the real world. Control Barrier Functions (CBFs) offer strong safety guarantees when accurate system and sensor models are available. However, widely used additive, fixed-noise models are not representative of complex sensor modalities with state-dependent error characteristics. Although CBFs have been designed to mitigate uncertainty using fixed worst-case bounds on measurement noise, this approach can lead to overly-conservative control. To solve this problem, we extend the Belief Control Barrier Function (BCBF) framework to accommodate state-dependent measurement noise via the Generalized Extended Kalman Filter (GEKF) algorithm, which models measurement noise as a linear function of the state. Using the original BCBF framework as baseline, we demonstrate the performance of the BCBF-GEKF approach through simulation results on a 1D single integrator setpoint tracking scenario and a trajectory tracking scenario using 2D unicycle kinematics. Our results confirm that the BCBF-GEKF approach offers less conservative control with greater safety.
|
| |
| 13:45-14:00, Paper FrB17.2 | Add to My Program |
| Chance-Constrained Covariance Steering for Discrete-Time Markov Jump Linear Systems |
|
| Shrivastava, Shaurya | Purdue University |
| Oguri, Kenshiro | Purdue University |
Keywords: Stochastic optimal control, Switched systems, Optimization
Abstract: In this paper, we solve the chance-constrained covariance steering problem for discrete-time Markov Jump Linear Systems (MJLS) using a convex optimization framework. We derive the analytical expressions for the mean and covariance trajectories of time-varying discrete-time MJLS and show that they cannot be separated even without chance constraints, unlike the single-mode (non-hybrid) dynamics case. To solve the covariance steering problem, we propose a two-step convex optimization framework and then incorporate chance constraints through an iterative framework. Since the stochastic hybrid nature of MJLS introduces non-Gaussian distributions unlike single-mode covariance steering, we also propose deterministic formulations for non-Gaussian chance constraints based on Cantelli's inequality and the multivariate Chebyshev inequality. Both problems are originally nonconvex, and we derive convex relaxations which are proved to be lossless at optimality using the Karush–Kuhn–Tucker (KKT) conditions. Numerical simulations demonstrate the proposed method by achieving target covariances while respecting chance constraints under additive noise, bias, and Markovian jump dynamics.
|
| |
| 14:00-14:15, Paper FrB17.3 | Add to My Program |
| Stabilization and Optimality of Pursuer and Evader Policies for Stochastic Differential Games |
|
| Haddad, Wassim M. | Georgia Inst. of Tech |
| Chitre, Ronit | Graduate Research Assistant |
Keywords: Stochastic optimal control, Optimal control, Stochastic systems
Abstract: This paper examines a two-player, zero-sum stochastic differential game problem over an infinite time horizon, where the players employ controller (pursuer) and stopper (evader) policies for a nonlinear system driven by Brownian motion. The pursuer's goal is to minimize a nonlinear-nonquadratic performance criterion while guaranteeing stochastic stabilization and the evader's goal is to maximize the performance criterion. By establishing a connection between stochastic Lyapunov stability theory and the stochastic Hamilton-Jacobi-Isaacs equation, we derive explicit optimal strategies for both players that guarantee stochastic stabilization as well as enforcing a saddle point condition on a nonlinear nonquadratic performance criterion. We demonstrate that global asymptotic stability in probability is ensured through a Lyapunov function, which also serves as the solution to the steady-state stochastic Hamilton-Jacobi-Isaacs equation, thereby guaranteeing both closed-loop stability in probability and optimality. Furthermore, we develop optimal feedback controller and stopper policies for affine nonlinear systems using an inverse optimality framework tailored to the game problem. These results extend existing linear feedback pursuer-evader policies to nonlinear controllers and stoppers that optimize general polynomial and multilinear performance criteria.
|
| |
| 14:15-14:30, Paper FrB17.4 | Add to My Program |
| Partial Stability and Optimal Control for Stochastic Dynamical Systems |
|
| Haddad, Wassim M. | Georgia Inst. of Tech |
| Chitre, Ronit | Graduate Research Assistant |
Keywords: Stochastic optimal control, Optimal control, Stochastic systems
Abstract: This paper presents a unified framework for the optimal nonlinear analysis and feedback control design of stochastic dynamical systems, specifically targeting partial stability and partial-state stabilization. We ensure partial asymptotic stability in probability for the closed-loop nonlinear system using a Lyapunov function that is both positive definite and decrescent with respect to a subset of the system state. This Lyapunov function corresponds to the steady-state solution of the stochastic Hamilton-Jacobi-Bellman equation, thereby establishing both probabilistic partial stability and optimality. The proposed framework lays the underpinning for generalizing linear-quadratic optimal control strategies to more complex settings involving nonlinear, nonquadratic, and partial-state stochastic stabilization problems. We also explore the connection between our approach and optimal regulation in both linear and nonlinear time-varying stochastic systems with quadratic and more general cost functionals. In addition, we develop optimal feedback controllers for affine stochastic nonlinear systems through an inverse optimality approach specifically designed for the partial-state stabilization setting.
|
| |
| 14:30-14:45, Paper FrB17.5 | Add to My Program |
| Stochastic Model Predictive Control for Colloidal Self-Assembly: A Markov State Model-Based Case Study |
|
| Rao, Jingzhi | Beijing University of Chemical Technology |
| Sun, Wei | Louisiana State University |
| Tang, Xun | Louisiana State University |
Keywords: Stochastic optimal control, Process Control, Markov processes
Abstract: Controlling colloidal self-assembly into specific configurations for multi-functional materials has attracted vast interests among material scientists, control engineers, and computer scientists. While various feedback control strategies have demonstrated efficacy in rapidly steering the colloidal particles into desired configurations, dealing the stochasticity still stands as a challenge. Here we investigate the application of stochastic model predictive control (sMPC) on an electric field-mediated colloidal self-assembly process, simulated by Markov state models. Specifically, we examine the robustness of sMPC with respect to model accuracy, considering the challenge of obtaining accurate process models for a high-dimensional, stochastic many-body assembly system in practice. Our findings indicate that with a terminal reward objective function, the proposed sMPC controller can yield satisfactory results, and the sMPC also tends to hold robustness against modeling error. We anticipate the work presented here to stimulate further investigations on applying sMPC for colloidal and nano-particle self-assembly systems.
|
| |
| 14:45-15:00, Paper FrB17.6 | Add to My Program |
| Data-Conforming Model-Free Stochastic Reinforcement Learning |
|
| Athalye, Surabhi | Georgia Institute of Technology |
| Ramadan, Mohammad | Argonne National Laboratory |
| Anitescu, Mihai | Argonne National Laboratory |
| Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Stochastic optimal control, Reinforcement learning, Lyapunov methods
Abstract: In this paper, we address the concern of potential changes in the state-input distribution during the policy update step of model-free reinforcement learning (RL). In most RL settings, the control policy is updated based on previously recorded data, and then evaluated over the system's future trajectory. However, the new policy may alter the closed-loop behavior and result in future trajectories that are out of distribution with respect to the past learning data, possibly compromising the system's safety due to this discrepancy. To tackle this, we propose a framework that penalizes distributional shifts during RL execution. We incorporate a regularization term that systematically dampens deviations in the distribution in the state-input space, facilitating incremental exploration. Simulations demonstrate the efficacy of our approach.
|
| |
| FrB18 Regular Session, Churchill A2 |
Add to My Program |
| Fault Tolerant Systems I |
|
| |
| Chair: Bejarano, Francisco Javier | Instituto Politécnico Nacional, ESIME Ticomán |
| Co-Chair: Cheng, Shiyu | Washington University in St. Louis |
| |
| 13:30-13:45, Paper FrB18.1 | Add to My Program |
| Progress-Based Fault Detection and Health-Aware Task Allocation for Heterogeneous Multi-Robot Systems |
|
| Cline, Jack | California Polytechnic State University |
| Macaranas, Christian | California Polytechnic State University |
| Farzan, Siavash | California Polytechnic State University |
Keywords: Fault detection, Autonomous robots, Cooperative control
Abstract: We present a progress-based fault detection module and its integration with dynamic task allocation for heterogeneous robot teams. The detector monitors a normalized task-completion signal with a lightweight Kalman filter (KF) and a normalized innovation squared (NIS) test, augmented with a low-rate stall gate, an uncertainty gate, and debounce logic. Health estimates influence the allocator via health-weighted costs and health-dependent masks; reallocation is event-triggered and regularized with an l1 assignment-change penalty to limit reassignment churn while preserving feasibility through slack variables. The detector has constant per-robot update cost, and the allocation remains a convex quadratic program (QP). Experiments on a common team-task setup evaluate measurement-noise increases, velocity-slip biases, communication dropouts, and task abandonment. The results show timely detection in the noise and bias cases, maintained task completion with limited reassignment, and the expected observability delays under communication dropouts.
|
| |
| 13:45-14:00, Paper FrB18.2 | Add to My Program |
| Estimation of Unknown Inputs for Singular Systems: Application to Fault Detection |
|
| Correu Olivares, Roberto | Instituto Politécnico Nacional |
| Bejarano, Francisco Javier | Instituto Politécnico Nacional, ESIME Ticomán |
Keywords: Fault detection, Estimation, Linear systems
Abstract: A fault estimator is proposed for singular linear systems subject to disturbances. This model accounts for faults that explicitly appear both in the state equation and in the system outputs (i.e., system, actuator and sensor faults). A key aspect of this design is that it does not require estimating either the state vector or the disturbances, which provides greater flexibility in structural conditions compared to the requirements needed to implement an unknown input observer. Furthermore, the number of unknown inputs may exceed the number of available outputs. The faults are formulated through an algebraic expression involving high-order derivatives of the system output. In this way, the recovery of the fault signals is achieved using a high-order sliding mode differentiator, which requires that the derivative of the faults maintains a bounded norm.
|
| |
| 14:00-14:15, Paper FrB18.3 | Add to My Program |
| Early Fault Detection and Diagnosis in Closed-Loop Control Systems Using Relay Feedback Limit Cycles |
|
| Chaudhari, Yogesh | Indian Institute of Technology Guwahati |
| Majhi, Somanath | Indian Institute of Technology Guwahati |
Keywords: Fault detection, Fault diagnosis, Closed-loop identification
Abstract: Utilizing relay-induced limit cycles to stimulate system dynamics while preserving closed-loop stability, this paper proposes an innovative fault detection and diagnosis (FDD) architecture for closed-loop control systems. The methodology integrates analytical Describing Function (DF) modeling with Fast Fourier Transform (FFT) to provide a compact, fault-sensitive feature vector capable of detecting minor parametric deviations often masked by feedback dynamics. Fault isolation is achieved by comparing test feature vectors with a database of known fault signatures using weighted cosine similarity (WCS) and a confidence-based classification scheme. This integrated method supports diagnostic and prognostic goals, makes finding early signs of problems easier, and encourages a shift toward maintenance based on conditions. A validation study is performed on a relay-excited DC motor model, demonstrating its suitability for real-time, safety-critical applications while maintaining scalability and interpretability.
|
| |
| 14:15-14:30, Paper FrB18.4 | Add to My Program |
| Sensor and Threshold Selection for Safe Fault Detection |
|
| Clark, Andrew | Washington University in St. Louis |
| Cheng, Shiyu | Washington University in St. Louis |
Keywords: Fault detection, Fault tolerant systems, Observers for Linear systems
Abstract: Control systems can be severely impacted by node faults due to natural failures or malicious attacks. This paper considers two design problems for unknown input observer (UIO)-based fault detection schemes. First, we consider the problem of choosing optimal detection thresholds to ensure the fault is detected before safety violations occur while avoiding false alarms. We derive bounds on the thresholds to ensure that any constant-magnitude fault can be detected. Second, we investigate the problem of ensuring that the fault detector has adequate information to detect and isolate faults. We derive sufficient conditions for the existence of a UIO as a function of the available sensors, and prove that the problem of selecting a minimum-size set of neighbors is equivalent to matroid intersection, enabling polynomial-time approximation algorithms. Our simulation results confirm that our approach is able to detect faults before safety violations occur.
|
| |
| 14:30-14:45, Paper FrB18.5 | Add to My Program |
| Data-Driven, Redundancy-Free Fault-Tolerant Inertial Navigation System Based on Sensor-To-Sensor Identification |
|
| Bani Hani, Odai | Iowa State University |
| Al Janaideh, Mohammad | University of Guelph |
| Aljanaideh, Khaled | American University of Sharjah |
Keywords: Fault detection, Fault tolerant systems
Abstract: Inertial navigation systems (INS) use measurements from inertial sensors (accelerometers and gyroscopes) to estimate the location of a moving object. The estimated location can be used by a control system to move the object to a desired location. If one of the sensors used by the INS becomes faulty during the system operation, the estimated location becomes inaccurate, which can lead to damages to the system or the environment. Therefore, in safety-critical applications such as aircraft navigation, redundant sensors are used to detect and disconnect faulty sensors, which can be expensive. In this paper, we propose a data-driven, redundancy-free fault-tolerant inertial navigation system based on sensor-to-sensor identification. We identify sensor-to-sensor models between sensors from a single inertial measurements unit (IMU). These models, which are identified under healthy conditions of the INS system, are used along with a subset of sensor measurements to predict the response of the remaining sensor measurements. If a sensor becomes faulty, then we replace its actual faulty measurement with a healthy virtual measurement obtained using the identified sensor-to-sensor model and the remaining sensor measurements. We apply the proposed approach to two experimental setups, namely, a rotating platform and a moving vehicle. The proposed approach can be used in commercial applications to ensure low cost safe operation of the INS system.
|
| |
| 14:45-15:00, Paper FrB18.6 | Add to My Program |
| Fundamental Limitations of Sensitivity Metrics for Anomaly Impact Analysis in LTI Systems |
|
| Dong, Jingwei | Uppsala University |
| Zhang, Kangkang | Imperial College London |
| Nguyen, Anh Tung | Uppsala University |
| Teixeira, André M. H. | Uppsala University |
Keywords: Fault detection, Optimization, Linear systems
Abstract: This study establishes a connection between the output-to-output gain (OOG), a sensitivity metric quantifying the impact of stealthy attacks, and a novel input-to-input gain (IIG) introduced to evaluate fault sensitivity under disturbances, and investigates their fundamental performance limitations arising from the transmission zeros of the underlying dynamical system. Inspired by the OOG, which characterizes the maximum performance loss caused by stealthy attacks, the IIG is proposed as a new measure of robust fault sensitivity and is defined as the maximum energy of undetectable faults for a given disturbance intensity. Then, using right (for OOG) and left (for IIG) co-prime factorizations, both metrics are expressed as the H infinity norm of a ratio of the numerator factors. This unified representation facilitates a systematic analysis of their fundamental limitations. Subsequently, by utilizing the Poisson integral relation, theoretical bounds for the IIG and OOG are derived, explicitly characterizing their fundamental limitations imposed by system non-minimum phase (NMP) zeros. Finally, a numerical example is employed to validate the results.
|
| |
| FrB19 Regular Session, Churchill B1 |
Add to My Program |
| Optimal Control IV |
|
| |
| Chair: Chakravorty, Suman | Texas A&M University |
| Co-Chair: Usevitch, James | Brigham Young University |
| |
| 13:30-13:45, Paper FrB19.1 | Add to My Program |
| A Sequential Quadratic Programming Perspective on Optimal Control |
|
| Abhijeet, Fnu | Texas A&M University |
| Chakravorty, Suman | Texas A&M University |
Keywords: Optimal control, Optimization algorithms, Computational methods
Abstract: This paper offers a unified perspective on different approaches to the solution of optimal control problems through the lens of constrained sequential quadratic programming. In particular, it allows us to find the relationships between Newton’s method, the iterative LQR (iLQR), and Differential Dynamic Programming (DDP) approaches to solve the problem. It is shown that the iLQR is a principled SQP approach, rather than simply an approximation of DDP by neglecting the Hessian terms, to solve optimal control problems that can be guaranteed to always produce a cost-descent direction and converge to an optimum; while Newton’s approach or DDP do not have similar guarantees, especially far from an optimum. Our empirical evaluations on the pendulum and cart-pole swing-up tasks serve to corroborate the SQP-based analysis proposed in this paper.
|
| |
| 13:45-14:00, Paper FrB19.2 | Add to My Program |
| Optimal Optical Inter-Satellite Link Tracking Via Gauss-Seidel and Riccati Iterations |
|
| Garcia, Daniel | University of New Mexico |
| Danielson, Claus | University of New Mexico |
Keywords: Optimal control, Optimization algorithms, Predictive control for nonlinear systems
Abstract: Modern Optical Intersatellite Links require sub-degree pointing accuracy at a target satellite moving relative to the ego-spacecraft. This precise pointing presents significant challenges for spacecraft attitude control systems. This paper develops a novel nonlinear optimization algorithm that avoids linearizing the dynamics on the SO(3) manifold, exploiting the causal structure of the problem using block Gauss-Seidel iterations and Riccati iterations of the state and co-state to achieve optimal reference tracking of a moving reference. The proposed algorithm achieves linear complexity in the planning horizon and maintains exact SO(3) manifold constraints throughout the optimization process. Simulations demonstrate superior tracking performance compared to a linearized attitude planning, while maintaining feasible computation speed. These results demonstrate that our algorithm provides a viable solution for next-generation spacecraft attitude control systems requiring precise reference tracking capabilities.
|
| |
| 14:00-14:15, Paper FrB19.3 | Add to My Program |
| Predictive Control Barrier Functions for Discrete-Time Linear Systems with Unmodeled Delays |
|
| Paredes Salazar, Juan Augusto | University of Maryland, Baltimore County |
| Usevitch, James | Brigham Young University |
| Goel, Ankit | University of Maryland Baltimore County |
Keywords: Optimal control, Sampled-data control, Predictive control for linear systems
Abstract: This paper introduces a predictive control barrier function (PCBF) framework for enforcing state constraints in discrete-time systems with unknown relative degree, which can be caused by input delays or unmodeled input dynamics. Existing discrete-time CBF formulations typically require the construction of auxiliary barrier functions when the relative degree is greater than one, which complicates implementation and may yield conservative safe sets. The proposed PCBF framework addresses this challenge by extending the prediction horizon to construct a CBF for an associated system with relative degree one. As a result, the superlevel set of the PCBF coincides with the safe set, simplifying constraint enforcement and eliminating the need for auxiliary functions. The effectiveness of the proposed method is demonstrated on a discrete-time double integrator with input delay and a bicopter system with position constraints.
|
| |
| 14:15-14:30, Paper FrB19.4 | Add to My Program |
| Sampling-Based Global Optimal Control and Estimation Via Semidefinite Programming |
|
| Groudiev, Antoine | École Normal Supérieure, PSL Research University |
| Schramm, Fabian | Inria, École Normal Supérieure, PSL Research University |
| Berthier, Eloďse | ENSTA, Institut Polytechnique De Paris |
| Carpentier, Justin | Inria, École Normal Supérieure, PSL Research University |
| Dümbgen, Frederike | Inria, École Normal Supérieure, PSL Research University |
Keywords: Optimization, Optimal control, Estimation
Abstract: Global optimization has gained attention over the past decades, thanks to the development of both theoretical foundations and efficient numerical routines. Among recent advances, Kernel Sum of Squares (KernelSOS) provides a powerful theoretical framework, combining the expressivity of kernel methods with the guarantees of SOS optimization. In this paper, we take KernelSOS from theory to practice and demonstrate its use on challenging control and robotics problems. We identify and address the practical considerations required to make the method work in applied settings: restarting strategies, systematic calibration of hyperparameters, methods for recovering minimizers, and the combination with fast local solvers. As a proof of concept, the application of KernelSOS to robot localization highlights its competitiveness with existing SOS approaches that rely on heuristics and handcrafted reformulations to render the problem polynomial. Even in the high-dimensional, non-parametric setting of trajectory optimization with simulators treated as black boxes, we demonstrate how KernelSOS can be combined with fast local solvers to uncover higher-quality solutions without compromising overall runtimes.
|
| |
| 14:30-14:45, Paper FrB19.5 | Add to My Program |
| Optimal Finite-Thrust Maneuver Augmentation for Aerobraking Missions |
|
| Dasyam, Amrutha | Wichita State University |
| Dutta, Atri | Wichita State University |
Keywords: Optimization, Optimal control, Spacecraft control
Abstract: Aerobraking maneuvers use atmospheric drag over multiple atmospheric passes to gradually decelerate a spacecraft and reduce its orbital energy for an eventual capture into a low-altitude orbit. This paper proposes a thrust-augmentation strategy closer to the periapsis to enforce dynamic pressure and heating constraints. We present dynamic pressure-based heuristic throttle laws within a per-pass optimization framework to compute thrust augmentation needed alongside apoapsis trim maneuvers for accomplishing the aerobraking maneuver. Numerical simulations for a representative Mars aerobraking scenario and comparison of the heuristics with a per-pass optimization framework demonstrate that a ramp-down throttle strategy at the beginning of periapsis passage can yield good quality solution with reduced computational time compared to optimized throttle levels.
|
| |
| 14:45-15:00, Paper FrB19.6 | Add to My Program |
| Exhaustive-Serve-Longest Control for Multi-Robot Scheduling Systems |
|
| Merati, Mohammad | Boston University |
| Castanon, David | Boston University |
Keywords: Optimization, Queueing systems, Robotics
Abstract: We study online task allocation for multi-robot, multi-queue systems with stochastic arrivals at discrete locations and switching delays. We formulate the problem in discrete time; each location can host at most one robot per time slot, servicing a task consumes one slot, switching between locations incurs a one-slot travel delay, and arrivals are independent Bernoulli processes. We formulate a discounted-cost Markov decision process and propose Exhaustive-Serve-Longest (ESL), a simple real-time policy that serves exhaustively when the current location is nonempty and, when idle, switches to a longest unoccupied nonempty location. We prove the optimality of such policy under symmetry conditions on the arrival processes at each location. We compare in simulation the performance of the ESL policy with two well-known policies: a first-come, first-serve policy and a fixed-dwell cyclic policy. Across variations in server–location ratios and loads, ESL consistently yields lower discounted holding cost and smaller mean queue lengths, with action-time fractions showing more serving and restrained switching. When the symmetry conditions are violated, we compare the performance of the ESL policy on smaller problems with an optimal dynamic programming solution, and determine that the ESL performance is within 2% of the optimal performance. Its simplicity and robustness make ESL a practical default for real-time multi-robot scheduling systems.
|
| |
| FrB20 Regular Session, Churchill B2 |
Add to My Program |
| Model Predictive Control III |
|
| |
| Chair: Milios, Elias Lido Celestino | Robert Bosch GmbH |
| Co-Chair: Ramezani, Alireza | Northeastern University |
| |
| 13:30-13:45, Paper FrB20.1 | Add to My Program |
| Tube-Based Robust MPC for Variable Air Volume Systems in Airport Terminals |
|
| Zhang, Liguo | Beijing University of Technology |
| He, Kaichao | Beijing University of Technology |
| Zhan, Jingyuan | Beijing University of Technology |
| Shi, Rui | Beijing University of Technology |
Keywords: Building and facility automation, Grey-box modeling, Predictive control for linear systems
Abstract: This paper presents a tube-based model predictive control (Tube-MPC) framework for multi-zone heating, ventilation, and air conditioning (HVAC) systems in airport terminals. The proposed method explicitly addresses system uncertainties arising from passenger flow fluctuations and weather variations, ensuring robust satisfaction of thermal comfort and operational constraints. By incorporating disturbance-invariant sets into the control design, the controller maintains feasible operation while improving resilience to external perturbations. Simulation studies on a five-zone airport terminal case demonstrate that, compared with conventional MPC, the proposed Tube-MPC approach provides stronger disturbance rejection in the hall zone while maintaining the comfort range.
|
| |
| 13:45-14:00, Paper FrB20.2 | Add to My Program |
| Contract-Based Hierarchical Control Using Predictive Feasibility Value Functions |
|
| Berkel, Felix | Robert Bosch GmbH |
| Wabersich, Kim Peter | Robert Bosch GmbH |
| Xiang, Hongxi | ETH Zurich |
| Milios, Elias Lido Celestino | Robert Bosch GmbH |
Keywords: Predictive control for nonlinear systems, Hierarchical control, Machine learning
Abstract: Today's control systems are often characterized by modularity and safety requirements to handle complexity, resulting in the use of hierarchical control structures. Although hierarchical model predictive control offers favorable properties, achieving a provably safe, yet modular design remains a challenge. This paper introduces a contract-based hierarchical control strategy to improve the performance of control systems facing challenges related to model inconsistency and independent controller design across hierarchies. We consider a setup where a higher-level controller generates references that affect the constraints of a lower-level controller, which is based on a soft-constrained MPC formulation. The optimal slack variables of the lower-level MPC serve as the basis for a contract that allows the higher-level controller to assess the feasibility of the reference trajectory without exact knowledge of the model, constraints, and cost of the lower-level controller. To ensure computational efficiency while maintaining model confidentiality, we propose using an explicit function approximation, such as a neural network, to represent the cost of optimal slack values. The approach is tested for a hierarchical control setup consisting of a planner and a motion controller as commonly found in autonomous driving.
|
| |
| 14:00-14:15, Paper FrB20.3 | Add to My Program |
| Statistically Consistent Approximate Model Predictive Control |
|
| Milios, Elias Lido Celestino | Robert Bosch GmbH |
| Wabersich, Kim Peter | Robert Bosch GmbH |
| Berkel, Felix | Robert Bosch GmbH |
| Gruber, Felix | Robert Bosch GmbH |
| Zeilinger, Melanie N. | ETH Zurich |
Keywords: Predictive control for nonlinear systems, Machine learning, Neural networks
Abstract: Model Predictive Control (MPC) offers rigorous safety and performance guarantees but is computationally intensive. Approximate MPC aims to circumvent this drawback by learning a computationally cheaper surrogate policy. Common approaches focus on imitation learning (IL) via behavioral cloning (BC). However, BC fundamentally fails to provide accurate approximations when MPC solutions are set-valued due to non-convex constraints or local minima. We propose a two-stage IL procedure to accurately approximate nonlinear, potentially set-valued MPC policies. The method integrates an approximation of the MPC's optimal value function into a one-step look-ahead loss function, and thereby embeds the MPC's constraint and performance objectives into the IL objective. We prove statistical consistency for policies that exactly minimize our IL objective, implying convergence to a safe and stabilizing control law, and establish input-to-state stability guarantees for approximate minimizers. Simulations demonstrate improved performance compared to BC.
|
| |
| 14:15-14:30, Paper FrB20.4 | Add to My Program |
| Scalable and Modular Control Framework for Managing Active Cell Balancing Challenges |
|
| Ahmed, Afaq | COMSATS University Islamabad |
| Uppal, Ali Arshad | COMSATS University Islamabad |
| Ahmed, Qadeer | The Ohio State University |
Keywords: Predictive control for nonlinear systems, Modeling, Control applications
Abstract: Active cell balancing (ACB) is an important paradigm in an electric vehicle’s (EV) battery pack. However, in designing the ACB network (ACBN), the interconnections among cells and the selection of power electronic components lead to large variations. Therefore, to evaluate the performance of these diversifications, a modular and scalable framework is required. Consequently, this work utilizes graph theory to model and evaluate the ACBN performance for EV range extension. In this respect, a buck-boost (BB) converter-based architecture, which comprises N series adjacent cells, is modeled using the multiplex (M)–a variant of a graph. Moreover, the high-fidelity expressions for balancing currents are integrated with M. Building on that, a nonlinear model predictive control (NMPC) is formulated and subsequently solved to perform ACB. Similarly, the stability argument for the NMPC based on N cells is provided, and the provision to perform ACB for various ACBN types under the graph-NMPC-based framework is discussed. The simulations are performed for various reallife driving scenarios, and the results demonstrate that the BB converter-based architecture has been able to extend the average EV range up to 32 km under NMPC-based balancing.
|
| |
| 14:30-14:45, Paper FrB20.5 | Add to My Program |
| Contact-Implicit MPC for Multimodal Locomotion |
|
| Venkatesh Krishnamurthy, Kaushik | Northeastern University |
| Salagame, Adarsh | Northeastern University |
| Sihite, Eric | California Institute of Technology |
| Ramezani, Alireza | Northeastern University |
Keywords: Predictive control for nonlinear systems, Optimal control, Modeling
Abstract: Multimodal legged-aerial robots offer unique advantages in navigating complex environments by seamlessly transitioning between terrestrial and aerial locomotion modes. However, coordinating propulsive forces from thrusters with ground contact forces presents significant control challenges. This paper presents a contact-implicit model predictive control(CI-MPC) framework that unifies the control of legged and aerial subsystems without predefined contact schedules. We formulate the problem using a 2D thruster-assisted hopper as a reduced-order model, embedding Moreau time-stepping scheme within the MPC dynamics constraints to handle contact forces through differential inclusion and complementarity conditions. This formulation allows ground reaction forces to emerge naturally from the physical constraints. Simulation results demonstrate the framework’s capability to traverse diverse terrains including multi-level steps, uneven surfaces, and wave-like obstacles while maintaining a forward velocity of 1 m/s.Analysis reveals that the contact constraints lead to physically consistent behaviors, with ground reaction forces residing on friction cone boundaries. This unified framework eliminates mode-switching discontinuities and enables the natural discovery of hybrid locomotion strategies, offering a promising direction for controlling multimodal robotic systems.
|
| |
| FrB21 Regular Session, Churchill C1 |
Add to My Program |
| Robust Control |
|
| |
| Chair: Bhattacharya, Raktim | Texas A&M |
| Co-Chair: Bhadani, Rahul | University of Arizona |
| |
| 13:30-13:45, Paper FrB21.1 | Add to My Program |
| Robust Attitude Control of Nonlinear UAV Dynamics with LFT Models and mathcal{H}_infty Performance |
|
| Kumar, Tanay | Texas A&M University |
| Bhattacharya, Raktim | Texas A&M University |
Keywords: Robust control, Linear parameter-varying systems, Autonomous systems
Abstract: Attitude stabilization of unmanned aerial vehicles (UAVs) in uncertain environments presents significant challenges due to nonlinear dynamics, parameter variations, and sensor limitations. This paper presents a comparative study of H∞ and classical PID controllers for multi-rotor attitude regulation in the presence of wind disturbances and gyroscope noise. The flight dynamics are modeled using a linear parameter-varying (LPV) framework, where nonlinearities and parameter variations are systematically represented as structured uncertainties within a linear fractional transformation formulation. A robust controller based on H∞ formulation is designed using only gyroscope measurements to ensure guaranteed performance bounds. Nonlinear simulation results demonstrate the effectiveness of the robust controllers compared to classical PID control, showing significant improvement in attitude regulation under severe wind disturbances.
|
| |
| 13:45-14:00, Paper FrB21.2 | Add to My Program |
| Lagrangian-Based Disturbance Observer for Uncertain Mechanical Systems: A Case Study of 1 Degree of Freedom |
|
| Lee, Hyobin | University of Seoul |
| Park, Gyunghoon | University of Seoul |
Keywords: Robust control, Mechanical systems/robotics, Uncertain systems
Abstract: This paper introduces a new formulation of the disturbance observer (DOB) for a class of uncertain one degree-of-freedom mechanical systems from the perspective of Lagrangian mechanics. Unlike most existing works where the inverse of the nominal model is explicitly used to estimate the lumped disturbance, we derive an alternative expression of the lumped disturbance by differentiating the Lagrangian (i.e., the difference between the kinetic and potential energies), which yields an inverse-model-free DOB structure. The proposed inverse-model-free approach requires less model information for DOB design (e.g., the centripetal/Coriolis term). While our new formulation inherently introduces a singularity issue, we show that such singularities can be handled using a damped pseudo-inverse. Using singular perturbation theory, we rigorously show that this preserves the desired practical tracking behavior under the stated conditions of the DOB compared with other inverse-model-based designs. Simulation results also show that the damped pseudo-inverse provides additional freedom to handle measurement noise, allowing the proposed DOB to outperform existing methods in noisy environments.
|
| |
| 14:00-14:15, Paper FrB21.3 | Add to My Program |
| Robust Regret Control with Uncertainty-Dependent Baseline |
|
| Liu, Jietian | University of Michigan |
| Seiler, Peter | University of Michigan, Ann Arbor |
Keywords: Robust control, Optimal control, Uncertain systems
Abstract: This paper proposes a robust regret control framework in which the performance baseline adapts to the realization of system uncertainty. The plant is modeled as a discrete-time, uncertain linear time-invariant system with real-parametric uncertainty. The performance baseline is the optimal non-causal controller constructed with full knowledge of the disturbance and the specific realization of the uncertain plant. We show that a controller achieves robust additive regret relative to this baseline if and only if it satisfies a related, robust H_infty performance condition on a modified plant. One technical issue is that the modified plant can, in general, have a complicated nonlinear dependence on the uncertainty. We use a linear approximation step so that the robust additive regret condition can be recast as a standard mu-synthesis problem. A numerical example is used to demonstrate the proposed approach.
|
| |
| 14:15-14:30, Paper FrB21.4 | Add to My Program |
| Robust Multi-Objective Control for DC-DC Converters under Parametric Uncertainties: Experimental Validation |
|
| Smouni, Omaima | Université De Picardie Jules Verne - MIS / Icam De Grand-Paris Sud |
| Nachidi, Meriem | Icam / MIS |
| Rabhi, Abdelhamid | Universite De Picardie Jules Verne - MIS |
| Bosche, Jerome | Universite De Picardie Jules Verne - MIS |
| Yazidi, Amine | Université De Picardie Jules Verne - LTI |
Keywords: Robust control, Power electronics, LMIs
Abstract: This paper introduces a new robust multi-objective control strategy for DC-DC converters under input saturation and parametric uncertainties. An integral state-feedback controller is designed using linear matrix inequalities not only to ensure precise reference track- ing, but also to ensure that control signals remain within defined bounds based on a deadzone nonlinearity and a sector condition-based design. The controller also effectively rejects disturbances using an H∞ approach. Additionally, parametric uncertainties of the converter are incorporated into the controller design. The performance of the proposed controller is evaluated under various scenarios using a real prototype. The experimental results demonstrate the effectiveness and robustness of the proposed approach
|
| |
| 14:30-14:45, Paper FrB21.5 | Add to My Program |
| Resilient Composite Control for Stability Enhancement in EV Integrated DC Microgrids |
|
| Islam, Md Saiful | The University of Alabama in Huntsville |
| Bhadani, Rahul | The University of Alabama in Huntsville |
Keywords: Robust control, Power electronics
Abstract: When electric vehicles (EVs) are integrated into the standalone DC microgrids (DCMGs), stability issues arise due to their constant power load (CPL) behavior, which introduces negative incremental impedance (NII). In addition, the DCMGs suffer from an inherent low-inertia problem. Therefore, this study presents a composite controller incorporating a global integral terminal sliding mode controller with a backstepping controller. A virtual capacitor is employed to mitigate the lower inertia issue and strengthen the DC-bus voltage response. An improved fractional power-based reaching law decreases chattering as well as accelerates convergence. Exact feedback linearization converts the nonlinear power converter model into Brunovsky's canonical form, thereby mitigating NII effects and non-minimum phase issues. The entire system stability is verified using Lyapunov control theory. Finally, simulation outcomes confirm the superior performance of the proposed approach, with 34.4-53.3% reduction in overshoot, 52.9-74.9% in undershoot, and 12-47.4% in settling time compared to the existing controller.
|
| |
| 14:45-15:00, Paper FrB21.6 | Add to My Program |
| Robust Position Tracking Control of Electro-Hydraulic Actuators: Elimination of Velocity Measurements |
|
| Taskingollu, Sule | Ege University |
| Bayrak, Alper | Bolu Abant Izzet Baysal University |
| Selim, Erman | Ege University |
| Tatlicioglu, Enver | Ege University |
| Zergeroglu, Erkan | Gebze Technical University |
Keywords: Robust control, Robotics, Lyapunov methods
Abstract: This work presents a robust backstepping type controller formulation for the position tracking control of engineering systems actuated via electro-hydraulic actuators (EHAs). Specifically, a robust controller that does not require accurate knowledge of the system parameters and uses only position measurements is proposed. A filtered based approach is applied to remove the velocity dependency of the controller formulation. Stability of the closed loop system and the uniform boundedness of the tracking error signals are ensured via Lyapunov based arguments. The overall performance of the proposed method is illustrated, initially through physics-based MATLAB/Simscape studies, and then experimentally on a 1 degree of freedom (dof) EHA test-bed and a 2 dof robotic arm.
|
| |
| FrB22 Invited Session, Churchill C2 |
Add to My Program |
| Modeling, Control and Estimation of Soft Material and Continuum Systems |
|
| |
| Chair: Zhang, Jun | University of Nevada Reno |
| Co-Chair: Haghshenas-Jaryani, Mahdi | New Mexico State University |
| Organizer: Vikas, Vishesh | University of Alabama |
| Organizer: Zhang, Jun | University of Nevada Reno |
| Organizer: Haghshenas-Jaryani, Mahdi | New Mexico State University |
| Organizer: Tan, Xiaobo | Michigan State University |
| |
| 13:30-13:45, Paper FrB22.1 | Add to My Program |
| Design and Modeling of the Coiled String Actuator with a Constant Twisting and Coiling Zone (I) |
|
| Noack, Will | University of Nevada, Reno |
| Zhang, Jun | University of Nevada Reno |
Keywords: Modeling, Smart structures, Robotics
Abstract: Evolved from the twisted string actuator, the coiled string actuator (CSA) produces appreciable and reasonably consistent actuation by coiling a pair of fully twisted strings. Being a recently discovered compliant actuator, only the CSA with a varying twisting and coiling zone has been considered, namely, the strings are coiled as a continuum. CSA in this configuration cannot be easily applied to applications where coiled strings have direct contact with the system. It is strongly desirable to study CSA with a constant twisting and coiling zone, whose string sliding region can be conveniently embedded into systems like tendons. CSA in this configuration exhibits a unique simultaneous twisting-induced and coiling-induced actuation, which challenges both the actuator design and modeling. In this study, we propose a separator structure design that decreases the friction caused by the string-separator contact. A kinematic model is proposed that captures the simultaneous twisting-induced and coiling-induced actuation of the CSA with a constant twisting and coiling zone. Encouraging preliminary experimental characterizations and model simulation results are provided.
|
| |
| 13:45-14:00, Paper FrB22.2 | Add to My Program |
| Direct Data-Driven Predictive Control for a Three-Dimensional Cable-Driven Soft Robotic Arm (I) |
|
| Ouyang, Cheng | Mississippi State University |
| Ul Islam, Moeen | Mississippi State University |
| Chen, Dong | Mississippi State University |
| Zhang, Kaixiang | Michigan State University |
| Li, Zhaojian | Michigan State University |
| Tan, Xiaobo | Michigan State University |
Keywords: Control applications, Robotics, Intelligent systems
Abstract: Soft robots offer significant advantages in safety and adaptability, yet achieving precise and dynamic control remains a major challenge due to their inherently complex and nonlinear dynamics. Recently, Data-enabled Predictive Control (DeePC) has emerged as a promising model-free method that bypasses explicit system identification by directly leveraging input–output data. While DeePC has shown success in other domains, its application to soft robots remains underexplored, particularly for 3D soft robotic systems. This paper addresses this gap by developing and experimentally validating an effective DeePC framework on a 3D cable-driven soft arm. Specifically, we design and fabricate a soft robotic arm with a thick tubing backbone for stability, a dense silicone body with large cavities for strength and flexibility, and rigid endcaps for secure termination. Using this platform, we implement DeePC with singular value decomposition (SVD)-based dimension reduction for two key control tasks: fixed-point regulation and trajectory tracking in 3D space. Comparative experiments with a baseline model-based controller demonstrate DeePC’s superior accuracy and adaptability, highlighting its potential as a practical solution for dynamic control of soft robots.
|
| |
| 14:00-14:15, Paper FrB22.3 | Add to My Program |
| DiSA-IQL: Offline Reinforcement Learning for Robust Soft Robot Control under Distribution Shifts (I) |
|
| He, Linjin | Georgetown University |
| Qi, Xinda | Michigan State University |
| Chen, Dong | Mississippi State University |
| Li, Zhaojian | Michigan State University |
| Tan, Xiaobo | Michigan State University |
Keywords: Control applications, Robotics, Reinforcement learning
Abstract: Soft snake robots offer remarkable flexibility and adaptability in complex environments, yet their control remains challenging due to highly nonlinear dynamics. Existing model-based and bio-inspired controllers rely on simplified assumptions that limit their performance. Deep reinforcement learning (DRL) has recently emerged as a promising alternative, but online training is often impractical because of costly and potentially damaging real-world interactions. Offline RL provides a safer option by leveraging pre-collected datasets, but it suffers from distribution shift, which degrades generalization to unseen scenarios. To overcome this challenge, we propose DiSA-IQL (Distribution-Shift-Aware Implicit Q-Learning), an extension of IQL that incorporates robustness modulation by penalizing unreliable state–action pairs to mitigate distribution shift. We evaluate DiSA-IQL on goal-reaching tasks across two settings: in-distribution and out-of-distribution evaluation. Simulation results show that DiSA-IQL consistently outperforms baseline models, including Behavior Cloning (BC), Conservative Q-Learning (CQL), and vanilla IQL, achieving higher success rates, smoother trajectories, and greater robustness.
|
| |
| 14:15-14:30, Paper FrB22.4 | Add to My Program |
| Learning to Crawl: Latent Model-Based Reinforcement Learning for Soft Robotic Adaptive Locomotion (I) |
|
| Gzenda, Vaughn | Carleton University |
| Chhabra, Robin | Toronto Metropolitan University |
Keywords: Autonomous robots, Reinforcement learning, Adaptive systems
Abstract: Soft robotic crawlers are mobile robots that utilize soft body deformability and compliance to achieve locomotion through surface contact. Designing control strategies for such systems is challenging due to model inaccuracies, sensor noise, and the need to discover locomotor gaits. In this work, we present a model-based reinforcement learning (MB-RL) framework in which Gaussian latent dynamics inferred from onboard sensors serve as a predictive model that guides an actor-critic algorithm to optimize locomotor policies. We evaluate the framework on a minimal crawler model in simulation using inertial measurement units and time-of-flight sensors as observations. The learned latent dynamics enable short-horizon motion prediction while the actor-critic discovers effective locomotor policies. This approach highlights the potential of latent-dynamics MB-RL for enabling embodied soft robotic adaptive locomotion based solely on noisy sensor feedback.
|
| |
| 14:30-14:45, Paper FrB22.5 | Add to My Program |
| A Physics-Informed Geometric Model for Characterizing Force and Length Variation of McKibben Pneumatic Artificial Muscles (I) |
|
| Montoya, Sabrina | New Mexico State University |
| Haghshenas-Jaryani, Mahdi | New Mexico State University |
Keywords: Robotics, Modeling, Model Validation
Abstract: Pneumatic Artificial Muscles (PAMs) are a type of soft actuator used in robotics to mimic the behavior of biological muscle, typically consisting of an inner bladder and a form of outer sleeve that causes contraction when pressurized. Many mathematical models have been developed to predict the force and displacement of PAMs. However, these models are mainly sensitive to the variation in materials and geometries of PAMs, making them limited in practical use for specific PAMs. This paper presents a unified physics-informed geometric model of PAMs for a range of sizes with initial lengths of 60, 70, and 80 mm and initial diameters ranging from 3.175, 6.35, and 9.525 mm. The fabrication and experimental testing of a type of PAM produced in our Lab for the muscle-driven snake-like robot locomotion is discussed. A series of experimental isometric and isotonic tests was conducted on customized PAMs with varying geometries made of the same materials. Finally, an empirical modeling approach was utilized, which resulted in a geometric model that preserves the structure of the Pujana-Aresse model. The model is limited to the size range tested in the lab and has only been validated for the aforementioned sizes. These muscle sizes were explored based on the overall geometry of the snake and the desired application for the muscles. For validation, the model prediction was compared to the experimental results, yielding a mean Normalized Root Mean Square Error (NRMSE) of 16.3% for the isometric force and 12% for the isotonic displacement.
|
| |
| 14:45-15:00, Paper FrB22.6 | Add to My Program |
| Adaptive Admittance Control of Nasal Swab Robots |
|
| Hsiao, Tesheng | National Yang Ming Chiao Tung University |
| Li, Yi-Jing | National Yang Ming Chiao Tung University |
Keywords: Robotics, Adaptive systems, Control applications
Abstract: Nasal swabbing is an essential procedure for detecting various respiratory diseases, and performing it with a robot can reduce the risk of infection for medical personnel. However, nasal swab robots face several challenges. For example, the narrow and elongated nasal passage allows only a small pose error during swab insertion, and the internal nasal structure varies slightly depending on each individual’s health condition. To deal with these problems, this paper proposes an admittance control law that enables the robot to adjust the swab’s pose based on the feedback of contact force and torque. However, the choice of admittance parameters significantly affects the behavior of the robot. To reduce the contact force and alleviate oscillation of the swab simultaneously, an adaptive admittance parameter strategy is introduced that modifies the admittance parameter in real time according to the pose of the swab. Consequently, safety, efficiency, and comfort of the nasal swabbing process are enhanced. Then, the proposed adaptive admittance control law was implemented in a 6-axis industrial robot, and a curved channel was constructed as a test environment for experimental verification. The results showed that a satisfactory balance among all performance indices, regarding safety, efficiency, and comfort, was achieved.
|
| |
| FrC03 Regular Session, Grand Salon 3 |
Add to My Program |
| Quantum Information and Control |
|
| |
| Chair: Quintero, Kaydian | Embry-Riddle Aeronautical University |
| Co-Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
| |
| 15:30-15:45, Paper FrC03.1 | Add to My Program |
| Exploring Chebyshev Spectral Method with Quantum Annealing Via Carleman Linearization |
|
| Nguyen, Hieu | North Carolina Agricultural and Technical State University |
| Ngo, Anh Phuong | North Carolina A&T State University |
Keywords: Quantum information and control, Computational methods, Optimization
Abstract: This paper explores the integration of Carleman linearization and quantum annealing to solve nonlinear ordinary differential equations (ODEs) under the Chebyshev spectral method. Using Carleman linearization, the computation of optimal Chebyshev coefficients is reformulated as a quadratic program, which can be further transformed into a Quadratic Unconstrained Binary Optimization (QUBO) problem via binary expansion. This QUBO formulation is well-suited for quantum annealers, where solutions are obtained by minimizing the Hamiltonian of the corresponding Ising model. To address hardware limitations, we employ an iterative quantum annealing strategy with dynamically updated parameters. Case studies on the D-Wave platform demonstrate the potential of our approach to harness the strengths of Carleman linearization and Chebyshev approximation theory, offering continuous and differentiable solutions to nonlinear ODEs, especially promising when quantum resources become more economically viable.
|
| |
| 15:45-16:00, Paper FrC03.2 | Add to My Program |
| Robust Multi-Parameter & State Estimation in Dissipative Flux Qubit Systems Via Sliding Mode Control |
|
| Quintero, Kaydian | Embry-Riddle Aeronautical University |
| Drakunov, Sergey V. | Embry-Riddle Aeronautical University |
| Berhane, Bereket | Embry-Riddle Aeronautical University |
Keywords: Quantum information and control, Identification
Abstract: Recent advancements in quantum computing, secure communication, and precision sensors indicate a transition from theoretical exploration to practical applications in quantum technologies. This paper addresses state estimation and control of superconducting flux qubits, a promising platform for scalable quantum computation. A novel adaptive sliding mode observer (ASMO) is proposed for simultaneous robust state reconstruction and online estimation of unknown damping parameters β and Λ in the presence of dissipation and measurement noise. Using incomplete measurement data, the observer ensures accurate estimation while maintaining robustness to model uncertainties. A sliding mode control law is then developed to stabilize the system and achieve desired trajectory tracking. Lyapunov-based analysis guarantees convergence of the estimation and control errors and ensures closed-loop stability. Simulation results demonstrate the effectiveness and robustness of the proposed framework, indicating its potential to enhance the reliability and stability of superconducting quantum processors under realistic operating conditions
|
| |
| 16:00-16:15, Paper FrC03.3 | Add to My Program |
| Continuous-Time Quantum Reservoirs |
|
| Chen, Anthony Siming | University of Nottingham |
| Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Quantum information and control, Machine learning, Neural networks
Abstract: This paper studies temporal learning with quantum reservoirs driven by continuous-time dynamics. Our goal is to model long-range dependencies under irregular sampling while avoiding backpropagation through time and repeated numerical ODE integration. We formulate a Continuous-time Quantum Reservoir (CQR) framework in which an input-modulated Hamiltonian drives Schrödinger evolution, and a classical linear readout is fitted from sampled quantum features. We analyze three variants: (i) CQR-unitary, which uses expectation values of Hermitian observables; (ii) CQR-measurement, which uses projective-measurement probabilities; and (iii) CQR-trainable, which additionally tunes input-coupling Hamiltonian parameters via parameter-shift updates. Experiments on Lorenz forecasting, human-activity classification, and two regression benchmarks show competitive accuracy, improved memory capacity, and clear compute-accuracy trade-offs across variants. We also provide a Stone-Weierstrass-based universality result for the induced feature algebra and discuss practical design limitations.
|
| |
| 16:15-16:30, Paper FrC03.4 | Add to My Program |
| Quantum Deception: Honey-X Deception Using Quantum Games |
|
| Reppas, Efstratios | Georgia Institute of Technology |
| Wadi, Ali | Georgia Institute of Technology |
| Gould, Brendan | Georgia Institute of Technology |
| Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Game theory, Quantum information and control, Optimization
Abstract: In this paper, we develop a framework for deception in quantum games, extending the Honey-X paradigm from classical zero-sum settings into the quantum domain. Building on a view of deception in classical games as manipulation of a player’s perception of the payoff matrix, we formalize quantum deception as controlled perturbations of the payoff Hamiltonian subject to a deception budget. We show that when victims are aware of possible deception, their equilibrium strategies surprisingly coincide with those of naive victims who fully trust the deceptive Hamiltonian. This equivalence allows us to cast quantum deception as a bilevel optimization problem, which can be reformulated into a bilinear semidefinite program. To illustrate the framework, we present simulations on quantum versions of the Penny Flip game, demonstrating how quantum strategy spaces and non-classical payoffs can amplify the impact of deception relative to classical formulations.
|
| |
| 16:30-16:45, Paper FrC03.5 | Add to My Program |
| Data-Driven Actuator Selection for High-Fidelity Quantum Optimal Control |
|
| Wadi, Ali | Georgia Institute of Technology |
| Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Quantum information and control, Optimal control, Machine learning
Abstract: In this paper, we propose a new framework for quantum optimal actuator selection that unifies control allocation with optimal quantum control design. To guarantee the uniqueness of solutions, we transform the infinite-horizon control allocation problem into an equivalent finite-horizon setting. Leveraging the isometry of the Bloch representation, we systematically characterize available actuators and introduce a quantitative measure of controllability based on accessible control degrees of freedom. Building on this foundation, we develop a model-free, trajectory-informed machine learning approach that evaluates control performance directly from system trajectories to identify the optimal actuator set. Finally, we provide rigorous analysis demonstrating that, even under unknown system dynamics, the proposed method converges to the optimal value function for a sufficiently large time horizon.
|
| |
| 16:45-17:00, Paper FrC03.6 | Add to My Program |
| Quantum-Assisted Barrier Sequential Quadratic Programming for Nonlinear Optimal Control |
|
| Binandeh Dehaghani, Nahid | Aalborg University |
| Wisniewski, Rafal | Aalborg University |
| Aguiar, A. Pedro | Faculty of Engineering, University of Porto |
Keywords: Quantum information and control
Abstract: We propose a quantum-assisted framework for solving constrained finite-horizon nonlinear optimal control problems using a barrier Sequential Quadratic Programming (SQP) approach. A quantum subroutine is incorporated to efficiently solve the Schur complement step using block-encoding and Quantum Singular Value Transformation techniques. We formally analyze the time complexity and convergence behavior under the cumulative effect of quantum errors, establishing local input-to-state stability and convergence to a neighborhood of the stationary point, with explicit error bounds in terms of the barrier parameter and quantum solver accuracy. The proposed framework enables computational complexity to scale polylogarithmically with the system dimension demonstrating the potential of quantum algorithms to enhance classical optimization routines in nonlinear control applications.
|
| |
| FrC04 Regular Session, Grand Salon 4 |
Add to My Program |
| Human-In-The-Loop Control |
|
| |
| Chair: Hedengren, John | Brigham Young University |
| Co-Chair: Barton, Kira | University of Michigan, Ann Arbor |
| |
| 15:30-15:45, Paper FrC04.1 | Add to My Program |
| Adaptive Driving Style for SAE Level-2 Driving Automation: Minimizing Preference Mismatch |
|
| Akash, Kumar | Honda Research Institute USA, Inc |
| Zheng, Zhaobo | Honda Research Institute USA, Inc |
| Misu, Teruhisa | Honda Research Institute USA, Inc |
| Krishnamoorthy, Vidya | San Jose State University |
| Dong, Mia | San Jose State University |
| Lee, Yuni | San Jose State University |
| Huang, Gaojian | San Jose State University |
Keywords: Human-in-the-loop control, Automotive control, Cooperative control
Abstract: Driving style is a key factor in the comfort and acceptance of automated vehicle (AV) features. In SAE Level-2 automation, where the driver must supervise the system and remain ready to intervene, mismatches between the automation’s driving style and the driver’s preference can reduce trust and trigger takeovers. This paper proposes an adaptive driving-style control framework that minimizes such preference mismatch. In a driving-simulator study, we compare fixed, trust-based, and preference-based adaptation heuristics and analyze their effects on preference mismatch and trust. We then train a driving-preference prediction model and use it in an implicit adaptation policy that selects among bounded driving styles for upcoming events. A validation study shows that the predictive policy achieves equal or lower preference mismatch than comparison baselines, particularly when starting from a less defensive style, while also yielding higher average trust. The results provide a step toward developing human-aware driving automation that can implicitly adapt its driving style to the driver’s preferences.
|
| |
| 15:45-16:00, Paper FrC04.2 | Add to My Program |
| LLM-Enhanced Human-In-The-Loop MPC: TCLab Demonstration for Pharma 4.0 |
|
| Pershing, Jonathan | Brigham Young University |
| Stone, Nathan | Brigham Young University |
| Joseph, Allison | Brigham Young University |
| Whitaker, Darren | Takeda Development Center Americas |
| Hedengren, John | Brigham Young University |
Keywords: Human-in-the-loop control, Intelligent systems, Control applications
Abstract: Industry 4.0 in pharmaceutical manufacturing demands advanced automation and control to handle increasingly specialized drugs and personalized therapies. However, leveraging sophisticated controls like model predictive control (MPC) often requires expertise beyond the typical plant operator. This paper proposes a human-in-the-loop framework where a large language model (LLM) assists operators in configuring and tuning an MPC using natural language commands. The LLM translates high-level objectives (e.g. “speed up the response,” “avoid overshoot”) into MPC tuning adjustments and provides real-time analysis of controller performance. A human supervisor remains in the loop as a safety net, which is essential in regulated pharma environments to ensure trust and compliance. The concept is demonstrated on a Temperature Control Lab (TCLab) hardware setup using a GEKKO-based MPC. Results show that key control objectives can be modified on the fly via natural language, enabling intuitive operator interaction. Four representative scenarios illustrate how LLM-translated commands affect MPC behavior, and an insight-generation feature provides automated textual analysis of performance. This human-in-the-loop, LLM-assisted MPC approach offers a pathway to deploy advanced control in pharma manufacturing while preserving human oversight.
|
| |
| 16:00-16:15, Paper FrC04.3 | Add to My Program |
| Energy-Efficient Intent Estimation and Distributed Optimal Control for Human–Multi-Robot Collaboration |
|
| Ganie, Irfan Ahmad | Wilkes University |
| Jagannathan, Sarangapani | Missouri Univ of Science & Tech |
Keywords: Human-in-the-loop control, Learning, Optimal control
Abstract: This paper introduces an energy-efficient distributed optimal control framework for human–multi-robot cooperative manipulation, combining a spiking neural network (SNN) observer for biologically inspired intent estimation with a game-theoretic distributed optimal control strategy for coordination. At the estimation level, the event-driven SNN captures human motor intent from local force feedback and consensus information, enabling real-time trajectory estimation through sparse spike processing. Unlike conventional neural networks, the SNN processes asynchronous events to significantly reduce computational energy, while singular value decomposition (SVD)–based weight tuning laws ensure stable gradient propagation during online learning. At the control level, a multilayer NN-based actor-critic architecture applies adaptive dynamic programming within a cooperative game-theoretic setting. Coupled Hamilton-Jacobi-Bellman equations are solved through neighborhood optimization, allowing each robot to minimize a performance cost that incorporates local dynamics, neighboring interactions, and human-robot force coordination. Simulation results in human–multi-robot collaborative tasks, achieve superior tracking accuracy with a 45% reduction in operational cost compared to state-of-the-art methods. The SNN-based observer further reduces energy consumption by 60% while maintaining estimation accuracy, enabling practical real-time deployment on resource-constrained robotic systems.
|
| |
| 16:15-16:30, Paper FrC04.4 | Add to My Program |
| Characterization of Modes of Human-Autonomy Team Behavior Using the Distance to Generalized Nash Equilibria in Shared Vehicle Control |
|
| Dudek, Aleksandra | University of Michigan |
| Naidja, Nouhed | CentraleSupelec/Laboratoire Des Signaux Et Systčmes (L2S) |
| James, Scott Clifford | Applied Dynamics International, Inc |
| Castanier, Matthew | US Army DEVCOM Ground Vehicle Systems Center |
| Pechberti, Steve | Institut VEDECOM, Versailles, France |
| Rahal, Mohamed-Cherif | Institut VEDECOM, Versailles, France |
| Vermillion, Christopher | University of Michigan |
| Barton, Kira | University of Michigan, Ann Arbor |
Keywords: Human-in-the-loop control, Modeling
Abstract: This paper quantifies the performance of human-autonomy teams (HATs) by measuring the distance between observed strategies and the generalized Nash equilibrium (GNE). Traditional approaches of assessing HATs do not fully capture the bidirectional interactions or provide actionable insight to improve team behavior. Here, we model HATs as non-zero-sum games and introduce a potential function (informed by those seen in potential games) to characterize how a HAT's behavior diverges from the identified GNE strategy. In a user study with a shared-control vehicle simulator, we apply the metric to assess the effectiveness of the teams formed between an autonomous agent and human participants. We further identify distinct behavior modes of participants based on the trends in the potential function values across different ratios of shared control. The results demonstrate the utility of the potential function in distinguishing patterns of human behavior, providing a pathway towards enhancing the effectiveness of HATs.
|
| |
| 16:30-16:45, Paper FrC04.5 | Add to My Program |
| Arbitration with Control Barrier Functions for Safe Shared Control |
|
| Uzun, Muhammed Yusuf | Bilkent University |
| Yildiz, Yildiray | Bilkent University |
Keywords: Human-in-the-loop control
Abstract: By combining automation accuracy with human adaptability, shared control provides enhanced performance and safety in dynamic, complex environments. Traditional arbitration methods for integrating automation and human inputs often rely on system-specific, parameter-dependent functions that are based on shared control metrics such as trust, workload, or attention. Meanwhile, Control Barrier Functions (CBFs) enforce safety constraints on automated systems but are typically limited to safeguarding plant states. This work introduces a novel arbitration method based on Control Barrier Functions (CBFs), where shared control metrics such as workload, attention, and trust are expressed as real-time inequality constraints. The resulting quadratic-programming formulation determines the automation assistance input that enforces these constraints while preserving feasibility and safety. This CBF-based arbitration provides a systematic, interpretable, and scalable foundation for safe human–autonomy integration.
|
| |
| 16:45-17:00, Paper FrC04.6 | Add to My Program |
| Shared Autonomy under Human Performance Uncertainty Via Bayesian Learning and Nested Control Barrier Functions |
|
| Li, Xiao | University of Michigan, Ann Arbor |
| Talbot, John | Toyota Research Institute |
| Dallas, James | Toyota Research Institute |
| Subosits, John | Stanford University: Dynamic Design Lab |
Keywords: Autonomous systems, Constrained control, Human-in-the-loop control
Abstract: Shared autonomy envisions seamless machine assistance to human drivers, enhancing both driving safety and personalization of the driving experience. In this paper, we propose a shared control algorithm for a lane-keeping application that formally guarantees the vehicle remains within a predefined track while minimizing intrusiveness. This is accomplished through the development of a family of Control Barrier Functions that construct a nested sequence of safe subsets, each representing a different level of conservativeness. The level of conservativeness is then adapted online to individual drivers based on their inferred, uncertain driving performance, using a Bayesian filter. We employ a gradient-based algorithm to automatically optimize the hyperparameters, thereby minimizing the intrusiveness of each Control Barrier Functions in parallel. The proposed approach achieves an average computation time of 0.0047 seconds, and its effectiveness is validated through comprehensive simulation studies and real-world hardware experiments with a human driver in the loop.
|
| |
| FrC06 Regular Session, Grand Salon 7 |
Add to My Program |
| Network Analysis and Control |
|
| |
| Chair: Liu, Ji | Stony Brook University |
| Co-Chair: Gamarra, Marco | Air Force Research Laboratory |
| |
| 15:30-15:45, Paper FrC06.1 | Add to My Program |
| A 3-State Clock Machine for Combinatorial Optimization |
|
| Cheng, Yi | University of Virginia |
| Lin, Zongli | University of Virginia |
Keywords: Optimization algorithms, Network analysis and control
Abstract: Many combinatorial optimization problems (COPs) are NP-hard and, in general, hard to solve. Over the past years, many unconventional computing paradigms have been proposed for solving such problems. Notably, Ising machines are one such paradigm for solving COPs with binary variables, which have attracted sustained attention in recent years. Specifically, Ising machines are a class of Langevin models for sampling low-energy configurations of the Ising model. Since many COPs such as the max-cut problem can be directly mapped to the Ising model, Ising machines solve the COPs by finding or sampling lowest-energy configurations (ground states) of the Ising model. Although Ising machines have shown great potential, they remain somewhat inadequate in addressing multi-state COPs. Here, inspired by the 3-state clock model, a generalization of the Ising model to a multi-state model, we introduce a nonlinear dynamical model, referred to as 3-state clock machine, for solving max-3-cut problems.
|
| |
| 15:45-16:00, Paper FrC06.2 | Add to My Program |
| Multiagent Social Influence: Modeling Persuasion in Contested Social Networks |
|
| Tumu, Renukanandan | University of Pennsylvania |
| Vasile, Cristian Ioan | Lehigh University |
| Preciado, Victor M. | University of Pennsylvania |
| Mangharam, Rahul | University of Pennsylvania |
Keywords: Network analysis and control, Control of networks, Large-scale systems
Abstract: We present the Social Influence Game (SIG), a framework for modeling adversarial persuasion in social networks with an arbitrary number of competing players. Our goal is to provide a tractable and interpretable model of contested influence that scales to large systems while capturing the structural leverage points of networks. Each player allocates influence from a fixed budget to steer opinions that evolve under DeGroot dynamics, and we prove that the resulting best-response optimization problem is a difference-of-convex program. To enable scalability, we develop an Iterated Linear (IL) solver that approximates player objectives with linear programs. In experiments on random and archetypical networks, IL achieves solutions within 7% of nonlinear solvers while being over 10× faster, scaling to large social networks. This paper lays a foundation for asymptotic analysis of contested influence in complex networks.
|
| |
| 16:00-16:15, Paper FrC06.3 | Add to My Program |
| Toward Extremal Graphs for Maximum Total Effective Resistance under Vertex and Edge Constraints |
|
| Lu, Susie | MIT |
| Gamarra, Marco | Air Force Research Laboratory |
| Liu, Ji | Stony Brook University |
Keywords: Network analysis and control, Cooperative control
Abstract: This paper studies an algorithm that generates generalized lollipop graphs for any feasible pair of numbers of vertices and edges. An explicit expression for the total effective resistance of these graphs is derived, and it is shown that this value achieves or approximates the maximal possible total effective resistance among all connected graphs with the same vertex and edge counts. In particular, the gap between the total effective resistance of the constructed generalized lollipop graphs and the extremal value converges to zero as the graph size tends to infinity.
|
| |
| 16:15-16:30, Paper FrC06.4 | Add to My Program |
| Distributed Online Submodular Maximization under Communication Delays: A Simultaneous Decision-Making Approach |
|
| Xu, Zirui | University of Michigan |
| Tzoumas, Vasileios | University of Michigan, Ann Arbor |
Keywords: Network analysis and control, Optimization, Sensor networks
Abstract: We provide a distributed online algorithm for multi-agent submodular maximization under communication delays. We are motivated by the future distributed information-gathering tasks in unknown and dynamic environments, where utility functions naturally exhibit the diminishing-returns property, i.e., submodularity. Existing approaches for online submodular maximization either rely on sequential multi-hop communication, resulting in prohibitive delays and restrictive connectivity assumptions, or restrict each agent’s coordination to its one-hop neighborhood only, thereby limiting the coordination performance. To address the issue, we provide the Distributed Online Greedy (DOG) algorithm, which integrates tools from adversarial bandit learning with delayed feedback to enable simultaneous decision-making across arbitrary network topologies. We provide the approximation performance of DOG against an optimal solution, capturing the suboptimality cost due to decentralization as a function of the network structure. Our analyses further reveal a trade-off between coordination performance and convergence time, determined by the magnitude of communication delays. By this trade-off, DOG spans the spectrum between the state-of-the-art fully centralized online coordination approach [1] and fully decentralized one-hop coordination approach [2].
|
| |
| 16:30-16:45, Paper FrC06.5 | Add to My Program |
| Opinion Clustering under the Friedkin-Johnsen Model: Agreement in Disagreement |
|
| Shrinate, Aashi | IIT Kanpur |
| Tripathy, Twinkle | IIT Kanpur |
Keywords: Network analysis and control
Abstract: The convergence of opinions in the Friedkin-Johnsen (FJ) framework is well studied, but the topological conditions leading to opinion clustering remain less explored. To bridge this gap, we examine the role of topology in the emergence of opinion clusters within the network. The key contribution of the paper lies in the introduction of the notion of topologically prominent agents, referred to as Locally Topologically Persuasive (LTP) agents. Interestingly, each LTP agent is associated with a unique set of (non-influential) agents in its vicinity. Using them, we present conditions to obtain opinion clusters in the FJ framework in any arbitrarily connected digraph. A key advantage of the proposed result is that the resulting opinion clusters are independent of the edge weights and the stubbornness of the agents. Finally, we demonstrate using simulation results that, by suitably placing LTP agents, one can design networks that achieve any desired opinion clustering.
|
| |
| 16:45-17:00, Paper FrC06.6 | Add to My Program |
| Multi-Agent Stage-Wise Conservative Linear Bandits |
|
| Afsharrad, Amirhossein | Stanford University |
| Moradipari, Ahmadreza | University of California Santa Barbara |
| Lall, Sanjay | Stanford University |
Keywords: Machine learning, Network analysis and control, Reinforcement learning
Abstract: In many real-world applications such as recommendation systems, multiple learning agents must balance exploration and exploitation while maintaining safety guarantees to avoid catastrophic failures. We study the stochastic linear bandit problem in a multi-agent networked setting where agents must satisfy stage-wise conservative constraints. A network of N agents collaboratively maximizes cumulative reward while ensuring that the expected reward at every round is no less than (1-alpha) times that of a baseline policy. Each agent observes local rewards with unknown parameters, but the network optimizes for the global parameter (average of local parameters). Agents communicate only with immediate neighbors, and each communication round incurs additional regret. We propose MA-SCLUCB (Multi-Agent Stage-wise Conservative Linear UCB), an episodic algorithm alternating between action selection and consensus-building phases. We prove that MA-SCLUCB achieves regret tilde{O}left(frac{d}{sqrt{N}}sqrt{T}cdotfrac{log(N T)}{sqrt{log(1/|lambda_2|)}}right) with high probability, where d is the dimension, T is the horizon, and |lambda_2| is the network's second largest eigenvalue magnitude. Our analysis shows: (i) collaboration yields frac{1}{sqrt{N}} improvement despite local communication, (ii) communication overhead grows only logarithmically for well-connected networks, and (iii) stage-wise safety adds only lower-order regret. Thus, distributed learning with safety guarantees achieves near-optimal performance in reasonably connected networks.
|
| |
| FrC07 Regular Session, Grand Salon 9 |
Add to My Program |
| Traffic Control |
|
| |
| Chair: Malikopoulos, Andreas A. | Cornell University |
| Co-Chair: Scruggs, Jeff | University of Michigan |
| |
| 15:30-15:45, Paper FrC07.1 | Add to My Program |
| Traffic Density Control Via Filtered Feedback Linearization in Homogeneous Highway Corridors with Input Constraints |
|
| Rahmanidehkordi, Arash | University of North Carolina at Charlotte |
| Ghasemi, Amirhossein | University of North Carolina Charlotte |
Keywords: Traffic control, Constrained control, Feedback linearization
Abstract: This paper presents a novel control framework that integrates Filtered Feedback Linearization (FFL) with Model Predictive Control (MPC) to manage traffic flow in a homogeneous highway network. FFL is a high-gain robust control method that relies on minimal knowledge of the traffic model and can effectively reject unknown and unmeasured disturbances; however, it does not inherently handle actuator constraints, which may lead to control saturation. Conversely, MPC effectively enforces constraints and optimizes control actions, but its performance depends on model accuracy and it lacks inherent robustness to disturbances. To bridge these limitations, we propose an FFL-MPC framework in which FFL ensures disturbance rejection while MPC regulates virtual control inputs within feasible limits. A key contribution of this work is a novel constraint mapping algorithm that dynamically transforms actuator constraints into the virtual control domain, preserving compatibility with the low-pass filtering dynamics of FFL. The METANET model is employed to model the traffic dynamics. The proposed framework is validated through numerical simulations of highway traffic, demonstrating its effectiveness in alleviating congestion, enforcing control limits, and preserving robust performance under model uncertainty and external disturbances.
|
| |
| 15:45-16:00, Paper FrC07.2 | Add to My Program |
| Spatiotemporal Forecasting of Incidents and Congestion with Implications for Sustainable Traffic Control (I) |
|
| Kinchen, Tony | Cornell Univerisity |
| Bai, Ting | Cornell University |
| Senthil Kumar, Nishanth Venkatesh | Cornell University |
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Simulation, Modeling, Estimation
Abstract: Urban traffic anomalies, such as collisions and disruptions, threaten the safety, efficiency, and sustainability of transportation systems. In this paper, we present a simulation-based framework for modeling, detecting, and predicting such anomalies in urban networks. Using the Simulation of Urban MObility (SUMO) platform, we generate reproducible rear-end and intersection crash scenarios with matched baselines, enabling controlled experimentation and comparative evaluation. We record vehicle-level travel time, speed, and emissions for both edge- and network-level analysis. Building on this dataset, we develop a hybrid forecasting architecture that combines bidirectional long short-term memory networks with a diffusion convolutional recurrent neural network to capture temporal dynamics and spatial dependencies. Our simulation studies on the Broadway corridor in New York City demonstrate the framework's ability to reproduce consistent incident conditions, quantify their effects, and provide accurate multi-horizon traffic forecasts. Our results highlight the value of combining controlled anomaly generation with deep predictive models to support reproducible evaluation and sustainable traffic management.
|
| |
| 16:00-16:15, Paper FrC07.3 | Add to My Program |
| Route Recommendations for Traffic Management under Learned Partial Driver Compliance |
|
| Bang, Heeseung | Cornell University |
| Cho, Jung-Hoon | MIT |
| Wu, Cathy | UC Berkeley |
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Traffic control, Multivehicle systems, Transportation networks
Abstract: In this paper, we aim to mitigate congestion in traffic management systems by guiding travelers along system-optimal (SO) routes. However, we recognize that most theoretical approaches assume perfect driver compliance, which often does not reflect reality, as drivers tend to deviate from recommendations to fulfill their personal objectives. Therefore, we propose a route recommendation framework that explicitly learns partial driver compliance and optimizes traffic flow under realistic adherence. We first compute an SO edge flow through flow optimization techniques. Next, we train a compliance model based on historical driver decisions to capture individual responses to our recommendations. Finally, we formulate a stochastic optimization problem that minimizes the gap between the target SO flow and the realized flow under conditions of imperfect adherence. Our simulations conducted on a grid network reveal that our approach significantly reduces travel time compared to baseline strategies, demonstrating the practical advantage of incorporating learned compliance into traffic management.
|
| |
| 16:15-16:30, Paper FrC07.4 | Add to My Program |
| Mesoscopic Digital Control for Disturbance String Stability Via Constant Time-Headway Spacing Policies |
|
| Bonsanto, Pietro | Universitŕ Degli Studi Dell'Aquila |
| Mattioni, Mattia | Universitŕ Degli Studi Di Roma La Sapienza |
| Iovine, Alessio | CNRS |
| De Santis, Elena | University of L'Aquila |
| Di Benedetto, Maria Domenica | University of L'Aquila |
Keywords: Traffic control, Sampled-data control, Autonomous vehicles
Abstract: This paper addresses the enforcement of Disturbance String Stability (DSS) in connected autonomous vehicle platoons subject to asynchronous communication, sampling, and quantization in case of a constant time headway spacing policy. Building on a mesoscopic control framework, we design digital controllers that guarantee DSS despite external disturbances and device limitations. Simulation results demonstrate the effectiveness of the proposed approach.
|
| |
| 16:30-16:45, Paper FrC07.5 | Add to My Program |
| Nonlinear Consensus for Traffic Assignment in Multi-Route Networks |
|
| Adjei, Emmanuel | University of California Irvine |
| Butler, Brooks A. | University of California, Irvine |
| Jin, Wen-Long | University of California, Irvine |
| Egerstedt, Magnus | University of North Carolina, Chapel Hill |
Keywords: Transportation networks, Traffic control, Emerging control applications
Abstract: This paper presents a framework connecting nonlinear consensus dynamics with traffic assignment problems in multi-route transportation networks. We define a model of route-choice behavior in which flow evolution is governed by first-in-first-out dynamics, with route costs combining baseline and congestion terms. This formulation yields Wardrop equilibria as consensus states, where all used routes have equalized costs. The analysis establishes key system properties, including flow conservation, invariance of unused routes, and uniqueness of the symmetric equilibrium. Stability is demonstrated through monotonic convergence, while case studies of balanced, congested, and asymmetric networks illustrate flow reallocation and steady-state concentration on optimal routes. This framework provides a foundation for decentralized route selection in intelligent transportation systems and enables control strategies that exploit consensus-like rerouting in urban traffic.
|
| |
| 16:45-17:00, Paper FrC07.6 | Add to My Program |
| Discrete-Time Pricing for Nash Stabilization in Routing Games |
|
| Lee, Richard | University of Michigan |
| Scruggs, Jeff | University of Michigan |
| Yin, Yafeng | University of Michigan |
Keywords: Transportation networks, Traffic control, Lyapunov methods
Abstract: This work presents a dynamic pricing scheme which guarantees global asymptotic stability of the Nash equilibrium set for a discrete-time routing game. We adopt a game theoretic approach in which each population consists of commuters between a particular origin-destination pair. Populations revise their strategies according to payoff discrepancies between routes, where the strategy revision process is modeled by the class of impartial pairwise comparison (IPC) evolutionary dynamic models (EDMs). We show that the EDM satisfies a discrete-time dissipativity property and leverage this insight to design a dynamic payoff mechanism acting as a supplemental toll imposed on each population. By introducing an augmented storage function which governs the toll dynamics, we ensure the control input (i.e., toll) asymptotically vanishes at the Nash equilibrium set, thus guaranteeing convergence of the closed-loop system. We validate our findings with numerical experiments.
|
| |
| FrC08 Regular Session, Grand Salon 10-13 |
Add to My Program |
| Multi-Vehicle Systems |
|
| |
| Chair: Malikopoulos, Andreas A. | Cornell University |
| Co-Chair: Alexandridis, Alex | University of West Attica |
| |
| 15:30-15:45, Paper FrC08.1 | Add to My Program |
| R3R: Decentralized Multi-Agent Collision Avoidance with Infinite-Horizon Safety |
|
| Vielmetti, Thomas Marshall | University of Michigan, Ann Arbor |
| Agrawal, Devansh Ramgopal | University of Michigan, Ann Arbor |
| Panagou, Dimitra | University of Michigan, Ann Arbor |
Keywords: Decentralized control, Multivehicle systems, Autonomous robots
Abstract: Existing decentralized methods for multi-agent motion planning lack formal, infinite-horizon safety guarantees, especially for communication-constrained systems. We present R3R which, to our knowledge, is the first decentralized and asynchronous framework for multi-agent motion planning under range-limited communication constraints with infinite-horizon safety guarantees for systems of nonlinear agents. R3R's novelty lies in combining our gatekeeper safety framework with a geometric constraint termed R-Boundedness, which together establish a formal link between an agent's communication radius and its ability to plan safely. We constrain trajectories to lie within a fixed planning radius, determined by a function of the agent's communication radius. This enables trajectories to be certified as provably safe for all time using only local information. Our algorithm is fully asynchronous, and ensures the forward invariance of these guarantees even in time-varying networks where agents asynchronously join and replan. We evaluate our approach in simulations of up to 128 Dubins vehicles, validating our theoretical safety guarantees in dense, obstacle-rich scenarios. We further show that R3R's computational complexity scales with local agent density rather than problem size, providing a practical solution for scalable and provably safe multi-agent systems.
|
| |
| 15:45-16:00, Paper FrC08.2 | Add to My Program |
| Stability Analysis of Vehicle Platooning with PID Control Via Dominant Pole Placement |
|
| Choudhary, Isha | Indian Institute of Technology Mandi |
| Dixit, Shilp | Vay Technology Inc., Las Vegas, USA |
| Halder, Kaushik | Indian Institute of Technology Mandi |
Keywords: Multivehicle systems, Cooperative control, PID control
Abstract: This paper proposes a novel distributed proportional-integral-derivative (PID) controller design using the dominant pole-placement method for homogeneous vehicle platoon systems with directed and undirected topologies under the constant time headway policy (CTHP). Analytical expressions for PID gains are derived using a coefficient matching approach, considering all real non-dominant poles. String stability criteria are derived for both the directed predecessor-following (PF) and undirected bidirectional (BD) topologies. A genetic algorithm (GA) is employed to optimize three key elements: (i) closed-loop design parameters, (ii) PID controller gains, and (iii) time headway. The algorithm searches the stabilizable region of the controller parameter space to satisfy derived PID analytical expressions and string stability criteria while minimizing the integral square error (ISE). The proposed PID controller ensures both internal and string stability. The effectiveness of the design is validated through numerical simulations on a five-vehicle platoon with PF and BD topologies under CTHP.
|
| |
| 16:00-16:15, Paper FrC08.3 | Add to My Program |
| A Lyapunov-Based Formation Control Framework for Multi-Agent Systems in Space-Restricted and Obstacle-Populated Environments |
|
| Protoulis, Teo | University of West Attica |
| Alexandridis, Alex | University of West Attica |
Keywords: Multivehicle systems, Lyapunov methods, Autonomous systems
Abstract: In this paper, a Lyapunov-based framework for flexible, distance-based, and leaderless formation control of multi-agent systems (MAS) in space-restricted and obstacle-populated environments is proposed. A control Lyapunov function methodology that considers a generic dynamic model of autonomous vehicles is introduced to derive decentralized control laws that are computed locally by each agent and guarantee a number of crucial properties. Among others, the feedback laws ensure that the swarm satisfies strict prespecified formation bounds, while guaranteeing inter-agent collision avoidance, convergence of the vehicles’ velocities to the desired values and adherence to hard upper and lower bounds, and satisfaction of spatial restrictions imposed by the navigation environment. Simulation results on a swarm of fixed-wing unmanned aerial vehicles validate the theoretical findings.
|
| |
| 16:15-16:30, Paper FrC08.4 | Add to My Program |
| Target Tracking with Field-Of-View Constraints Using Sentry-Type Robots with Limited Observations |
|
| O'Brien, Richard | United States Naval Academy |
| Dawkins, Jeremy | United States Naval Academy |
| Galloway, Kevin | United States Naval Academy |
| Kutzer, Michael | United States Naval Academy |
Keywords: Sensor fusion, Multivehicle systems, Kalman filtering
Abstract: A multi-robot, target tracking algorithm is proposed that improves tracking accuracy by imposing field-of-view (FOV) constraints based on each robot’s sensor characteristics. The algorithm is independent of the underlying tracking scheme and is evaluated using experimental data collected from sentry-type robots whose trajectories are independent of positive detections of the target. These data are processed using an unscented Kalman filter (UKF) with and without the FOV constraint. The UKF with FOV constraint delivers significantly less tracking error and recovers much more quickly than the standard UKF in periods where the target is unobserved.
|
| |
| 16:30-16:45, Paper FrC08.5 | Add to My Program |
| Combining Cooperative Re-Routing with Intersection Coordination for Connected and Automated Vehicles in Urban Networks |
|
| Typaldos, Panagiotis | Cornell University |
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Cooperative control, Traffic control, Simulation
Abstract: In this paper, we present a hierarchical framework integrating upper-level routing with low-level optimal trajectory planning for connected and automated vehicles (CAVs) in urban networks. The upper-level controller distributes traffic flows using a dynamic re-routing algorithm that leverages real-time density information and fundamental diagrams to predict when edges reach critical density, proactively adjusting routing weights to prevent congestion. The low-level controller coordinates CAVs at signal-free intersections, generating optimal, fuel-efficient trajectories while ensuring safety through constraint satisfaction. We formulate this as an optimal control problem with an analytical solution. SUMO micro-simulation experiments on a realistic network demonstrate that our framework significantly outperforms baseline static routing, achieving 55.8% reduction in delays and 12.1% reduction in travel times, showing substantial improvements in urban traffic efficiency
|
| |
| 16:45-17:00, Paper FrC08.6 | Add to My Program |
| A Coordinated Routing Approach for Enhancing Bus Timeliness and Travel Efficiency in Mixed-Traffic Environment |
|
| Liang, Tanlu | Cornell University |
| Bai, Ting | Cornell University |
| Malikopoulos, Andreas A. | Cornell University |
Keywords: Transportation networks, Autonomous systems, Cooperative control
Abstract: This paper proposes a coordinated routing approach that investigates the use of connected and automated vehicles (CAVs) in dedicated bus lanes. The aim is to improve bus schedule adherence while enhancing the travel efficiency of CAVs during the transitional phase of mixed traffic environments. Our approach utilizes real-time traffic data to dynamically reroute CAVs in anticipation of congestion. By continuously monitoring traffic conditions on dedicated lanes and tracking the real-time positions of buses, the system adjusts CAV routes in advance to avoid potential interference with operating buses. This cooperation reduces CAV travel times and minimizes delays that impact transit services. The proposed strategy is validated using microscopic traffic simulations in SUMO. The results demonstrate significant improvements in both transit on-time performance and CAV travel efficiency across a range of traffic conditions.
|
| |
| FrC09 Regular Session, Grand Salon 12 |
Add to My Program |
| Energy Systems |
|
| |
| Chair: Mukherjee, Sayak | Pacific Northwest National Laboratory |
| Co-Chair: Narimani, Mohammad Rasoul | California State University Northridge |
| |
| 15:30-15:45, Paper FrC09.1 | Add to My Program |
| Control Affine Hybrid Power Plant Subsystem Modeling for Supervisory Control Design |
|
| Ampleman, Stephen | Johns Hopkins University |
| Sharma, Himanshu | Pacific Northwest National Laboratory |
| Mukherjee, Sayak | Pacific Northwest National Laboratory |
| Glavaski, Sonja | Pacific Northwest National Lab |
Keywords: Energy systems, Constrained control, Supervisory control
Abstract: Hybrid power plants (HPPs) combine multiple generation sources and energy storage to support generation shortfalls and grid-service demands. This paper introduces a modeling and control design framework for wind-solar-battery HPPs. Specifically, this work adapts established modeling paradigms for wind farms, solar plants and battery models into a supervisory-control-oriented form. In the case of wind and battery models, generator torque and cell current control laws are developed using nonlinear control and control barrier function techniques to track commands issued by a supervisory controller while maintaining safe and stable operation. The framework is demonstrated on a utility demand-tracking task under time-varying wind and irradiance using a rule-based supervisory control law.
|
| |
| 15:45-16:00, Paper FrC09.2 | Add to My Program |
| Certifying the Nonexistence of Feasible Paths between Power System Operating Points |
|
| Narimani, Mohammad Rasoul | California State University Northridge |
| Davis, Katherine | Texas A&M |
| Molzahn, Daniel | Georgia Institute of Technology |
Keywords: Energy systems, Optimization algorithms, Power systems
Abstract: By providing the optimal operating point that satisfies both the power flow equations and engineering limits, the optimal power flow (OPF) problem is central to the operation of electric power systems. While extensive research efforts have focused on reliably computing high-quality OPF solutions, assessing the feasibility of transitioning between operating points remains challenging since the feasible spaces of OPF problems may consist of multiple disconnected components. It is not possible to transition between operating points in different disconnected components without violating OPF constraints. To identify such situations, this paper introduces an algorithm for certifying the infeasibility of transitioning between two operating points within an OPF feasible space. As an indication of potential disconnectedness, the algorithm first seeks an infeasible point on the line connecting a pair of feasible points. The algorithm then certifies disconnectedness by using convex relaxation and bound tightening techniques to show that all points on the plane that is normal to this line are infeasible. Using this algorithm, we provide the first certifications of disconnected feasible spaces for a variety of OPF test cases.
|
| |
| 16:00-16:15, Paper FrC09.3 | Add to My Program |
| Control Co-Design for Maximizing the Stochastic Endurance of a Hybrid Electric Tiltrotor Drone |
|
| Haddad, Noushin | University of Maryland, College Park |
| Fathy, Hosam K. | University of Maryland |
Keywords: Energy systems, Power systems, Optimization
Abstract: This paper examines the problem of maximizing the endurance of an unmanned aerial vehicle. The paper focuses on a tiltrotor vehicle configuration. This configuration has the advantage of combining vertical takeoff/landing with more efficient horizontal flight. However, this comes at the disadvantage of a substantial power demand discrepancy between takeoff/landing versus cruise. This discrepancy motivates powertrain hybridization. Specifically, the paper examines a parallel hybrid powertrain that combines the energy and power density advantages of internal combustion and battery-electric propulsion, respectively, using a mix of lithium-ion and lithium-sulfur batteries. We define stochastic endurance in terms of the ability to execute a Monte Carlo-simulated family of flight missions without power shortages. We then pose a control co design problem, where the objective is endurance maximization and the optimization variables are the sizing of the powertrain components and the parameterization of an onboard power management policy. This policy uses Gaussian kernels to compute the power split ratio between the powertrain subsystems based on total power demand and the “states of charge” of the onboard energy stores. A simulation study shows that this approach leads to attractive endurance levels and supports mission planning by quantifying the impact of initial fuel and payload allocations on endurance.
|
| |
| 16:15-16:30, Paper FrC09.4 | Add to My Program |
| Small HVAC Control Demonstrations in Larger Buildings Often Overestimate Savings |
|
| J. Khabbazi, Arash | Purdue University |
| J. Kircher, Kevin | Purdue University |
Keywords: Building and facility automation, Energy systems, Linear systems
Abstract: How much energy, money, and emissions can advanced control of heating and cooling equipment save in real buildings? To address this question, researchers sometimes control a small number of thermal zones within a larger multi-zone building, then report savings for the controlled zones only. That approach can overestimate savings by neglecting heat transfer between controlled zones and adjacent zones. This paper mathematically characterizes the overestimation error when the dynamics are linear and the objectives are linear in the thermal load, as usually holds when optimizing energy efficiency, energy costs, or emissions. Overestimation errors can be large even in seemingly innocuous situations. For example, when controlling only interior zones that have no direct thermal contact with the outdoors, all perceived savings are fictitious. This paper provides an alternative estimation method based on the controlled and adjacent zones' temperature measurements. The new method does not require estimating how much energy the building would have used under baseline operations, so it removes the additional measurement and verification challenge of accurate baseline estimation.
|
| |
| 16:30-16:45, Paper FrC09.5 | Add to My Program |
| Risk-Sensitive Model Predictive Control for Grid Services through Distributed Energy Resources: A Conditional Value-At-Risk Approach |
|
| Khalil, Ahmed | The University of Texas at Austin |
| Sharma, Himanshu | Pacific Northwest National Laboratory |
| Wang, Wei | Pacific Northwest National Laboratory |
| Ramachandran, Thiagarajan | Pacific Northwest National Laboratory |
Keywords: Energy systems, Uncertain systems, Optimal control
Abstract: Large aggregations of distributed energy resources (DERs) are increasingly used to provide grid services, such as ramping and regulation, yet their integration is challenged by volatile net demand and stringent real-time safety requirements. This work proposes a risk-sensitive closed-loop model predictive control (MPC) framework to optimally allocate heterogeneous DERs while mitigating the impact of rare but severe demand excursions, such as sudden load spikes of large electric loads. The MPC approach leverages the fact that short-term load forecasts are more accurate, enabling each step in the rolling optimization sequence to use refined predictions while ensuring constraint satisfaction. To achieve this, the closed-loop model-predictive controller is parametrized using the System-Level Synthesis (SLS) framework, and risk aversion is achieved by embedding a Conditional Value-at-Risk (CVaR) constraint that explicitly limits the expected impact of extreme forecast errors. Numerical simulations on demand-forecast datasets demonstrate the proposed approach's ability to reduce the power required from bulk generation in real time and to account for forecast demand uncertainty.
|
| |
| 16:45-17:00, Paper FrC09.6 | Add to My Program |
| Coordinated Control of Active and Passive Elements for Energy-Efficient Building Temperature Regulation |
|
| Ishtiaq, Fardin | Rensselaer Polytechnic Institute |
| Rempel, Alexandra | University of Oregon |
| Kar, Koushik | Rensselaer Polytechnic Institute |
| Mishra, Sandipan | Rensselaer Polytechnic Institute |
Keywords: Building and facility automation, Energy systems, Mechatronics
Abstract: Space conditioning accounts for nearly half of building energy use and associated CO_2 emissions. Conventional indoor environmental control strategies rely on heating, ventilation, and air conditioning systems to regulate indoor temperature reliably, but they often do not exploit passive strategies such as shading and natural ventilation, which limits potential energy savings. Purely passive mechanisms, on the other hand, cannot maintain comfort across all climates and seasons. This paper develops a coordinated strategy for the joint control of active heating and cooling systems and operable passive elements. The proposed strategy combines decentralized PI control of active heating and cooling, with reinforcement learning (RL) agents that actuate operable passive elements such as window openings. The control objective is to minimize energy consumption while keeping zone temperatures within a prescribed band. Because the active and passive inputs are coupled through thermal dynamics, analytical conditions for Lyapunov stability and ultimate boundedness are derived for this joint control strategy. The portability of the proposed approach was demonstrated by training the RL agent on a single-zone building and deploying it on a multi-zone building without retraining. EnergyPlus simulations of a multi-zone dwelling in three cities and two seasons show reductions of 40--74% in degree-hour discomfort and 54--86% in active energy loads relative to a baseline PI controller with rule-based logic for passive elements. The coordinated controller kept the indoor temperature within 0.5^circC below and 1.4^circC above the comfort band. These results highlight the potential of coordinated active and passive strategies to deliver substantial energy savings while maintaining occupant comfort.
|
| |
| FrC10 Regular Session, Grand Salon 15 |
Add to My Program |
| Process Control |
|
| |
| Chair: Pourkargar, Davood | Kansas State University |
| Co-Chair: Mitrai, Ilias | The University of Texas at Austin |
| |
| 15:30-15:45, Paper FrC10.1 | Add to My Program |
| A Distributed Machine Learning Approach for Cyberattack Detection in Integrated Process Systems |
|
| Bagheri, Amirsalar | Kansas State University |
| Ebrahimi, AmirMohammad | Kansas State University |
| Pourkargar, Davood | Kansas State University |
Keywords: Chemical process control, Machine learning, Sensor networks
Abstract: Integrated process systems enhance operational efficiency but increase the cyberattack surface due to dense interconnections and reliance on networked sensing. This study develops a community-aware distributed transformer framework that jointly performs attack detection and localization of compromised sensors. A spectral community detection algorithm is used to decompose the graph representation of the process into strongly interacting communities. Within each community, a transformer learns the dynamics of local measured outputs and identifies sensor-level attack likelihoods and a community-level detection flag. System-level decisions are then obtained by aggregating the outputs across communities. The framework is evaluated on an integrated benzene alkylation process, with training data generated from closed-loop simulations under model predictive control and moving horizon estimation across multiple adversarial scenarios, including single- and multi-sensor attacks. Compared with centralized end-to-end and two-tier transformer baselines, the distributed approach attains higher accuracy, F1 scores, and Jaccard indices, demonstrating improved robustness and more precise localization of compromised sensors.
|
| |
| 15:45-16:00, Paper FrC10.2 | Add to My Program |
| Discovering Interpretable Piecewise Nonlinear Model Predictive Control Laws Via Symbolic Decision Trees |
|
| Mitrai, Ilias | The University of Texas at Austin |
Keywords: Chemical process control, Process Control, Machine learning
Abstract: In this paper, we propose symbolic decision trees as surrogate models for approximating model predictive control laws. The proposed approach learns simultaneously the partition of the input domain (splitting logic) as well as local nonlinear expressions for predicting the control action leading to interpretable piecewise nonlinear control laws. The local nonlinear expressions are determined by the learning problem and are modeled using a set of basis functions. The learning task is posed as a mixed integer optimization problem, which is solved to global optimality with state-of-the-art global optimization solvers. We apply the proposed approach to a case study regarding the control of an isothermal reactor. The results show that the proposed approach can learn the control law accurately, leading to closed-loop performance comparable to that of a standard model predictive controller. Finally, comparison with existing interpretable models shows that the symbolic trees achieve both lower prediction error and superior closed-loop performance.
|
| |
| 16:00-16:15, Paper FrC10.3 | Add to My Program |
| Approximate Dynamic Optimization Via Deep Neural Operators |
|
| Nassaji, Amin | University of Minnesota |
| Mitrai, Ilias | The University of Texas at Austin |
| Daoutidis, Prodromos | Univ. of Minnesota |
Keywords: Chemical process control, Process Control, Machine learning
Abstract: This paper addresses the solution of nonlinear dynamic optimization problems that compute optimal manipu- lated input profiles to enforce desired output profiles. Such tra- jectory optimization problems commonly arise in chemical pro- cess applications, for example, batch processes where optimal temperature or feeding profiles (in case of fed-batch processes) are calculated to enforce time-varying product quality profiles, tightly controlling the reaction rate or rate of heat generation. We propose deep neural operators that approximate function to function mappings as surrogates for the solution of such dynamic optimization problems. We specifically employ deep operator networks (DeepONets) and Fourier-enhanced Deep- ONets in a batch polymerization reactor case study for which number-average and weight-average molecular weight profiles, together with a final conversion target, are enforced through an optimal temperature program. Our results show that the Fourier-enhanced DeepONet architecture performs very well in approximating the solution of the dynamic optimization problem for different instances, achieving a lower prediction error compared to the standard DeepONet architecture and standard feedforward neural networks.
|
| |
| 16:15-16:30, Paper FrC10.4 | Add to My Program |
| Cooling under Convexity: An Inventory Control Perspective on Industrial Refrigeration |
|
| Shah, Vade | University of California, Santa Barbara |
| John, Yohan | University of California, Santa Barbara |
| Freifeld, Ethan | University of California, Santa Barbara |
| Chen, Lily Yuxuan | University of California Santa Barbara |
| Marden, Jason R. | University of California, Santa Barbara |
Keywords: Process Control, Optimization, Optimal control
Abstract: Industrial refrigeration systems have tremendous energy needs, but optimizing their operation remains challenging due to the tension between minimizing energy costs and meeting strict cooling requirements. Load shifting--strategic overcooling in anticipation of future demands--offers substantial efficiency gains. This work seeks to rigorously quantify these potential savings through the derivation of optimal load shifting policies. Our first contribution establishes a novel connection between industrial refrigeration and inventory control problems with convex ordering costs, where the convexity arises from the fundamental relationship between energy consumption and cooling capacity. Leveraging this formulation, we derive three main theoretical results: (1) an optimal algorithm for deterministic demand scenarios, with proof that optimal trajectories are non-increasing which is a valuable structural insight for practical control; (2) performance bounds that quantify the value of load shifting as a function of cost convexity, demand variability, and temporal patterns; (3) a computationally tractable load shifting heuristic with provable near-optimal performance under uncertainty. Numerical simulations validate our theoretical findings, and a case study using real industrial refrigeration data demonstrates an opportunity for improved load shifting.
|
| |
| 16:30-16:45, Paper FrC10.5 | Add to My Program |
| Real-Time Classification of Industrial Alarm Floods Using Context-Enriched Embeddings |
|
| Mohan Rao, Harikrishna Rao | University of Alberta |
| Tamascelli, Nicola | ABB AG |
| Chen, Tongwen | University of Alberta |
Keywords: Chemical process control, Process Control, Pattern recognition and classification
Abstract: Alarm floods are critical operational challenges in process industries, overwhelming operators and increasing the risk of missed or delayed responses. This paper presents a context-enriched Word2Vec framework for real-time alarm flood classification, designed to capture temporal spacing, activation order, and alarm priority within an embedding space. By extending the generic Skip-gram model with enriched tokenization and priority-weighted training, the method addresses key challenges such as variable sequence lengths, overlapping alarms across categories, and imbalance between high- and low-priority events. The framework is evaluated on the Tennessee Eastman benchmark, where it achieved high classification accuracy, with early divergence of similarity trajectories enabling timely identification of fault categories. Compared to a generic Word2Vec baseline, the enriched embeddings consistently improved both early-stage and overall classification performance while maintaining computational efficiency suitable for online applications. These results highlight the potential of computationally efficient natural language processing (NLP)-inspired approaches to support operators with faster and more reliable decision-making during industrial alarm floods.
|
| |
| 16:45-17:00, Paper FrC10.6 | Add to My Program |
| Multi-Agent Reinforcement Learning Based Adaptive PI Controller for Multivariable Process Control Applications |
|
| Yadav, Sourabh | Indian Institute of Technology Kanpur |
| Detroja, Ketan P. | Indian Institute of Technology Hyderabad |
Keywords: PID control, Reinforcement learning, Decentralized control
Abstract: Designing controllers for industrial processes is a daunting task, particularly because most industrial processes are multi-input multi-output (MIMO) in nature. This introduces coupling effects and loop interactions, which are not encountered in the case of single-input single-output (SISO) processes. In this manuscript, a novel multi-agent reinforcement learning-based approach for designing adaptive PI controllers is proposed. The Smith predictor is incorporated to compensate for the deadtime in the diagonal elements of the process transfer function. The proposed method employs the Deep Deterministic Policy Gradient (DDPG) algorithm with Long-Short-Term Memory (LSTM) based actor-critic network architecture to design adaptive PI controllers for a MIMO process. The performance of the proposed controller is measured against some of the existing methods for designing classical PI controllers, using performance indices such as Integral Absolute Error (IAE) and Integral Square Error (ISE). The proposed controller demonstrates superior performance compared to classical state-of-the-art PI controllers. The proposed controller is performing well even in the presence of plant-model mismatches, delay uncertainties, and disturbances, demonstrating robust performance.
|
| |
| FrC11 Regular Session, Grand Salon 16 |
Add to My Program |
| Robotics III |
|
| |
| Chair: Chou, Glen | Georgia Institute of Technology |
| Co-Chair: Chhabra, Robin | Toronto Metropolitan University |
| |
| 15:30-15:45, Paper FrC11.1 | Add to My Program |
| Shared Object Manipulation with a Team of Collaborative Quadrupeds |
|
| Wang, Shengzhi | The Chinese University of Hong Kong |
| Dehio, Niels | Technische Universität Braunschweig |
| Zeng, Xuanqi | Chinses University of Hong Kong |
| Yang, Xian | The Chinese University of Hong Kong |
| Zhang, Lingwei | CUHK |
| Liu, Yun Hui | The Chinese Univ. of Hong Kong |
| Au, Kwok Wai Samuel | CUHK |
Keywords: Robotics, Control applications, Cooperative control
Abstract: Utilizing teams of multiple robots is advantageous for handling bulky objects. Many related works focus on multi-manipulator systems, which are limited by workspace constraints. In this paper, we extend a classical hybrid motion-force controller to a team of legged manipulator systems, enabling collaborative loco-manipulation of rigid objects with a force-closed grasp. Our novel approach allows the robots to flexibly coordinate their movements, achieving efficient and stable object co-manipulation and transport, validated through extensive simulations and real-world experiments.
|
| |
| 15:45-16:00, Paper FrC11.2 | Add to My Program |
| Learning Constraints from Stochastic Partially-Observed Closed-Loop Demonstrations |
|
| Chiu, Chih-Yuan | Georgia Institute of Technology |
| Zhang, Zhouyu | Georgia Institute of Technology |
| Chou, Glen | Georgia Institute of Technology |
Keywords: Robotics, Optimal control, Autonomous systems
Abstract: We present a method for learning unknown parametric constraints from locally-optimal input-output trajectory data. We assume the data is generated by rollouts of stochastic nonlinear dynamics, under a single state or output feedback law and initial condition but distinct noise realizations, to robustly satisfy underlying constraints despite worst-case noise outcomes. We encode the Karush-Kuhn-Tucker (KKT) conditions of this robust optimal feedback control problem within a feasibility problem to recover constraints consistent with the local optimality of the demonstrations. We prove that our constraint learning method (i) accurately recovers the demonstrator’s policy, and (ii) conservatively estimates the set of policies that ensure constraint satisfaction despite worst-case noise realizations. Moreover, we perform sensitivity analysis, proving that when demonstrations are corrupted by transmission error, the inaccuracy in the learned feedback law scales linearly in the error magnitude. Empirically, our method accurately recovers unknown constraints from simulated noisy, closed-loop demonstrations generated using dynamics, both linear and nonlinear, (e.g., unicycle and quadrotor) and a range of feedback mechanisms.
|
| |
| 16:00-16:15, Paper FrC11.3 | Add to My Program |
| Safe Obstacle-Free Guidance of Space Manipulators in Debris Removal Missions Via Deep Reinforcement Learning |
|
| Lam, Vincent | Toronto Metropolitan University |
| Chhabra, Robin | Toronto Metropolitan University |
Keywords: Robotics, Reinforcement learning, Autonomous robots
Abstract: The objective of this study is to develop a model-free taskspace trajectory planner for space manipulators that operates in real time, using a Twin Delayed Deep Deterministic Policy Gradient (TD3) agent to enable safe and reliable debris capture. A local control strategy with singularity avoidance and manipulability enhancement is employed to ensure stable execution. The manipulator must simultaneously track a capture point on a non-cooperative target, avoid self-collisions, and prevent unintended contact with the target. To address these challenges, we propose a curriculum-based multi-critic network where one critic emphasizes accurate tracking and the other enforces collision avoidance. A prioritized experience replay buffer is also used to accelerate convergence and improve policy robustness. The framework is evaluated on a simulated seven-degree-of-freedom KUKA LBR iiwa mounted on a free-floating base in Matlab/Simulink, demonstrating safe and adaptive trajectory generation for debris removal missions.
|
| |
| 16:15-16:30, Paper FrC11.4 | Add to My Program |
| Torques-To-Pixels: Verification of Visual Controllers |
|
| Estornell, Alexander | Northeastern University |
| Jung, Leonard | Northeastern University |
| Everett, Michael | Northeastern University |
Keywords: Vision-based control, Robust control
Abstract: Perception-based neural network controllers are increasingly used in autonomous systems that rely on visual inputs to operate in the real world. Ensuring the safety of such systems under uncertainty is challenging. Existing verification techniques typically focus on Lp-bounded perturbations in the pixel space, which fails to capture the low-dimensional structure of many real-world effects. In this work, we introduce a novel verification framework for perception-based controllers that can generate outer-approximations of reachable sets through explicitly modeling uncertain observations with geometric perturbations. Our approach constructs a boundable mapping from states to images, enabling the use of state-based verification tools while accounting for uncertainty in perception. We provide theoretical guarantees on the soundness of our method and demonstrate its effectiveness across benchmark control environments. This work provides a principled framework for certifying the safety of perception-driven control systems under realistic visual perturbations.
|
| |
| 16:30-16:45, Paper FrC11.5 | Add to My Program |
| Gaussian Mixture-Based Inverse Perception Contract for Uncertainty-Aware Robot Navigation |
|
| Du, Bingyao | Columbia University |
| Kim, Joonkyung | Texas A&M University |
| Lyu, Yiwei | Texas A&M University |
Keywords: Robotics, Vision-based control
Abstract: Reliable navigation in cluttered environments requires perception outputs that are not only accurate but also equipped with uncertainty sets suitable for safe control. An inverse perception contract (IPC) provides such a connection by mapping perceptual estimates to sets that contain the ground truth with high confidence. Existing IPC formulations, however, instantiate uncertainty as a single ellipsoidal set and rely on deterministic trust scores to guide robot motion. Such a representation cannot capture the multi-modal and irregular structure of fine-grained perception errors, often resulting in over-conservative sets and degraded navigation performance. In this work, we introduce Gaussian Mixture-based Inverse Perception Contract (GM-IPC), which extends IPC to represent uncertainty with unions of ellipsoidal confidence sets derived from Gaussian mixture models. This design moves beyond deterministic single-set abstractions, enabling fine-grained, multi-modal, and non-convex error structures to be captured with formal guarantees. A learning framework is presented that trains GM-IPC to account for probabilistic inclusion, distribution matching, and empty-space penalties, ensuring both validity and compactness of the predicted sets. We further show that the resulting uncertainty characterizations can be leveraged in downstream planning frameworks for real-time safe navigation, enabling less conservative and more adaptive robot motion while preserving safety in a probabilistic manner.
|
| |
| 16:45-17:00, Paper FrC11.6 | Add to My Program |
| A Simulation Evaluation Suite for Robust Adaptive Quadcopter Control |
|
| Zhang, Dingqi | University of California, Berkeley |
| Tao, Ran | University of Illinois at Urbana-Champaign |
| Cheng, Sheng | University of Illinois Urbana-Champaign |
| Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
| Mueller, Mark W. | University of California, Berkeley |
Keywords: Robotics, Robust adaptive control, Simulation
Abstract: Robust adaptive control methods are essential for maintaining quadcopter performance under external disturbances and model uncertainties. However, fragmented evaluations across tasks, simulators, and implementations hinder systematic comparison of these methods. This paper introduces an easy-to-deploy, modular simulation testbed for quadcopter control, built on textit{RotorPy}, that enables evaluation under a wide range of disturbances such as wind, payload shifts, rotor faults, and control latency. The framework includes a library of representative adaptive and non-adaptive controllers and provides task-relevant metrics to assess tracking accuracy and robustness. The unified modular environment enables reproducible evaluation across control methods and eliminates redundant reimplementation of components such as disturbance models, trajectory generators, and analysis tools. We illustrate the testbed’s versatility through examples spanning multiple disturbance scenarios and trajectory types, including automated stress testing, to demonstrate its utility for systematic analysis. Code is available at url{https://github.com/Dz298/AdaptiveQuadBench}.
|
| |
| FrC12 Regular Session, Grand Salon 18 |
Add to My Program |
| Formal Verification and Synthesis |
|
| |
| Chair: Mallada, Enrique | Johns Hopkins University |
| Co-Chair: Aksaray, Derya | Northeastern University |
| |
| 15:30-15:45, Paper FrC12.1 | Add to My Program |
| Learning Linear Temporal Specifications from Demonstrations with Uncertainty |
|
| Fahim, Parastou | Penn State University |
| Lagoa, Constantino M. | Pennsylvania State Univ |
| Meira-Goes, Romulo | Pennsylvania State University |
Keywords: Formal verification/synthesis, Automata, Learning
Abstract: Learning temporal logic specifications from system demonstrations is essential for tasks such as formal verification and controller synthesis, especially in safety-critical domains. Existing approaches typically assume demonstrations are correct or only affected by misclassification errors. In practice, however, system traces are often uncertain or incomplete due to sensor faults, measurement errors, or data loss. We present a framework for learning minimal Linear Temporal Logic (LTL) formulas from demonstrations with uncertainty. Our approach models uncertainty via Hamming distance to generate possible estimates around each observed trace, which are grouped with constraints requiring that at least one trace per group is consistent with the learned formula. Our problem is then reduced to an equivalent Pseudo-Boolean Optimization. We evaluate our method against state-of-the-art LTL learning approaches and show that it recovers specifications that more closely align with ground-truth formulas under uncertainty.
|
| |
| 15:45-16:00, Paper FrC12.2 | Add to My Program |
| Motion Planning under Temporal Logic Specifications in Semantically Unknown Environments |
|
| Taheri, Azizollah | Northeastern University |
| Aksaray, Derya | Northeastern University |
Keywords: Formal verification/synthesis, Autonomous robots, Optimization
Abstract: This paper addresses a motion planning problem to achieve spatio-temporal-logical tasks, expressed by syntactically co-safe linear temporal logic specifications (scLTLnext), in uncertain environments. Here, the uncertainty is modeled as some probabilistic knowledge on the semantic labels of the environment. For example, the task is "first go to region 1, then go to region 2"; however, the exact locations of regions 1 and 2 are not known a priori, instead a probabilistic belief is available. We propose a novel automata-theoretic approach, where a special product automaton is constructed to capture the uncertainty related to semantic labels, and a reward function is designed for each edge of this product automaton. The proposed algorithm utilizes value iteration for online replanning. We show some theoretical results and present some simulations/experiments to demonstrate the efficacy of the proposed approach.
|
| |
| 16:00-16:15, Paper FrC12.3 | Add to My Program |
| Temporal-Logic-Aware Frontier-Based Exploration |
|
| Taheri, Azizollah | Northeastern University |
| Aksaray, Derya | Northeastern University |
Keywords: Formal verification/synthesis, Autonomous robots
Abstract: This paper addresses the problem of temporal logic motion planning for an autonomous robot operating in an unknown environment. The objective is to enable the robot to satisfy a syntactically co-safe Linear Temporal Logic (scLTL) specification when the exact locations of the desired labels are not known a priori. We introduce a new type of automaton state, referred to as commit states. These states capture intermediate task progress resulting from actions whose consequences are irreversible. In other words, certain future paths to satisfaction become not feasible after taking those actions that lead to the commit states. By leveraging commit states, we propose a sound and complete frontier-based exploration algorithm that strategically guides the robot to make progress toward the task while preserving all possible ways of satisfying it. The efficacy of the proposed method is validated through simulations.
|
| |
| 16:15-16:30, Paper FrC12.4 | Add to My Program |
| Safety-Critical Control Via Recurrent Tracking Functions |
|
| Liu, Jixian | Johns Hopkins University |
| Mallada, Enrique | Johns Hopkins University |
Keywords: Formal verification/synthesis, Closed-loop identification, Robotics
Abstract: This paper addresses the challenge of synthesizing safety-critical controllers for high-order nonlinear systems, where constructing valid Control Barrier Functions (CBFs) remains computationally intractable. Leveraging layered control, we design CBFs in reduced-order models (RoMs) while regulating full-order models' (FoMs) dynamics at the same time. Traditional Lyapunov tracking functions are required to decrease monotonically, and systematic synthesis methods for such functions exist only for fully-actuated systems. To overcome this limitation, we introduce Recurrent Tracking Functions (RTFs), which replace the monotonic decay requirement with a weaker finite-time recurrence condition. This relaxation permits transient deviations of tracking errors while ensuring safety. By integrating CBFs for RoMs with RTFs, we construct recurrent CBFs (RCBFs) whose zero-superlevel set is control tau-recurrent, and guarantee safety for all initial states in such a set when RTFs are satisfied. We establish theoretical safety guarantees and validate the approach through numerical experiments, demonstrating RTFs' effectiveness and the safety of FoMs.
|
| |
| 16:30-16:45, Paper FrC12.5 | Add to My Program |
| Safe and Optimal Learning from Preferences Via Weighted Temporal Logic with Applications in Robotics and Formula 1 |
|
| Karagulle, Ruya | Univ. of Michigan |
| Vasile, Cristian Ioan | Lehigh University |
| Ozay, Necmiye | Univ. of Michigan |
Keywords: Formal verification/synthesis, Human-in-the-loop control, Machine learning
Abstract: Autonomous systems increasingly rely on human feedback to align their behavior, expressed as pairwise comparisons, rankings, or demonstrations. While existing methods can adapt behaviors, they often fail to guarantee safety in safety-critical domains. We propose a safety-guaranteed, optimal, and efficient approach for solving the learning problem from preferences, rankings, or demonstrations using Weighted Signal Temporal Logic (WSTL). WSTL learning problems, when implemented naively, lead to multi-linear constraints in the weights to be learned. By introducing structural pruning and log-transform procedures, we reduce the problem size and recast it as a Mixed-Integer Linear Program while preserving safety guarantees. Experiments on robotic navigation and real-world Formula 1 data demonstrate that the method effectively captures nuanced preferences and models complex task objectives.
|
| |
| 16:45-17:00, Paper FrC12.6 | Add to My Program |
| Shielded Reinforcement Learning under Dynamic Temporal Logic Constraints |
|
| Yuksel, Sadik Bera | Northeastern University |
| Buyukkocak, Ali Tevfik | University of Minnesota |
| Aksaray, Derya | Northeastern University |
Keywords: Reinforcement learning, Formal verification/synthesis, Autonomous robots
Abstract: Reinforcement Learning (RL) has shown promise in various robotics applications, yet its deployment on real systems is still limited due to safety and operational constraints. The safe RL field has gained considerable attention in recent years, which focuses on imposing safety constraints throughout the learning process. However, real systems often require more complex constraints than just safety, such as periodic recharging or time-bounded visits to specific regions. Imposing such spatio-temporal tasks during learning still remains a challenge. Signal Temporal Logic (STL) is a formal language for specifying temporal properties of real-valued signals and provides a way to express such complex tasks. In this paper, we propose a framework that leverages sequential control barrier functions and model-free RL to ensure that the given STL tasks are satisfied throughout the learning process. Our method extends beyond traditional safety constraints by enforcing rich STL specifications, which can involve visits to dynamic targets with unknown trajectories. We also demonstrate the effectiveness of our framework through various simulations.
|
| |
| FrC13 Regular Session, Grand Salon 19 |
Add to My Program |
| Agent Based Systems II |
|
| |
| Chair: Lin, Zongli | University of Virginia |
| Co-Chair: Selmic, Rastko | Concordia University |
| |
| 15:30-15:45, Paper FrC13.1 | Add to My Program |
| Resilient Leader-Following Consensus Control Using Set-Membership Fuzzy Filtering |
|
| Rahimifard, Mahshid | Concordia University |
| Selmic, Rastko | Louisiana Tech University |
Keywords: Agents-based systems, Cooperative control, Fault tolerant systems
Abstract: This paper presents a resilient leader–following consensus control method for a class of nonlinear multi-agent systems under cyberattacks. A Takagi–Sugeno (T–S) fuzzy model is employed to approximate the system nonlinearities. A resilient leader-following consensus control law is developed to guarantee that the system states remain within a bounded ellipsoidal set of the leader state despite the simultaneous presence of Unknown But Bounded (UBB) noise, Denial of Service (DoS) attacks on control signals, and deception attacks on sensor measurements. Furthermore, a two-step resilient set-membership estimation method is proposed, together with a fuzzy set-membership filtering approach to provide reliable state estimates in the presence of UBB noise and cyberattacks. Numerical simulations demonstrate the effectiveness of the proposed method.
|
| |
| 15:45-16:00, Paper FrC13.2 | Add to My Program |
| Distributed Prescribed-Time Formation Control with Multiple Multi-Agent Systems for Smart Transportation Applications |
|
| Merzi, Mehmet Alp | University of Calabria |
| D'Alfonso, Luigi | Universitŕ Della Calabria |
| Fedele, Giuseppe | Universitŕ Della Calabria |
Keywords: Agents-based systems, Distributed control, Cooperative control
Abstract: In this paper, we present a novel approach for distributed prescribed-time formation control that addresses cooperative object transportation applications, where agents must converge to fixed positions around an object within a prescribed time, before actual transportation begins. The proposed method combines a diffeomorphism-based transformation with a swarm-inspired multi-agent framework to achieve four complementary objectives: (1) driving agents from their initial positions to fixed target locations, (2) guaranteeing convergence within a prescribed time, (3) ensuring collision avoidance throughout the process, and (4) achieving all these objectives with a simple formulation. At the core of the approach lies a time-varying affine transformation that maps agents between virtual and real reference frames, where a distributed control law based on attraction-repulsion dynamics is utilized. The virtual control law employs a time-varying gain that diverges as the time approaches the required prescribed time, enabling finite-time convergence through time normalization techniques. The method is first developed for scalar multi-agent systems and then extended to multidimensional cases by using additional diffeomorphism-based transformations.
|
| |
| 16:00-16:15, Paper FrC13.3 | Add to My Program |
| Temporal Graph-Theoretic Stability Analysis of Time-Varying Opinion Dynamics |
|
| Abedinzadeh, Mohammadhossein | BINGHAMTON UNIVERSITY Department of Electrical and Computer Engineering |
| Akyol, Emrah | SUNY Binghamton |
Keywords: Agents-based systems, Large-scale systems, Network analysis and control
Abstract: The time-varying Friedkin--Johnsen (TVFJ) model provides a principled foundation for studying realistic, time-varying social networks, as it captures both persistent individual biases and adaptive influences. Building on this model, we develop a graph-theoretic framework for stability analysis in opinion dynamics. We introduce emph{defected} and emph{weakly defected} temporal graphs (DTG/WDTG) as topological certificates that translate temporal connectivity and stubborn influence into contraction bounds on the state-transition matrix. Using these notions, we prove (i) emph{asymptotic stability} of TVFJ when DTGs recur infinitely often, and (ii) emph{exponential stability} under semi-periodic defected networks with explicit growth and decay rates. For emph{periodically switching} networks (PSFJ), we further provide a (p)-LTI decomposition that yields closed-form equilibria and establishes the tight bound (|omega|le p). Collectively, these results unify algebraic stability tests with an interpretable, topology-driven analysis, offering scalable tools for reasoning about opinion formation in evolving networks.
|
| |
| 16:15-16:30, Paper FrC13.4 | Add to My Program |
| Leader-Follower Consensus of Linear Multi-Agent Systems with Intermittent Communication and Disturbances Via Observer-Based Control |
|
| Zarei, Fatemeh | Northeastern University |
| Saeidi, Amirahmad | Amirkabir University of Technology |
| Shafai, Bahram | Northeastern Univ |
Keywords: Distributed control, Robust control, Observers for Linear systems
Abstract: In this paper, we investigate the leader–follower consensus problem for linear multi-agent systems (MASs) subject to intermittent communication and bounded exogenous disturbances. To address this challenge, we propose a distributed output-feedback control protocol that incorporates a Proportional-Integral Fading Observer (PIFO). The observer is designed to simultaneously estimate and compensate for unknown disturbances acting on each follower agent. Building on local relative information and disturbance estimates, we develop a distributed consensus control strategy by using Lyapunov stability theory and establish sufficient conditions that guarantee the asymptotic convergence of both consensus and estimation errors. Finally, simulation results are presented to confirm the theoretical findings and to demonstrate the effectiveness of the proposed method in rejecting disturbance and achieving consensus.
|
| |
| 16:30-16:45, Paper FrC13.5 | Add to My Program |
| Privacy-Preserving Dynamic Average Consensus by Masking Reference Signals |
|
| Maithripala, Mihitha | University of Virginia |
| Lin, Zongli | University of Virginia |
Keywords: Distributed control, Agents-based systems
Abstract: In multi-agent systems, dynamic average consensus (DAC) is a decentralized estimation strategy in which a set of agents tracks the average of time-varying reference signals. Because DAC requires exchanging state information with neighbors, attackers may gain access to these states and infer private information. In this paper, we develop a privacy-preserving method that protects each agent’s reference signal from external eavesdroppers and honest-but-curious agents while achieving the same convergence accuracy and convergence rate as conventional DAC. Our approach masks the reference signals by having each agent draw a random real number for each neighbor, exchanges that number over an encrypted channel at the initialization, and computes a masking value to form a masked reference. Then the agents run the conventional DAC algorithm using the masked references. Convergence and privacy analyses show that the proposed algorithm matches the convergence properties of conventional DAC while preserving the privacy of the reference signals. Numerical simulations validate the effectiveness of the proposed privacy-preserving DAC algorithm.
|
| |
| 16:45-17:00, Paper FrC13.6 | Add to My Program |
| Communication Efficient Consensus Via Lower-Dimensional Data |
|
| Rai, Ayush | Purdue University |
| Mou, Shaoshuai | Purdue University |
Keywords: Agents-based systems, Autonomous systems, Linear systems
Abstract: This paper studies consensus in multi-agent systems under limited information exchange, where each agent has access only to a low-dimensional projection of its neighbors' states. We propose a novel projection-based consensus algorithm in which agents update their states using these partial observations, governed by observation matrices that capture local interaction structures. To analyze the resulting dynamics, we introduce the concept of an extended adjacency matrix, which integrates both the network connectivity and the projection information. We provide a theoretical analysis showing that, for strongly connected directed networks and appropriately designed projections, the agents converge exponentially to a consensus within the intersection subspace of the communicated information. Numerical simulations illustrate and validate the theoretical findings, highlighting the effectiveness of the proposed method in achieving agreement under partial-state information.
|
| |
| FrC14 Regular Session, Grand Salon 21 |
Add to My Program |
| Machine Learning IV |
|
| |
| Chair: Thein, May-Win | University of New Hampshire |
| Co-Chair: La, Hung | University of Nevada |
| |
| 15:30-15:45, Paper FrC14.1 | Add to My Program |
| Uniting Reinforcement Learning and Model Predictive Control in Feedback Control of Nonlinear Processes |
|
| Cui, Xiaodong | University of California, Los Angeles |
| Khodaverdian, Arthur | University of California, Los Angeles |
| Christofides, Panagiotis D. | Univ. of California at Los Angeles |
Keywords: Reinforcement learning, Lyapunov methods, Stability of nonlinear systems
Abstract: This work explores the implementation of a reinforcement learning (RL) based approach to replace model predictive control (MPC) in cases where practical implementations of MPC are infeasible due to excessive computation times. Specifically, with the use of externally enforced stability guarantees, an RL-based controller that is trained to optimize the same cost function as MPC can serve as a potentially more appealing real-time option as opposed to using the same MPC with a shorter horizon. A benchmark nonlinear chemical process model is used to demonstrate the feasibility of this RL-based framework that simultaneously guarantees stability and enables improvements in computational efficiency and potential control quality of the closed-loop system.
|
| |
| 15:45-16:00, Paper FrC14.2 | Add to My Program |
| Learning Robust Regions of Attraction Using Rollout-Enhanced Physics-Informed Neural Networks with Policy Iteration |
|
| Wang, Junkai | Georgia Institute of Technology |
| Zhao, Yuxuan | Hong Kong University of Science and Technology |
| Zhou, Mi | Georgia Institute of Technology |
| Zhang, Fumin | Hong Kong University of Science and Technology |
Keywords: Machine learning, Robust control, Lyapunov methods
Abstract: The region of attraction is a key metric of the robustness of systems. This paper addresses the numerical solution of the generalized Zubov’s equation, which produces a special Lyapunov function characterizing the robust region of attraction for perturbed systems. To handle the highly nonlinear characteristic of the generalized Zubov's equation, we propose a physics-informed neural network framework that employs a policy iteration training scheme with rollout to approximate the viscosity solution. In addition to computing the optimal disturbance during the policy improvement process, we incorporate simulation-derived value estimates as anchor points to facilitate the training procedure to prevent singularities in both low- and high-dimensional systems. Numerical simulations validate the effectiveness of the proposed approach.
|
| |
| 16:00-16:15, Paper FrC14.3 | Add to My Program |
| LEAP-O: Learning to Predict Dynamic Obstacles for Safe Trajectory Planning |
|
| Nguyen, Binh | University of Central Florida |
| Nghiem, Truong X. | University of Central Florida |
| Nguyen, Linh | Federation University Australia |
| La, Hung | University of Nevada |
| Nguyen, Thang | Texas A&M University-Corpus Christi |
Keywords: Machine learning, Autonomous robots, Optimization
Abstract: Trajectory planning plays a crucial role in autonomous driving and navigation by enabling robots to generate safe paths while minimizing travel costs and avoiding collisions. This paper addresses the issue of predicting dynamic obstacles for safe trajectory planning when prior information is unavailable and detection range is limited. We propose a learning framework using Gaussian Processes (GP) for motion prediction and uncertainty estimation, further enhanced by Recurrent Neural Networks (RNN) for more accurate predictions. In addition, we develop a receding horizon planning method, formulated as a stochastic optimization problem, to ensure safe, collision-free paths with confidence probabilities. Together, these contributions provide a robust framework for adaptive and safe trajectory generation in dynamic environments. Simulations were performed to demonstrate the effectiveness of the proposed strategy, where our approach (combining GP and RNN) outperformed a baseline method that utilized only GP.
|
| |
| 16:15-16:30, Paper FrC14.4 | Add to My Program |
| Plug-And-Play Design of Machine Learning-Based Model Predictive Controllers for Nonlinear Systems |
|
| Ji, Yuxiao | National University of Singapore |
| Shi, Yao | National University of Singapore |
| Wu, Zhe | National University of Singapore |
Keywords: Machine learning, Chemical process control, Predictive control for nonlinear systems
Abstract: This work presents a plug-and-play (PnP) machine learning-based model predictive control (ML-MPC) framework for interconnected nonlinear systems. Assuming that each subsystem is input-to-state stable (ISS), the stability of the entire system is ensured via the small-gain theorem. Sufficient conditions are derived to guarantee stability when ML models replace existing controllers. The effectiveness of the framework is demonstrated through a nonlinear chemical process network.
|
| |
| 16:30-16:45, Paper FrC14.5 | Add to My Program |
| PIML-RHC As a Control Method: A Physics-Informed Transformer Policy for Real-Time Eco-Driving |
|
| Dos Santos, Arnon Bruno Ventrilho | Universidade Federal Do Parana |
| Zanata Alves, Marco Antonio | Universidade Federal Do Parana |
| Soares de Oliveira, Luiz Eduardo | Universidade Federal Do Parana |
Keywords: Machine learning, Predictive control for nonlinear systems, Automotive control
Abstract: We present PIML-RHC, a physics-informed receding-horizon controller in which a Transformer policy is trained end-to-end through a differentiable vehicle model to jointly decide powertrain actions (torque/brake/gear) while enforcing an rpm envelope over a preview horizon. The policy distills a long-horizon, reference-conditioned objective into a single, non-iterative forward pass. Nonlinear Model Predictive Control (NMPC) with mixed discrete-continuous decisions (gears, torque, brake) is a principled baseline but often relies on iterative solvers, which exhibit iteration-dependent compute and potential variability in worst-case execution time (WCET); by contrast, PIML-RHC has bounded per-step compute due to its fixed-depth network. Across 50 tracks (60 steps) from production-like distributions, PIML-RHC achieves competitive or superior efficiency and tracking to NMPC variants while consistently improving drivability (torque smoothness). A hardware-in-the-loop validation on an embedded Raspberry Pi 4B shows median end-to-end loop time 72.7 ms, compatible with supervisory eco-driving cycles. We also discuss WCET, the absence of optimization-time guarantees, and safety mechanisms (e.g., CBF supervisors) required for deployment.
|
| |
| 16:45-17:00, Paper FrC14.6 | Add to My Program |
| Reinforcement Learning-Based Path Planning and Obstacle Avoidance Using PPO and B-Splines in Unknown Environments |
|
| Shokouhi, Shahab | University of New Hampshire |
| Oruc, Oguzhan | Istanbul Technical University |
| Thein, May-Win | University of New Hampshire |
Keywords: Reinforcement learning, Learning, Neural networks
Abstract: This paper introduces Smart B-Splines (SmartBSP), an advanced reinforcement learning framework for real-time path planning and obstacle avoidance in autonomous robotics navigating through complex environments. The proposed system integrates Proximal Policy Optimization (PPO) and Actor-Critic architecture to process limited LiDAR inputs and compute spatial decision-making probabilities. The robotic vehicle's perceptual field is discretized into a grid format and then analyzed to produce a spatial probability distribution. During the training process a nuanced cost function is minimized that accounts for path curvature, endpoint proximity, and obstacle avoidance. Simulations results in different scenarios validate the algorithm's resilience and adaptability across diverse operational scenarios. Subsequently, Real-time experiments are carried out to assess the efficacy of the proposed algorithm. The experimental results show that the proposed algorithm is successful in different scenarios while maintaining path smoothness.
|
| |
| FrC15 Regular Session, Grand Salon 22 |
Add to My Program |
| Stability of Nonlinear Systems II |
|
| |
| Chair: Lopez, Brett | University of California - Los Angeles |
| Co-Chair: Komaee, Arash | Southern Illinois University |
| |
| 15:30-15:45, Paper FrC15.1 | Add to My Program |
| Global Asymptotic Stability Certificates for Discrete-Time Lur'e Systems under Incremental-Like Restrictions |
|
| Montana, Gioia | Sapienza Universitŕ Di Roma |
| Cristofaro, Andrea | Sapienza University of Rome |
| Mattioni, Mattia | Universitŕ Degli Studi Di Roma La Sapienza |
| Valmorbida, Giorgio | L2S, CentraleSupelec |
Keywords: Stability of nonlinear systems, Lyapunov methods
Abstract: In this paper, we provide Lyapunov-based conditions for assessing the global asymptotic stability of nonlinear discrete-time Lur'e systems under incremental-like restrictions. The proposed results invoke dissipation arguments for the linear and nonlinear components of the dynamics and rely on a Lyapunov function with a few parameters and integral terms. The proof of the main result does not require any approximations of the integral terms. The impact of the result is highlighted through a comparison with the literature.
|
| |
| 15:45-16:00, Paper FrC15.2 | Add to My Program |
| Accumulating Magnetic Nanorods in a Stable Magnetic Trap by Exploiting Their Rotational Dynamics |
|
| Cantrell, Denae | University of California, Berkeley |
| Komaee, Arash | Southern Illinois University |
Keywords: Stability of nonlinear systems, MEMs and Nano systems, Biomedical
Abstract: Magnetic drug delivery aims to accumulate a swarm of magnetized therapeutic nanoparticles at sites of disease, but it is impossible by the very nature of magnetic fields to create a stable equilibrium point inside them in order to trap these particles. When spherical nanoparticles are exposed to a magnetic field, they immediately align with that field, causing them to move towards the source of the field, rather than a targeted site of disease. Magnetic nanorods, on the other hand, exhibit a brief rotational period during which they could be repelled by the magnetic field towards the targeted site. Previous experimental work has successfully exploited the transient rotational dynamics of magnetic nanorods to concentrate them in the vicinity of a targeted point. To develop this purely experimental work into an operational drug delivery system, a mathematical model is necessary to quantitatively describe the experiment. This paper presents such a model, verified by computer simulations and experiment.
|
| |
| 16:00-16:15, Paper FrC15.3 | Add to My Program |
| Robust Data-Driven Invariant Sets for Nonlinear Systems |
|
| Kiani, Sahand | Pennsylvania State University |
| Lagoa, Constantino M. | Pennsylvania State Univ |
Keywords: Stability of nonlinear systems, Optimization, Learning
Abstract: The synthesis of robust invariant sets for nonlinear systems has traditionally been hindered by the inherent non-convexity and a strict reliance on exact analytical models. This paper presents a purely data-driven framework to compute robust polytopic contractive sets for unknown nonlinear systems operating under persistent bounded process noise and state-input constraints. Rather than attempting to identify a single, potentially nominal model, we utilize a finite data set to construct a polytopic consistency set—a rigorous geometric boundary encapsulating all possible system dynamics compatible with the noisy measurements. The core contribution of this work extends an established sufficient condition for λ-contractiveness into the data-driven setting. Crucially, we prove that enforcing this condition strictly over the vertices of the consistency set guarantees robust invariance.
|
| |
| 16:15-16:30, Paper FrC15.4 | Add to My Program |
| Analysis of the Geometric Heat Flow Equation: Computing Geodesics in Real-Time with Convergence Guarantees |
|
| Gessow, Samuel | University of California, Los Angeles |
| Lopez, Brett | University of California - Los Angeles |
Keywords: Stability of nonlinear systems, Optimization, Numerical algorithms
Abstract: We present an analysis on the convergence properties of the so-called geometric heat flow equation for computing geodesics (extremal curves) on Riemannian manifolds. Computing geodesics numerically in real time has become an important capability across several fields, including control and motion planning. The geometric heat flow equation involves solving a parabolic partial differential equation whose solution is a geodesic. In practice, solving this PDE numerically can be done efficiently, and tends to be more numerically stable and exhibit a better rate of convergence compared to numerical optimization. We prove that the geometric heat flow equation is exponentially stable in L_2 if the curvature of the Riemannian manifold does not exceed a positive bound and that asymptotic convergence in L_2 is always guaranteed. We also present a pseudospectral method that leverages Chebyshev polynomials to accurately compute geodesics in only a few milliseconds for non-contrived manifolds. Our analysis was verified with our custom pseudospectral method by computing geodesics on common non-Euclidean surfaces, and in feedback for a contraction-based controller with a non-flat metric for a nonlinear system.
|
| |
| 16:30-16:45, Paper FrC15.5 | Add to My Program |
| A Numerical Investigation of the Domain of Attraction of the Kapitza Pendulum Using Harmonic and Nonharmonic Base Excitation |
|
| Islam, Syed Aseem Ul | University of Michigan |
| Kouba, Omran | Higher Institute for Applied Sciences and Technology |
| Portella Delgado, Jhon Manuel | University of Michigan |
| Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Stability of nonlinear systems, Simulation
Abstract: Harmonic, vertical base excitation can stabilize the inverted pendulum; this application of vibrational control is commonly known as the Kapitza pendulum. In addition to being visually surprising, this phenomenon is remarkable due to the fact that stabilization is achieved by open-loop control and thus without the need for sensing or real-time computing. The present paper contributes to the study of the Kapitza pendulum by exploring the limitations of vibrational control. In particular, for harmonic base excitation, we estimate the domain of attraction of the Kapitza pendulum over the range of stabilizing parameters. We then determine the ability of nonharmonic base excitation to extend the domain of attraction. Both investigations depend on numerical simulation and optimization.
|
| |
| FrC16 Regular Session, Grand Salon 24 |
Add to My Program |
| Iterative Learning Control |
|
| |
| Chair: Rogers, Eric | University of Southampton |
| Co-Chair: Hashimoto, Kazumune | Osaka University |
| |
| 15:30-15:45, Paper FrC16.1 | Add to My Program |
| MM-LMPC: Multi-Modal Learning Model Predictive Control Via Bandit-Based Mode Selection |
|
| Hashimoto, Wataru | Osaka University |
| Hashimoto, Kazumune | Osaka University |
Keywords: Iterative learning control, Predictive control for nonlinear systems, Optimal control
Abstract: Learning Model Predictive Control (LMPC) improves performance on iterative tasks by leveraging data from previous executions. However, in tasks with multiple possible solution routes, LMPC heavily depends on the initial trajectories: states with high cost-to-go are rarely selected as terminal candidates in later iterations, leaving parts of the state space unexplored and potentially missing better solutions. This challenge arises because LMPC constructs a sampled safe set from past trajectories and uses it as a terminal constraint, with a terminal cost given by the corresponding cost-to-go. To overcome this limitation, we propose Multi-Modal LMPC (MM-LMPC), which clusters past trajectories into modes and maintains mode-specific terminal sets and value functions. A bandit-based meta-controller with a Lower Confidence Bound (LCB) policy balances exploration and exploitation across modes, enabling systematic refinement across multiple modes. MM-LMPC preserves the standard LMPC properties of recursive feasibility and closed-loop stability, while additionally guaranteeing asymptotic convergence to the best mode and a logarithmic regret bound. Simulations on obstacle-avoidance tasks show the effectiveness of the proposed method.
|
| |
| 15:45-16:00, Paper FrC16.2 | Add to My Program |
| Accelerated Iterative Learning Control of Stochastic Discrete Linear Systems with Switching of the Dynamics and Reference Trajectory |
|
| Pakshin, Pavel | Arzamas Polytechnic Institute of R.E. Alekseev Nizhny Novgorod STU |
| Emelianova, Julia | Arzamas Polytechnic Institute of R.E. Alekseev NizhnyNovgorod State Technical University |
| Rogers, Eric | University of Southampton |
|
|
| |
| 16:00-16:15, Paper FrC16.3 | Add to My Program |
| Event-Triggered Learning Robust MPC for Unknown Time-Varying ARX Systems Using Input-Output Data |
|
| Deng, Li | University of Alberta |
| Shu, Zhan | University of Alberta |
| Chen, Tongwen | University of Alberta |
Keywords: Iterative learning control, Time-varying systems, Robust control
Abstract: An event-triggered learning robust model predictive control (MPC) framework is proposed for unknown time-varying autoregressive exogenous (ARX) systems where all time-varying matrices are approximated by a polytope representation. An event-triggered learning scheme that combines model estimation with polytope learning is developed, which reduces the frequency of learning updates while ensuring convergence. Using the learned polytope, a robust MPC controller based on an augmented state-space model is designed to guarantee a mixed input-output constraint and minimize the upper bound of an infinite-horizon cost function. To capture the connection between consecutive learned polytopes, a matching error is introduced and thus input-to-state stability is analyzed. A numerical example is provided to demonstrate the effectiveness of the proposed approach.
|
| |
| 16:15-16:30, Paper FrC16.4 | Add to My Program |
| A Simple Structure Model-Free ILC Scheme for Nonlinear Batch Processes Designed within a Repetitive Process Setting |
|
| Paszke, Wojciech | University of Zielona Gora |
| Hao, Shoulin | Dalian University of Technology |
| Liu, Tao | Dalian University of Technology |
| Tao, Hongfeng | Jiangnan University |
| Rogers, Eric | University of Southampton |
Keywords: Iterative learning control
Abstract: This paper develops new results on data-driven iterative learning control for nonlinear batch processes. The dynamic linearization approach is used to obtain linearized local dynamical models utilizing only the collected process input and output data. As a result, no dynamic structure of the nonlinear model is required for the control design. Additionally, the design problem is formulated within the repetitive process framework, which simplifies the design procedure, facilitates the integrated synthesis of feedback and learning controllers, and aids in the adjustment of control parameters. The convergence of the new data-driven control method is demonstrated by the stability of the resulting repetitive process, which ensures that the tracking error decreases along both the time and iteration (batch) axes. Stability properties can be effectively checked using linear matrix inequality techniques. A numerical example is included to highlight the application of the new results.
|
| |
| 16:30-16:45, Paper FrC16.5 | Add to My Program |
| Koopman-Based Sliding Mode Control for Data-Driven Stabilization of Nonlinear Systems |
|
| Labbadi, Moussa | Bretagne INP |
| Zhong, Zhengang | University of Warwick |
Keywords: Iterative learning control
Abstract: This paper presents a data-driven method for the design of stabilizing controllers for nonlinear systems with exponential stability guarantees. The approach leverages Koopman operator theory to obtain a bilinear representation of the system dynamics directly from measurement data, while explicitly accounting for approximation errors arising from the use of finitely many data samples. On top of this lifted representation, a sliding mode control (SMC) strategy is developed to robustly stabilize the system and compensate for residual modeling error. The proposed framework focuses on bilinear single-input systems, for which rigorous stability guarantees are provided. Furthermore, the methodology naturally extends to multi-input systems, opening the way for broader applications in data-driven robust control.
|
| |
| 16:45-17:00, Paper FrC16.6 | Add to My Program |
| A Repetitive Learning Model Predictive Control Method for Nonlinear Systems with Application to Roll-To-Roll Manufacturing |
|
| Martin, Christopher | University of Texas at Austin |
| Li, Shihao | The University of Texas at Austin |
| Li, Jiachen | University of Texas at Austin |
| Li, Wei | University of Texas at Austin |
| Chen, Dongmei | The University of Texas at Austin |
Keywords: Manufacturing systems, Iterative learning control, Predictive control for nonlinear systems
Abstract: Roll-to-roll (R2R) mechanical dry transfer is a manufacturing process that enables high-throughput and environmentally friendly fabrication of advanced thin-film devices. Achieving high-quality transfer requires precise tension control, which is challenging due to nonlinear peeling dynamics and input constraints. Additionally, since devices are often transferred repetitively in continuous production lines, this repetition can be leveraged to improve control. To this end, this study presents a repetitive learning model predictive control (RLMPC) strategy. To account for repetitive disturbances and model mismatch, the method uses measurements from past periods and an internal linear time-varying (LTV) model to solve a receding horizon optimal control problem at each timestep. The LTV sensitivities are updated online using a nonlinear process model, enabling full utilization of the nonlinear model in a computationally efficient manner. A case study applying the RLMPC method to R2R mechanical dry transfer is presented, representing the first application of RLMPC to an R2R system. The approach significantly improved the web tension tracking performance across successive cycles, achieving a 64% reduction in root mean squared error compared to a baseline linear MPC. These results indicate that the proposed method has the potential to deliver superior performance in industrial applications.
|
| |
| FrC17 Regular Session, Churchill A1 |
Add to My Program |
| Stochastic Optimal Control II |
|
| |
| Chair: Bakolas, Efstathios | The University of Texas at Austin |
| Co-Chair: Wan, Yan | University of Texas at Arlington |
| |
| 15:30-15:45, Paper FrC17.1 | Add to My Program |
| Multi-Model Covariance Steering for Continuous-Time Stochastic Linear Systems |
|
| Bakolas, Efstathios | The University of Texas at Austin |
Keywords: Stochastic optimal control, Stochastic systems, Uncertain systems
Abstract: In this paper, we consider the finite-horizon covariance steering problem for continuous-time stochastic linear systems subject to white noise in the special case in which the (uncertain) system dynamics is unknown but belongs necessarily to a finite collection of known models. We refer to the latter problem as the multi-model covariance steering (MMCS) problem. Because a common controller that would steer the state covariance of each possible system realization to the same positive definite matrix may likely not exist in general, we consider a different problem formulation that is based on the a posteriori interpretation of the standard covariance steering problem as a Linear Quadratic Gaussian (LQG) control problem with a special terminal cost. This interpretation allows us to formulate the multi-model covariance steering problem as a minimax (robust) stochastic optimal control problem that can be addressed by means of the Robust Maximum Principle. In particular, we show that the feedback control law that solves the MMCS problem can be fully characterized by (1) the solution to an LQG-type problem for a system with an augmented state space and (2) a tractable, yet non-convex, optimization problem. Simulations results that illustrate some key ideas of this paper are also included.
|
| |
| 15:45-16:00, Paper FrC17.2 | Add to My Program |
| Fine-Tuning Diffusion Models Via Stochastic Control: Entropy Regularization and Beyond |
|
| Tang, Wenpin | Columbia University |
| Zhou, Fuzhong | Columbia University |
Keywords: Stochastic optimal control, Reinforcement learning, Machine learning
Abstract: This paper aims to develop and provide a rigorous treatment to entropy regularized fine-tuning in the context of continuous-time diffusion models, which was proposed by Uehara et al. (arXiv:2402.15194). The idea is to use stochastic control for sample generation, where the entropy regularizer is introduced to mitigate reward collapse. We also show how the analysis can be extended to fine-tuning with a general f -divergence regularizer. Numerical experiments on large-scale text-to-image models – Stable Diffusion v1.5 are conducted to validate our approach.
|
| |
| 16:00-16:15, Paper FrC17.3 | Add to My Program |
| Belief-Based Reinforcement Learning for Asymmetric Partially Observable Zero-Sum Games |
|
| Hosseini, Seyed Hamid | Northeastern University |
| Kazeminajafabadi, Armita | Northeastern University |
| Kamara, Amidu | U.S. Department of Homeland Security |
| Imani, Mahdi | Northeastern University |
Keywords: Stochastic systems, Markov processes, Reinforcement learning
Abstract: Adversarial decision-making under asymmetric partial observability arises across domains such as cybersecurity, autonomous maneuvering, and resource allocation. Existing approaches often rely on recurrent encoders or centralized critics, but these lack equilibrium guarantees and degrade under observation noise or information asymmetry. We develop a belief-space game-theoretic framework in which each player maintains a local belief vector, a posterior distribution over hidden states conditioned on their own observations. We show that these beliefs serve as sufficient statistics for optimal play, enabling partially observable interactions to be recast as a two-player zero-sum Markov game over belief pairs. We characterize Nash equilibria in this belief game and propose a belief-based actor–critic method that approximates these strategies at scale using decentralized belief-conditioned actors and a centralized critic active only during training. Experiments on network-based adversarial scenarios demonstrate significant gains over state-of-the-art baselines, with advantages increasing under higher noise and stronger asymmetry.
|
| |
| 16:15-16:30, Paper FrC17.4 | Add to My Program |
| Probabilistic Modeling of CGM-Derived Daily Minimum Glucose in Type 2 Diabetes: Hierarchical vs. Non-Hierarchical Bayesian Approaches |
|
| Ganji, Mohammadreza | University of Virginia |
| El Fathi, Anas | University of Virginia |
| Fabris, Chiara | University of Virginia |
Keywords: Uncertain systems, Stochastic optimal control, Identification for control
Abstract: Type 2 diabetes is a chronic metabolic disorder characterized by insulin resistance and progressive beta-cell dysfunction. After lifestyle modifications and non-insulin antidiabetic drugs, long-acting (basal) insulin is the therapy of choice. However, due to disease heterogeneity, treatment requires initial titration to escalate doses to the needed amount and frequent re-calibration due to the time-varying metabolic system. Historically, insulin dose titration and adaptation have relied on fasting plasma glucose measured by self-monitoring of blood glucose, but with continuous glucose monitoring (CGM) becoming standard of care, data streams with larger information content are available. In this study, CGM-derived daily minimum glucose (DMG) is examined as an informative metric for basal insulin titration and adaptation, motivated by its direct relationship with clinically defined hypoglycemia events. The goal is to introduce a probabilistic model of DMG conditioned on insulin input, and to compare two Bayesian modeling paradigms for DMG prediction: hierarchical and non-hierarchical. These paradigms reflect different assumptions about patient independence versus shared population structure, and their impact on parameter estimates and uncertainty quantification was systematically evaluated. On test data, the hierarchical model achieved a posterior predictive RMSE of 29.43 ± 0.84 and the non-hierarchical model 30.50 ± 1.37 (mean ± SD). The proposed framework yields uncertainty-aware models of DMG, enabling probabilistic inference and providing a foundation for stochastic control in basal insulin titration and adaptation.
|
| |
| 16:30-16:45, Paper FrC17.5 | Add to My Program |
| MPCM–Taguchi for Decision-Making in Uncertain Systems with Application to DC Microgrids |
|
| Zhou, Siyu | University of Texas at Arlington |
| Wan, Yan | University of Texas at Arlington |
| Koru, Ahmet Taha | University of Texas at Arlington |
Keywords: Uncertain systems, Stochastic optimal control, Large-scale systems
Abstract: Decision-making for systems with high-dimensional uncertain parameters is critical to maintaining system stability and reliable operation. This paper develops an efficient decision-making method for high-dimensional uncertain systems by leveraging the Multivariate Probabilistic Collocation Method (MPCM)-Taguchi method. Based on the MPCM-Taguchi framework, an integral reinforcement learning (IRL) control algorithm is developed. The proposed method is applied to power buffer systems in DC microgrids with uncertain loads. Comparative studies demonstrate the effectiveness and practical value of the approach for real-time decision-making in microgrid systems.
|
| |
| 16:45-17:00, Paper FrC17.6 | Add to My Program |
| Discrete-Time Linear Quadratic Stochastic Control with Equality-Constrained Inputs: Application to Energy Demand Response |
|
| Seugnet, Léo | Polytechnique Montréal |
| Gao, Shuang | Polytechnique Montreal |
Keywords: Stochastic optimal control, Constrained control, Smart grid
Abstract: We investigate the discrete-time stochastic linear quadratic control problem for a population of cooperative agents under the hard equality constraint on total control inputs, motivated by demand response in renewable energy systems. We establish the optimal solution that respects hard equality constraints for systems with additive noise in the dynamics. The optimal control law is derived using dynamic programming and Karush-Kuhn-Tucker (KKT) conditions, and the resulting control solution depends on a discrete-time Riccati-like recursive equation. Application examples of coordinating the charging of a network of residential batteries to absorb excess solar power generation are demonstrated, and the proposed control is shown to achieve exact power tracking while considering individual State-of-Charge (SoC) objectives.
|
| |
| FrC18 Regular Session, Churchill A2 |
Add to My Program |
| Fault Tolerant Systems II |
|
| |
| Chair: Chung, Soon-Jo | California Institute of Technology |
| Co-Chair: Bodson, Marc | Univ. of Utah |
| |
| 15:30-15:45, Paper FrC18.1 | Add to My Program |
| Control Structure Screening for Robust Isolation of Controller-Actuator False Data Injection Attacks |
|
| Gajjar, Aatam | University of California, Davis |
| El-Farra, Nael H. | University of California, Davis |
| Ellis, Matthew | University of California, Davis |
Keywords: Chemical process control, Process Control, Fault diagnosis
Abstract: In this work, we present a screening methodology that incorporates robust cyberattack isolation as a key criterion in the selection of control structures for dynamic processes. We focus on controller–actuator link attacks and on isolation schemes based on a bank of unknown-input observers with dedicated residuals. For this class of schemes, we characterize how the control system structure determines the ability to isolate attacks not only from one another, but also from persistent process disturbances. We introduce the robust attack isolation metric (RAIM), which measures the extent to which attack isolation can be guaranteed despite disturbances. Building on RAIM, we develop a systematic screening algorithm to guide the selection of control structures with the greatest potential for robust attack isolation. We demonstrate the effectiveness of the methodology by applying it to a chemical process to illustrate the attack isolation capabilities of a robustly cyberattack-aware control structure.
|
| |
| 15:45-16:00, Paper FrC18.2 | Add to My Program |
| Failure-Tolerant Data-Driven Control for Discrete-Time Linear Systems |
|
| Lu, Shuaizheng | Augusta University |
| Liao, Jiaping | Augusta University |
| Mo, Zihao | Augusta University |
| Xiang, Weiming | Augusta University |
Keywords: Fault tolerant systems, Linear systems, Robust control
Abstract: This paper presents a novel direct data-driven failure-tolerant control methodology for discrete-time linear time-invariant (LTI) systems that are subject to controller failure. Our approach leverages system data directly, obviating the need for an explicit system model. By applying time-varying piecewise quadratic Lyapunov function (TPQLF), the proposed methodology formulates the controller design as a semi-definite programming (SDP) problem, enabling exponential stabilization directly from collected system data. The effectiveness of the proposed approach is demonstrated through a numerical example involving a networked system experiencing packet drop-offs, showing its value for enhancing reliability and safety in variable conditions.
|
| |
| 16:00-16:15, Paper FrC18.3 | Add to My Program |
| Closing the Loop Inside Neural Networks: Causality-Guided Layer Adaptation for Fault Recovery Control |
|
| Taheri, Mahdi | California Institute of Technology (Caltech) |
| Chung, Soon-Jo | California Institute of Technology |
| Hadaegh, Fred Y. | California Inst. of Tech |
Keywords: Fault tolerant systems, Neural networks, Adaptive control
Abstract: This paper studies the problem of real-time fault recovery control for nonlinear control-affine systems subject to actuator loss of effectiveness faults and external disturbances. We develop a two-stage framework that combines causal inference with selective online adaptation to achieve an effective learning-based recovery control method. In the offline phase, we introduce a causal layer attribution technique based on the average causal effect (ACE) to evaluate the relative importance of each layer in a pretrained deep neural network (DNN) controller compensating for faults. This provides a principled approach to select the most causally influential layer for fault recovery control in the sense of ACE, and goes beyond the widely used last-layer adaptation approach. In the online phase, we deploy a Lyapunov-based gradient update to adapt only the ACE-selected layer to circumvent the need for full-network or last-layer only updates. The proposed adaptive controller guarantees uniform ultimate boundedness (UUB) with exponential convergence of the closed-loop system in the presence of actuator faults and external disturbances. Compared to conventional adaptive DNN controllers with full-network adaptation, our methodology has a reduced computational overhead in the online phase. To demonstrate the effectiveness of our proposed methodology, a case study is provided on a 3-axis attitude control system of a spacecraft with four reaction wheels.
|
| |
| 16:15-16:30, Paper FrC18.4 | Add to My Program |
| Fault Tolerant Torque/Speed Control of a Doubly-Fed Induction Machine |
|
| Hossain, Md Abid | University of Utah |
| Bodson, Marc | Univ. of Utah |
Keywords: Electrical machine control, Fault tolerant systems, PID control
Abstract: This paper proposes a control algorithm to perform torque control for the doubly-fed induction machine in the presence of inverter faults. With an open or short-circuited inverter switch, performing torque control using the remaining healthy phases of motors like permanent magnet (PM) and cage-rotor induction motors (IM) is impossible without additional switches or undesirable torque ripples. The objective of the paper is to demonstrate interesting fault-tolerance options available for doubly fed induction machines (DFIM) when operated using half-sized converters on the stator and rotor windings. In contrast with existing methods that reconfigure the DFIM as an induction machine by short-circuiting the stator or rotor windings or by including more complicated circuitry to detach the faulted part, this paper describes the operation without additional circuit components. With the proposed control method, the DFIM draws controlled current from the faulted inverter, leading to higher torque generation capability. The same strategy can be applied to single-phase open-circuit faults with minor modifications. The maximum torque per ampere feature is considered and integrated into the algorithm. Tests are performed on an experimental testbed to validate the proposed control algorithm.
|
| |
| 16:30-16:45, Paper FrC18.5 | Add to My Program |
| Fault-Tolerant Adaptive Switching Control of Deformable Mirrors under Actuator Failures and Saturation Limits |
|
| Xu, Binyan | Univeristy of Guelph |
| Aljanaideh, Khaled | American University of Sharjah |
| Goy, Matthias | Fraunhofer Institute for Applied Optics and Precision Engineering |
| Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics, Mechanical systems/robotics
Abstract: This paper presents an adaptive switching fault-tolerant control (FTC) framework for deformable mirrors to ensure reliable surface shape tracking under actuator faults and constraints. A finite-dimensional sensor–actuator model is established that operates directly in actuator–sensor coordinates while preserving both the resonant characteristics and static gain of the original mirror dynamics. The model is then decomposed into localized one-sensor, multiple-actuator subsystems, enabling decentralized control and reducing computational complexity. Each subsystem integrates an adaptive control law with an online switching mechanism that autonomously reconfigures actuator excitation subsets. Rather than relying on explicit fault detection, the switching is guided by a performance index that deactivates faulty or saturated actuators and reallocates effort to healthy ones. An adaptive disturbance compensation law further addresses modeling uncertainties and neglected actuator couplings. A Lyapunov-based analysis guarantees closed-loop stability for all subsystems and the overall DM system. Simulation results demonstrate that the proposed method achieves accurate wavefront shaping, efficient actuator utilization, and robust fault tolerance.
|
| |
| FrC19 Regular Session, Churchill B1 |
Add to My Program |
| Optimal Control V |
|
| |
| Chair: Yel, Esen | Rensselaer Polytechnic Institute |
| Co-Chair: Bin Mohaya, Turki | University of Michigan |
| |
| 15:30-15:45, Paper FrC19.1 | Add to My Program |
| Partial Attention in Deep Reinforcement Learning for Safe Multi-Agent Control |
|
| Bin Mohaya, Turki | University of Michigan, Ann Arbor |
| Seiler, Peter | University of Michigan, Ann Arbor |
Keywords: Traffic control, Multivehicle systems, Robotics
Abstract: Attention mechanisms excel at learning sequential patterns by discriminating data based on relevance and importance. This provides state-of-the-art performance in today’s advanced generative artificial intelligence models. This paper applies this concept of an attention mechanism for multi-agent safe control. We specifically consider the design of a neural network to control autonomous vehicles in a highway merging scenario. The environment is modeled as a Decentralized Partially Observable Markov Decision Process (Dec-POMDP). Within a QMIX framework, we include partial attention for each autonomous vehicle, thus allowing each ego vehicle to focus on the most relevant neighboring vehicles. Moreover, we propose a comprehensive reward signal that considers the environment’s global objectives (e.g., safety and vehicle flow) and the individual interests of each agent. Simulations are conducted in the Simulation of Urban Mobility (SUMO). The results show better performance compared to other driving algorithms in terms of safety, driving speed, and reward.
|
| |
| 15:45-16:00, Paper FrC19.2 | Add to My Program |
| Event-Based Control Via Sparsity-Promoting Regularization: A Rollout Approach with Performance Guarantees |
|
| Nishida, Shumpei | Ritsumeikan University |
| Okano, Kunihisa | Ritsumeikan University |
Keywords: Optimal control, Optimization, Networked control systems
Abstract: This paper presents a controller design framework aiming to balance control performance and actuation rate. Control performance is evaluated by an infinite-horizon average cost, and the number of control actions is penalized via sparsity-promoting regularization. Since the formulated optimal control problem has a combinatorial nature, we employ a rollout algorithm to obtain a tractable suboptimal solution. In the proposed scheme, actuation timings are determined through a multistage minimization procedure based on a receding-horizon approach, and the corresponding control inputs are computed online. We establish theoretical performance guarantees with respect to periodic control and prove the stability of the closed-loop system. The effectiveness of the proposed method is demonstrated through a numerical example.
|
| |
| 16:00-16:15, Paper FrC19.3 | Add to My Program |
| Planning Stealthy Backdoor Attacks in MDPs with Observation-Based Triggers |
|
| Wei, Xinyi | University of Florida |
| Han, Shuo | University of Illinois Chicago |
| Hemida, Ahmed (Ahmed H. Anwar) | ARL |
| Kamhoua, Charles | U.S. Army Research Laboratory |
| Fu, Jie | University of Florida |
Keywords: Markov processes
Abstract: This paper investigates backdoor attack planning in stochastic control systems modeled as Markov Decision Processes (MDPs). A backdoor attack involves an adversary deploying a policy that performs well in the original MDP to pass testing, but behaves maliciously at runtime when combined with a trigger that perturbs system dynamics. We consider a sophisticated attacker capable of jointly optimizing the backdoor policy and its trigger using only a blackbox simulator. During execution, the attacker has access only to partial observations of the system state and is restricted to introduce small perturbations to the system’s transition dynamics. We formulate the attack planning problem as a constrained Markov game with an augmented state space and two players: Player 0 learns a backdoor policy that maximizes attack rewards when the trigger is active. However, when the trigger is inactive, the backdoor policy behaves near-optimally in the original MDP; Player 1 designs a finite-memory, observation-based trigger to activate the attack. We propose a switching gradient-based optimization algorithm to jointly solve for the backdoor policy and trigger. Experiments on a case study demonstrate the effectiveness of our method in achieving stealthy and successful backdoor attacks. how the attack performance varies under different parameters related to the stealthiness of the backdoor attack.
|
| |
| 16:15-16:30, Paper FrC19.4 | Add to My Program |
| Safe Output Feedback Approximate Dynamic Programming for Nonlinear Control Affine Systems |
|
| Mahmud, S M Nahid | Purdue University |
| Rai, Ayush | Purdue University |
| Mou, Shaoshuai | Purdue University |
Keywords: Nonlinear output feedback, Optimal control, Observers for nonlinear systems
Abstract: In this work, we study the state-constrained optimal control problem for nonlinear control-affine systems under output feedback. While Approximate Dynamic Programming (ADP) enables online feedback via approximate Bellman solutions, most safe-ADP methods assume full-state availability, which is often unrealistic in practice. Simply replacing unmeasured states with their estimates can violate safety. We propose estimation-aware Lyapunov–Control Barrier Functions (EA-LCBFs) that modify barrier conditions using explicit bounds on estimation error, yielding an invasive safety filter around a nominal ADP policy. The resulting online scheme preserves ADP performance, certifies forward invariance of the actual safe set, and operates only using output measurements. We demonstrate reliable constraint satisfaction without compromising the performance of ADP-based optimization on a representative nonlinear system.
|
| |
| 16:30-16:45, Paper FrC19.5 | Add to My Program |
| Optimizing Task Completion Time Updates Using POMDPs |
|
| Eddy, Duncan | Stanford University |
| Yel, Esen | Rensselaer Polytechnic Institute |
| Passmore, Emma Elizabeth | Stanford University |
| Egan, Niles | Stanford University |
| Armour, Grayson | Stanford University |
| Asmar, Dylan | Stanford University |
| Kochenderfer, Mykel | Stanford University |
Keywords: Control applications, Information technology systems, Uncertain systems
Abstract: Managing announced task completion times is a fundamental control problem in project management. While extensive research exists on estimating task durations and task scheduling, the problem of when and how to update completion times communicated to stakeholders remains understudied. Organizations must balance announcement accuracy against the costs of frequent timeline updates, which can erode stakeholder trust and trigger costly replanning. Despite the prevalence of this problem, current approaches rely on static predictions or ad-hoc policies that fail to account for the sequential nature of announcement management. In this paper, we formulate the task announcement problem as a Partially Observable Markov Decision Process (POMDP) where the control policy must decide when to update announced completion times based on noisy observations of true task completion. Since most state variables (current time and previous announcements) are fully observable, we leverage the Mixed Observability MDP (MOMDP) framework to enable more efficient policy optimization. Our reward structure captures the dual costs of announcement errors and update frequency, enabling synthesis of optimal announcement control policies. Using off-the-shelf solvers, we generate policies that act as feedback controllers, adaptively managing announcements based on belief state evolution. Simulation results demonstrate significant improvements in both accuracy and announcement stability compared to baseline strategies, achieving up to 75% reduction in unnecessary updates while maintaining or improving prediction accuracy.
|
| |
| 16:45-17:00, Paper FrC19.6 | Add to My Program |
| Complex-Valued Optimal Control for Three-Phase Voltage Source Converter Model Using Policy Iteration |
|
| Nagappan, Manoj | SRM Institute of Science and Technology |
| Murugesan, Sathishkumar | SRM Institute of Science and Technology |
| Tran, Quang Huy | National Cheng Kung University |
| Liu, Yen-Chen | National Cheng Kung University |
Keywords: Linear systems, Optimal control, Iterative learning control
Abstract: We develop a unified model-based policy iteration (PI) framework for determining the optimal control policy of complex-valued (CV) linear continuous-time (CT) systems. Using the properties of CR-calculus and the value function for the CV system, we first derive the CV algebraic Riccati equation (CVARE) and the corresponding control policy. Afterwards, we develop a unified PI technique based on CV policy evaluation and policy update steps that iteratively generate a control policy, which can also be used for the real-valued case. We demonstrate convergence of the proposed algorithm toward the optimal control solution, and we illustrate its effectiveness through the computation of the optimal control policy for a three-phase voltage source converter incorporating an LC output filter in its CV representation.
|
| |
| FrC20 Regular Session, Churchill B2 |
Add to My Program |
| Model Predictive Control IV |
|
| |
| Chair: Walker, Markus | Karlsruhe Institute of Technology (KIT) |
| Co-Chair: Nuculaj, Luke | Oakland University |
| |
| 15:30-15:45, Paper FrC20.1 | Add to My Program |
| A Model Predictive Control Approach to Optimal Point Tracking of Unknown Trajectories for Human Binocular Eye Rotation |
|
| Athukorallage, Bhagya | Williams College |
| Ghosh, Bijoy | Texas Tech University |
| Finlay, Mikaela | MIT |
| Platonov, Liza | Middlebury College |
| Bagga, Akkshansh | Williams College |
| Tobin, Benjamin | Williams College |
Keywords: Predictive control for nonlinear systems, Optimal control, Optimization
Abstract: This paper presents an optimal control framework for modeling binocular eye movements while tracking unknown trajectories. Building upon a Riemannian geometric formulation of eye dynamics, we extend prior monocular and binocular models to derive equations of motion subject to Listing’s Law and binocular coupling constraints. We formulate a model predictive control (MPC) approach with two variants: a deterministic controller based on linear extrapolation and a stochastic controller leveraging Gaussian Process regression for uncertainty-aware trajectory prediction. Numerical simulations across sinusoidal, noisy sinusoidal, and random walk trajectories demonstrate that stochastic MPC significantly improves tracking robustness in uncertain or noisy conditions, achieving up to 94% reduction in tracking error compared to deterministic MPC.
|
| |
| 15:45-16:00, Paper FrC20.2 | Add to My Program |
| Sample-Efficient and Smooth Cross-Entropy Method Model Predictive Control Using Deterministic Samples |
|
| Walker, Markus | Karlsruhe Institute of Technology (KIT) |
| Frisch, Daniel | Karlsruhe Institute of Technology (KIT) |
| Hanebeck, Uwe D. | Karlsruhe Institute of Technology (KIT) |
Keywords: Predictive control for nonlinear systems, Optimal control, Optimization algorithms
Abstract: Cross-entropy method model predictive control (CEM—MPC) is a powerful gradient-free technique for non-linear optimal control, but its performance is often limited by the reliance on random sampling. This conventional approach can lead to inefficient exploration of the solution space and non-smooth control inputs, requiring a large number of samples to achieve satisfactory results. To address these limitations, we propose deterministic sampling CEM (dsCEM), a novel framework that replaces the random sampling step with deterministic samples derived from localized cumulative distributions (LCDs). Our approach introduces modular schemes to generate and adapt these sample sets, incorporating temporal correlations to ensure smooth control trajectories. This method can be used as a drop-in replacement for the sampling step in existing CEM-based controllers. Experimental evaluations on two nonlinear control tasks demonstrate that dsCEM consistently outperforms state-of-the-art iCEM in terms of cumulative cost and control input smoothness, particularly in the critical low-sample regime.
|
| |
| 16:00-16:15, Paper FrC20.3 | Add to My Program |
| Ensemble Kalman Inversion for Constrained Nonlinear MPC: An ADMM-Splitting Approach |
|
| Khalil, Ahmed | The University of Texas at Austin |
| Safwat, Mohamed | University of Washington, Seattle |
| Bakolas, Efstathios | The University of Texas at Austin |
Keywords: Predictive control for nonlinear systems, Optimization algorithms, Optimal control
Abstract: This work proposes a novel Alternating Direction Method of Multipliers (ADMM)-based Ensemble Kalman Inversion (EKI) algorithm for solving constrained nonlinear model predictive control (NMPC) problems. First, stage-wise nonlinear inequality constraints in the NMPC problem are embedded via an augmented Lagrangian with nonnegative slack variables. We then show that the resulting unconstrained augmented-Lagrangian primal subproblem admits a Bayesian interpretation: under independent Gaussian virtual observations, its minimizers coincide with MAP estimators, enabling solution via EKI. However, since the nonnegativity constraint on the slacks is a hard constraint not naturally encoded by a Gaussian model, our proposed algorithm yields a two-block ADMM scheme that alternates between (i) an inexact primal step that minimizes the augmented-Lagrangian objective (implemented via EKI rollouts), (ii) a nonnegativity projection for the slacks, and (iii) a dual ascent step. To balance exploration and convergence, an annealing schedule tempers sampling covariances while a penalty schedule increases constraint enforcement over outer iterations, encouraging global search early and precise constraint satisfaction later. We evaluate the proposed controller on a 6-DOF UR5e manipulation benchmark in MuJoCo, comparing it against DIAL-MPC (an iterative MPPI variant) as the arm traverses a cluttered tabletop environment.
|
| |
| 16:15-16:30, Paper FrC20.4 | Add to My Program |
| Personalized MPC for Autonomous Vehicle Lateral Motion Control Via Inverse Reinforcement Learning (I) |
|
| Zhou, Zhaodong | Oakland University |
| Tao, Mingyuan | Isuzu Technical Center of America, Inc |
| Qiu, Jiayi | Isuzu Technical Center of America, Inc |
| Zhang, Peng | Isuzu Technical Center of America, Inc |
| Xu, Meng | Isuzu Technical Center of America, Inc |
| Sun, Yong | Isuzu Technical Center of America, Inc |
| Chen, Jun | Oakland University |
Keywords: Predictive control for nonlinear systems, Reinforcement learning, Automotive control
Abstract: This paper presents a personalized lane change control framework that combines the maximum entropy inverse reinforcement learning (MaxEnt IRL) with model predictive control (MPC). Instead of manually tuning the MPC cost weight, the proposed method learns a cost function from expert driving demonstrations using interpretable trajectory features. The learned weights are incorporated into the MPC formulation to generate personalized lane change trajectories that mimic the expert driving style. Simulation results in CARLA show that the IRL-trained MPC controller can reproduce distinct driving styles and matches expert behaviors in heading, lateral motion, and steering. Furthermore, real world on-road testing on an ISUZU truck demonstrated that the proposed controller generalizes well to unseen initial conditions. Overall, the proposed approach reduces manual calibration efforts, eliminates the need for explicit path planning, improves controller interpretability, and enables personalized driver behavior replication.
|
| |
| 16:30-16:45, Paper FrC20.5 | Add to My Program |
| Model Predictive Control with High-Probability Safety Guarantee for Nonlinear Stochastic Systems |
|
| Liu, Zishun | Georgia Institute of Technology |
| Ma, Liqian | Georgia Institute of Technology |
| Chen, Yongxin | Georgia Institute of Technology |
Keywords: Predictive control for nonlinear systems, Stochastic systems, Stochastic optimal control
Abstract: We present a model predictive control (MPC) framework for nonlinear stochastic systems that ensures safety guarantee with high probability. Unlike most existing stochastic MPC schemes, our method adopts a set-erosion that converts the probabilistic safety constraint into a tractable deterministic safety constraint on a smaller safe set over deterministic dynamics. As a result, our method is compatible with any off-the-shelf deterministic MPC algorithm. The key to the effectiveness of our method is a tight bound on the stochastic fluctuation of a stochastic trajectory around its nominal version. Our method is scalable and can guarantee safety with high probability level (e.g., 99.99%), making it particularly suitable for safety-critical applications involving complex nonlinear dynamics. Rigorous analysis is conducted to establish a theoretical safety guarantee, and numerical experiments are provided to validate the effectiveness of the proposed MPC method.
|
| |
| 16:45-17:00, Paper FrC20.6 | Add to My Program |
| The Elastic Model Predictive Safety Filter |
|
| Auricchio, Alberto | University of Bristol |
| Zhang, Kaiqiang | University of Bristol |
| Richards, Arthur | University of Bristol |
Keywords: Autonomous systems, Predictive control for nonlinear systems, Optimization
Abstract: Autonomous systems often operate in multiple operational modes, each with its own set of safety constraints. Safely transitioning between modes, for example, from a high-speed survey mode to a close-inspection mode, presents a significant challenge. This paper introduces the `Elastic' Model Predictive Safety Filter (EPSF), a novel control framework that enables provably safe, dynamic transitions between different operational modes. The EPSF extends the Predictive Safety Filter by incorporating a soft-constrained multimodal Model Predictive Control formulation. In transition between modes, the controller's cost function penalises deviations from the constraints of a desired target mode whilst strictly enforcing the constraints of the currently operational mode. This objective steers the system state towards a region where a transition is feasible. The convergence of slack variables to zero serves as an online, verifiable certificate that this transition region has been reached, guaranteeing recursive feasibility and asymptotic convergence. A convex implementation is provided for linear systems, and simplifications of the sufficient conditions for convergence are discussed. The effectiveness of the EPSF is demonstrated through numerical simulation of an unstable linear system and a nonlinear kinematic car model.
|
| |
| FrC21 Regular Session, Churchill C1 |
Add to My Program |
| Sliding Mode Control |
|
| |
| Chair: Tomsovic, Kevin | Clemson University |
| Co-Chair: Drakunov, Sergey V. | Embry-Riddle Aeronautical University |
| |
| 15:30-15:45, Paper FrC21.1 | Add to My Program |
| Feedforward and Sliding Mode Control of a Novel Precision 2-DoF Reluctance Motion System |
|
| Pumphrey, Michael Joseph | University of Guelph |
| Al Saaideh, Mohammad | Memorial University of Newfoundland |
| Xu, Binyan | Univeristy of Guelph |
| Alatawneh, Natheer | University of Guelph |
| Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics
Abstract: This paper presents the design and implementation of a hybrid sliding mode control (SMC) strategy for a novel two-degree-of-freedom (2-DoF) motion system. The system, driven by a reluctance actuator (RA) for translational motion and two moving magnet actuators (MMAs) for rotational motion, is intended for high-precision applications such as fast-scan mirrors in extreme ultraviolet (EUV) lithography. The nonlinearities of the RA, along with system uncertainties, challenge the controller. To address the dynamics of each axis, a feedforward controller is implemented for the rotational axis, while a robust SMC is developed for the nonlinear translational axis. The SMC is specifically designed to compensate for the RA's strong force-current-position nonlinearity and parametric uncertainties. A smooth hyperbolic tangent function is used to mitigate chattering. The proposed controller is experimentally validated, demonstrating significant improvements in both transient and dynamic tracking performance and validating its suitability for advanced manufacturing and optical applications.
|
| |
| 15:45-16:00, Paper FrC21.2 | Add to My Program |
| Stability and Stabilization of Switched Systems with Random Intermittent Information Transmission |
|
| Taousser, Fatima | University of Tennessee |
| Yichao, Wang | University of Tennessee |
| Djouadi, Seddik, M. | University of Tennessee |
| Tomsovic, Kevin | Clemson University |
Keywords: Stability of linear systems, Stochastic systems, Switched systems
Abstract: This paper investigates the almost sure exponential stability of state-feedback control systems operating over stochastic, non-uniform time domains. The system is subject to intermittent signal transmission interruptions and employs a hold-on strategy, whereby the controller retains and applies the most recent control input during communication failures until transmission is restored. The resulting dynamics are modeled as a switched system evolving over stochastic, disjoint time intervals, interspersed with random discrete steps corresponding to held control inputs. Using a stochastic time-scale framework, we derive sufficient conditions that characterize a region of almost sure exponential stability. The proposed results are demonstrated through a leader-follower consensus problem in multi-agent systems with intermittent communication between the leader and followers.
|
| |
| 16:00-16:15, Paper FrC21.3 | Add to My Program |
| Follow-The-Ridge Approach Based Sliding-Mode Control for Local Minimax Optimisation |
|
| Zenati, Abdelhafid | City Univesity of London |
| Aouf, Nabil | City University of London |
Keywords: Numerical algorithms, Machine learning, Optimization algorithms
Abstract: Numerous machine learning tasks today are essentially about identifying equilibrium points of problems modeled as games played in steps. This is particularly true for two-player zero-sum games, which form the basis of engineering problems formulated as minimax optimisation. Traditional algorithms, such as gradient descent (GDA), often encounter difficulties in accurately locating local minimax points, frequently resulting in convergence to non-local-minimax solutions or trapped in limit cycles. This research introduces a groundbreaking algorithm integrating discrit sliding mode control (DSMC) theory-based strategy for minimax optimisation in machine learning that is shifting away from the conventional dependence on intricate second derivatives. The innovative algorithm introduces a refined nested loop structure that not only simplifies the optimisation process but also guarantees provable, uniform convergence to local minimax points, effectively avoiding the common trap of limit cycles.
|
| |
| 16:15-16:30, Paper FrC21.4 | Add to My Program |
| Sliding Mode Control with Disjoint Switching Manifolds and Distributed Digital Twin for a Steam Tracing System |
|
| D'Amico, Jay | Louisiana Steam Equipment Co |
| Chintalapati, Siddarth | Louisiana Steam Equipment Company |
| Fox, Kyle | Louisiana Steam Equipment Company |
| Kinzie, Ryan | Steam Solutions |
| O'Brien, Scarlett | Steam Solutions |
| Shafiyee, Alif Muhammad | Steam Solutions |
| Drakunov, Sergey V. | Embry-Riddle Aeronautical University |
Keywords: Variable-structure/sliding-mode control, Process Control, Distributed control
Abstract: This paper presents a novel sliding mode control with disjoint switching manifolds applied to a large-scale industrial steam-tracing system. The proposed control algorithm incorporates a digital twin based on a set of coupled partial differential equations that model heat transfer across multiple layers of piping infrastructure. The digital twin enables realtime data fusion from heterogeneous sensor sources, including thermocouples, fiber-optic temperature sensors, and pressure transducers. This information is used to maintain the product temperature within the desired range. The sliding mode controller governs an array of fast-switching steam valves, ensuring high precision and robustness in the face of ambient temperature fluctuations, adverse weather conditions, insulation degradation, valve malfunctions, and sensor faults.
|
| |
| 16:30-16:45, Paper FrC21.5 | Add to My Program |
| Necessary and Sufficient Conditions for Consensus of Binary Input Consensus Protocol |
|
| Li, Mingxi | Beihang University |
| Li, Dongyu | BEIHANG UNIVERSITY |
| Wang, Hanzhou | Beihang University |
Keywords: Variable-structure/sliding-mode control, Robust control, Agents-based systems
Abstract: The Binary Input Consensus Protocol (BICP) is a simple approach for multi-agent systems to reach finite-time consensus under disturbances. Existing research on BICP mainly focuses on discrete-time models or continuous-time models under assumptions that guarantee solution uniqueness. However, attacks, interference, or device failures may invalidate these assumptions, leaving it uncertain whether the system can achieve consensus. We establish the necessary and sufficient conditions for the BICP to achieve consensus under arbitrary bounded disturbances, laying a foundation for analyzing redundancy and failure resilience in such systems. Furthermore, we reformulate the necessary and sufficient conditions as a sign determination problem of the optimal value of an optimization problem and present a more intuitive and interpretable sufficient condition.
|
| |
| 16:45-17:00, Paper FrC21.6 | Add to My Program |
| Liquid Neural Network-Based Integral Sliding Mode Control of Multirotor UAV under Model Uncertainty and Disturbance |
|
| Akhtar, Zainab | University of Engineering & Technology |
| Ijaz, Salman | University of Nottingham Ningbo China |
| Ahmed, Qadeer | The Ohio State University |
| Bhatti, Sidra | OHIO State University |
Keywords: Autonomous systems, Lyapunov methods, Uncertain systems
Abstract: Reliable estimation of uncertainties and external disturbances is crucial for the autonomous operation of unmanned aerial vehicles (UAVs), as it directly impacts stable and robust flight control. To address this, this paper proposes an integrated Liquid Neural Network (LNN)-based integral sliding mode control approach to attain precise control of a multirotor UAV subject to model uncertainty and disturbance. The LNN is designed to estimate model uncertainties and disturbances, thereby providing robustness beyond conventional neural networks. These estimates are subsequently integrated into an integral sliding mode controller, ensuring accurate trajectory tracking under such effects. The simulation results demonstrate that the proposed LNN-based estimator achieves faster convergence, precise uncertainty and disturbance estimation, and improved trajectory tracking performance compared to baseline methods.
|
| |
| FrC22 Regular Session, Churchill C2 |
Add to My Program |
| Distributed Parameter Systems |
|
| |
| Chair: Cui, Leilei | University of New Mexico |
| Co-Chair: Peet, Matthew M. | Arizona State University |
| |
| 15:30-15:45, Paper FrC22.1 | Add to My Program |
| Deterministic Learning-Based Spatiotemporal Dynamics Identification and Estimation of Nonlinear Uncertain Parabolic PDE Systems |
|
| Zhang, Jingting | University of Electronic Science and Technology of China |
| Cheng, Hong | University of Electronic Science and Technology of China |
Keywords: Nonlinear systems identification, Distributed parameter systems, Machine learning
Abstract: This paper investigates the dynamics identification and estimation problems for a class of parabolic partial differential equation (PDE) systems with uncertain nonlinear spatiotemporal dynamics. A key novelty of the proposed approach is to accurately estimate and identify the small dynamics-change by distinguishing it from the dynamics-uncertainty in the time and space scales. This can facilitate the design of, e.g., precise control and fault diagnosis, for the highly-uncertain PDE systems. Specifically, a deterministic learning-based spatiotemporal dynamics identification scheme is first proposed using the radial basis function neural network (RBF NN) for the PDE system's uncertain dynamics. Notably, it exhibits a distinctive capability of guaranteeing the satisfaction of the partial persistent excitation condition, and the optimal convergence of the NN weights. This enables the learned knowledge to be obtained and stored with a convergent RBF NN model, to provide accurate approximation of the system's spatiotemporal dynamics without needing parameter adaptation. A knowledge-based spatiotemporal dynamics estimator is then designed with this convergent RBF NN model, which can provide accurate detection/estimation of the system small dynamics change by distinguishing it from the system uncertainty. Rigorous simulation study on a typical transport-reaction process is performed to validate the effectiveness of our approach, as well as the applicability to practical applications, e.g., fault diagnosis.
|
| |
| 15:45-16:00, Paper FrC22.2 | Add to My Program |
| Distributed Koopman Learning Using Partial Trajectories for Control |
|
| Hao, Wenjian | Purdue University |
| Lu, Zehui | Independent Researcher |
| Upadhyay, Devesh | Saab |
| Mou, Shaoshuai | Purdue University |
Keywords: Distributed parameter systems, Learning
Abstract: This paper proposes a distributed data-driven framework for dynamics learning, termed distributed deep Koopman learning using partial trajectories (DDKL-PT). In this framework, each agent in a multi-agent system is assigned a partial trajectory offline and locally approximates the unknown dynamics using a deep neural network within the Koopman operator framework. By exchanging local estimated dynamics rather than training data, agents achieve consensus on a global dynamics model without sharing their private training trajectories. Simulation studies on a surface vehicle demonstrate that DDKL-PT achieves consensus on the learned dynamics, and each agent attains reasonably small approximation errors on the testing dataset. Furthermore, a model predictive control scheme is developed by integrating the learned Koopman dynamics with known kinematic relations. Results on a reference-tracking task indicate that the distributedly learned dynamics are sufficiently accurate for model-based optimal control.
|
| |
| 16:00-16:15, Paper FrC22.3 | Add to My Program |
| Parameterization of Seed Functions for Equivalent Representations of Time-Varying Delay Systems |
|
| Kisole, Sengiyumva | Arizona State University |
| Chun, Jungbae | University of Michigan, Ann Arbor |
| Seiler, Peter | University of Michigan, Ann Arbor |
| Peet, Matthew M. | Arizona State University |
Keywords: Delay systems, Distributed parameter systems, Linear parameter-varying systems
Abstract: Abel's classic transformation shows that any well-posed system with time-varying delay is equivalent to a parameter-varying system with fixed delay. The existence of such a parameter-varying constant delay representation then simplifies the problems of stability analysis and optimal control. Unfortunately, the method for construction of such transformations has been ad-hoc -- requiring an iterative time-stepping approach to constructing the transformation beginning with a seed function subject to boundary-value constraints. Moreover, a poor choice of seed function often results in a constant delay representation with large time-variations in system parameters -- obviating the benefits of such a representation. In this paper, we show how the set of all feasible seed functions can be parameterized using a basis for L_2. This parameterization is then used to search for seed functions for which the corresponding time-transformation results in smaller parameter variation. The parameterization of admissible seed functions is illustrated with numerical examples that contrast how well-chosen and poorly chosen seed functions affect the boundedness of a time transformation.
|
| |
| 16:15-16:30, Paper FrC22.4 | Add to My Program |
| Data-Driven Operator Policy Iteration with an Application to 1-D Wave Equations |
|
| Yin, Zhun | New York University |
| Jiang, Zhong-Ping | New York University |
Keywords: Distributed parameter systems, Optimal control, Reinforcement learning
Abstract: This paper investigates data-driven policy iteration (PI) for a broad class of infinite-dimensional linear systems in which both the system coefficients and those of the input operator are unknown. By leveraging semigroup theory, the method is formulated in a general operator-theoretic setting. Within this framework, we introduce the operator persistent excitation (OPE) condition, under which a sharp convergence rate is established. Furthermore, we also derive a necessary condition for the OPE property. To illustrate the applicability of the approach, we consider a 1-d wave equation with constant viscous damping. Simulation results demonstrate that the proposed method can significantly improve control performance relying solely on an offline sampled dataset.
|
| |
| 16:30-16:45, Paper FrC22.5 | Add to My Program |
| A System Level Approach to LQR Control of the Diffusion Equation |
|
| McCurdy, Addie | University of Colorado Boulder |
| Gusty, Andrew | University of Colorado |
| Jensen, Emily | University of Colorado, Boulder |
Keywords: Distributed parameter systems, Optimal control
Abstract: The optimal controller design problem for a linear, first-order spatially-invariant distributed parameter system is considered. Through a case study of the Linear Quadratic Regulator (LQR) problem for the diffusion equation over the torus, it is illustrated that the optimal controller design problem can be equivalently formulated as an optimization problem over the system's closed-loop mappings, analogous to the System Level Synthesis framework. This reformulation is solved analytically to recover the LQR for the diffusion equation, and an internally stabilizing implementation of this controller is recovered from the optimal closed-loop mappings. It is further demonstrated that a class of spatio-temporal constraints on the closed-loop maps can be imposed on this closed-loop formulation while preserving convexity.
|
| |