| |
Last updated on April 2, 2023. This conference program is tentative and subject to change
Technical Program for Wednesday May 31, 2023
To show or hide the keywords and abstract (text summary) of a paper (if available), click on the paper title
Open all abstracts
Close all abstracts
Presentation  In person  On-line  No presentation  No information
|
WeA01 RI Session, Sapphire MN |
Add to My Program |
Robotics (RI) |
|
|
Chair: Sharma, Nitin | North Carolina State University |
Co-Chair: Danielson, Claus | University of New Mexico |
|
10:00-10:04, Paper WeA01.1 | Add to My Program |
A Quadratic-Programming Approach for the Real-Time Control of the Fingers Position in Industrial Pneumatic Grippers |
|
Romeo, Rocco Antonio | Istituto Italiano Di Tecnologia |
Zocco, Agata | Istituto Italiano Di Tecnologia |
Fiorio, Luca | Istituto Italiano Di Tecnologia |
Maggiali, Marco | Istituto Italiano Di Tecnologia |
Keywords: Robotics, Optimization algorithms
Abstract: Industrial grippers are generally either electric or pneumatic, with the latter being preferred as their air- based functioning guarantees many advantages such as cost effectiveness and reduced encumbrance. Despite the very large employment, pneumatic grippers do not yet offer performance beyond the open/closed behavior in the majority of cases. Pneumatic grippers mounted on industrial robotic arms are commonly rigid as high forces might be required during the grasping operations. The poorness of the control strategies for the grasping force, as well as for the fingers position, does not match the great advancement that robotics obtained in the last years. Therefore, this letter intends to deliver a new control algorithm for the control of pneumatic grippers. The proposed algorithm permits to finely control the position of the gripper fingers, exploiting a quadratic-programming function to minimize the controller output. Different experiments are shown to demonstrate the algorithm effectiveness.
|
|
10:04-10:08, Paper WeA01.2 | Add to My Program |
Imitating Swarm Behaviors by Learning Agent-Level Controllers |
|
Musaddequr Rahman, Ibrahim | University of Michigan |
White, Stanford | University of Mississippi |
Crockett, Katelyn | West Virginia University |
Gu, Yu | West Virginia University |
Abreu Archanjo Dutra, Dimas | West Virginia University |
Pereira, Guilherme | West Virginia University |
Keywords: Robotics
Abstract: A main challenge in swarm robotics is the unknown mapping between simple agent-level behavior rules and emergent global behaviors. Currently, there is no known swarm control algorithm that maps global behaviors to local control policies. This paper proposes a novel method to circumvent this problem by learning the agent-level controllers of an observed swarm to imitate its emergent behavior. Agent-level controllers are treated as a set of policies that are combined to dictate the agent’s change in velocity. The trajectory data of known swarms is used with linear regression and nonlinear optimization methods to learn the relative weight of each policy. To show our approach’s ability for imitating swarm behavior, we apply this methodology to both simulated and physical swarms (i.e., a school of fish) exhibiting a multitude of distinct emergent behaviors. We found that our pipeline was effective at imitating the simulated behaviors using both accurate and inaccurate assumptions, being able to closely identify not only the policy gains, but also the agent’s radius of communication and their maximum velocity constraint.
|
|
10:08-10:12, Paper WeA01.3 | Add to My Program |
Chance-Constrained Optimization in Contact-Rich Systems |
|
Shirai, Yuki | University of California, Los Angeles |
Jha, Devesh | Mitsubishi Electric Research Labs |
Raghunathan, Arvind | Mitsubishi Electric Research Labs |
Romeres, Diego | Mitsubishi Electric Research Laboratories |
Keywords: Robotics, Optimization, Stochastic optimal control
Abstract: This paper presents a chance-constrained formulation for robust trajectory optimization during manipulation. In particular, we present a chance-constrained optimization for Stochastic Discrete-time Linear Complementarity Systems (SDLCS). To solve the optimization problem, we formulate Mixed-Integer Quadratic Programming with Chance Constraints (MIQPCC). In our formulation, we explicitly consider joint chance constraints for complementarity as well as states to capture the stochastic evolution of dynamics. We evaluate robustness of our optimized trajectories in simulation on several systems. The proposed approach outperforms some recent approaches for robust trajectory optimization for SDLCS.
|
|
10:12-10:16, Paper WeA01.4 | Add to My Program |
Safe Human-Robot Collaborative Transportation Via Trust-Driven Role Adaptation |
|
Zheng, Tony | University of California, Berkeley |
Bujarbaruah, Monimoy | UC Berkeley |
Borrelli, Francesco | Unversity of California at Berkeley |
Stürz, Yvonne R. | UC Berkeley |
Keywords: Robotics, Human-in-the-loop control, Cooperative control
Abstract: We study a human-robot collaborative transportation task in presence of obstacles. The task for each agent is to carry a rigid object to a common target position, while safely avoiding obstacles and satisfying the compliance and actuation constraints of the other agent. Human and robot do not share the local view of the environment. The human either assists the robot when they deem the robot actions safe based on their perception of the environment, or actively leads the task. Using estimated human inputs, the robot plans a trajectory for the transported object by solving a constrained finite time optimal control problem. Sensors on the robot measure the inputs applied by the human. The robot then appropriately applies a weighted combination of the human's applied and its own planned inputs, where the weights are chosen based on the robot's trust value on its estimates of the human's inputs. This allows for a dynamic leader-follower role adaptation of the robot throughout the task. Furthermore, under a low value of trust, if the robot approaches any obstacle potentially unknown to the human, it triggers a safe stopping policy, maintaining safety of the system and signaling a required change in the human's intent. The robot also uses the sensor feedback to infer obstacles known only by the human and updates its planner to better align with the human's movements. With experimental results, we demonstrate that our proposed approach increases the success rate of collision-free trials while decreasing the effort required by the human to intervene.
|
|
10:16-10:20, Paper WeA01.5 | Add to My Program |
NAPVIG: Local Generalized Voronoi Approximation for Reactive Navigation in Unknown and Dynamic Environments |
|
Lissandrini, Nicola | University of Padova |
Battistella, Luca | Department of Information Engineering, University of Padova |
Ryll, Markus | Technical University Munich |
Michieletto, Giulia | University of Padova |
Cenedese, Angelo | University of Padova |
Keywords: Robotics, Autonomous robots, Control applications
Abstract: In this paper, we propose a novel online approach for reactive local navigation of a robotic agent, based on a fast approximation of the Generalized Voronoi Diagram in a neighborhood of the robot’s position. We consider the context of an unknown environment characterized by some narrow passages and a dynamic configuration. Given the uncertainty and unpredictability that affect the scenario, we aim at computing trajectories that are farthest away from every obstacle: this is obtained by following the Voronoi diagram. To ensure full autonomy, the navigation task is performed relying only upon on-board sensor measurement without any a-priori knowledge of the environment. The proposed technique builds upon a smooth free space representation that is spatially continuous and based on some raw measurements. In this way, we ensure an efficient computation of a trajectory that is continuously re-planned according to incoming sensor data. A theoretical proof shows that in ideal conditions the outlined solution exactly computes the local Generalized Voronoi Diagram. Finally, we assess the reactiveness and precision of the proposed method with realistic real-time simulations and with real-world experiments.
|
|
10:20-10:24, Paper WeA01.6 | Add to My Program |
Invariant Configuration-Space Bubbles for Revolute Serial-Chain Robots |
|
Danielson, Claus | University of New Mexico |
Keywords: Robotics, Constrained control, Mechatronics
Abstract: This paper adapts the invariant-set motion planner (ISMP) for robot motion planning. We derive control invariant subsets of configuration-space bubbles lifted into the state-space. The resulting sets guarantee collision avoidance since they are both constraint admissible and control invariant. We present a command governor that enforces the positive invariance of the constraints in closed-loop. This governor can be used to transform any nominal tracking controller into a constraint enforcing controller. We use these control invariant sets to quantify a relationship between velocity and control authority that enables collision avoidance. We demonstrate our invariant-sets through an illustrative numerical example.
|
|
10:24-10:28, Paper WeA01.7 | Add to My Program |
Dynamics Learning-Based Fault Isolation for a Soft Trunk Robot |
|
Zhang, Jingting | University of Rhode Island |
Chen, Xiaotian | University of Rhode Island |
Jandaghi, Emadodin | University of Rhode Island |
Zeng, Wei | South China University of Technology |
Zhou, Mingxi | University of Rhode Island |
Yuan, Chengzhi | University of Rhode Island |
Keywords: Robotics, Fault diagnosis, Identification
Abstract: In this paper, we investigate the fault isolation (FI) problem of a soft trunk robot and propose a dynamics learning-based FI approach which is generic and applicable to general types of faults. Specifically, an adaptive radial basis function neural network (RBF NN) based dynamics learning scheme is first developed to achieve accurate identification of the robot’s dominant dynamics under different faulty modes, and the learned knowledge is stored and represented by constant RBF NN models. The learned results are then merged by using a novel merging mechanism to construct a bank of global RBF NN models, for capturing the characteristics of the robot’s dynamics under each specific faulty mode. Based on these models, a bank of FI observers are designed to develop an important capability of accurately reconstructing the robot’s dynamics under various faulty modes. The FI scheme is developed using these FI observers, which monitors the robot’s operation status online to provide accurate isolation of faults occurring in the robot. Physical experiments are performed on the soft trunk robot to validate the effectiveness of our proposed approaches.
|
|
10:28-10:32, Paper WeA01.8 | Add to My Program |
Robot Control for Simultaneous Impact Tasks through Time-Invariant Reference Spreading |
|
van Steen, Jari J. | Eindhoven University of Technology |
Van De Wouw, Nathan | Eindhoven University of Technology |
Saccon, Alessandro | Eindhoven University of Technology |
Keywords: Robotics, Switched systems, Hybrid systems
Abstract: With the goal of enabling the exploitation of impacts in robotic manipulation, a new framework is presented for control of robotic manipulators that are tasked to execute nominally simultaneous impacts. In this framework, we employ tracking of time-invariant reference vector fields corresponding to the ante- and post-impact motion, increasing its applicability over similar conventional tracking control approaches. The ante- and post-impact references are coupled through a rigid impact map, and are extended to overlap around the area where the impact is expected to take place, such that the reference corresponding to the actual contact state of the robot can always be followed. As a sequence of impacts at the different contact points will typically occur, resulting in uncertainty of the contact mode and unreliable velocity measurements, a new interim control mode catered towards time-invariant references is formulated. In this mode, a position feedback signal is derived from the ante-impact velocity reference, which is used to enforce sustained contact in all contact points without using velocity feedback. With an eye towards real implementation, the approach is formulated using a QP control framework, and is validated using numerical simulations both on a rigid robot with a hard inelastic contact model and on a realistic robot model with flexible joints and compliant partially elastic contact model.
|
|
10:32-10:36, Paper WeA01.9 | Add to My Program |
Robust Control Barrier Functions for Safety Using a Hybrid Neuroprosthesis |
|
Lambeth, Krysten | North Carolina State University |
Singh, Mayank | North Carolina State Univeristy |
Sharma, Nitin | North Carolina State University |
Keywords: Robotics, Robust control, Predictive control for nonlinear systems
Abstract: Many lower-limb hybrid neuroprostheses lack powered ankle assistance and thus cannot compensate for functional electrical stimulation-induced muscle fatigue at the ankle joint. The lack of a powered ankle joint poses a safety issue for users with foot drop who cannot volitionally clear the ground during walking. We propose zeroing control barrier functions (ZCBFs) that guarantee safe foot clearance and fatigue mitigation, provided that the trajectory begins within the prescribed safety region. We employ a backstepping-based model predictive controller (MPC) to account for activation dynamics, and we formulate a constraint to ensure the ZCBF is robust to modeling uncertainty and disturbance. Simulations show the superior performance of the proposed robust MPC-ZCBF scheme for achieving foot clearance compared to traditional ZCBFs and Euclidean safety constraints.
|
|
10:36-10:40, Paper WeA01.10 | Add to My Program |
A Subspace Method for Generalized Controller Synthesis: An Artificial Potential Field Motion Planning Approach |
|
Masoud, Ahmad A. | KFUPM |
Keywords: Stability of linear systems, Computer-aided control design, Decentralized control
Abstract: This paper integrates concepts from the frequency domain control methods, state space control and robot motion planning to produce a flexible control synthesis approach that is applicable to a wide class of dynamical systems. The method views a system's frequency response as a trajectory in the complex plane that is evolving under the influence of an artificial force to satisfy a geometric stability-performance criterion. A nonlinear subspace approach is used to translate these forces into an equivalent state space dynamical system that governs the dynamics of the parameters of the compensator used to generate the control signal. The design method places no restrictions on the order of the compensator or geometry of the frequency domain criterion. The approach is developed and a proof of its ability to converge, if a solution exists, to the tuning parameters set that satisfies the desired conditions is provided. The paper provides a set of design examples to demonstrate its applicability to different types of linear, nonlinear, SISO and MIMO systems.
|
|
WeA02 RI Session, Sapphire IJ |
Add to My Program |
Machine Learning (RI) |
|
|
Chair: Lima, Vinicius | University of Pennsylvania |
Co-Chair: Inoue, Masaki | Keio University |
|
10:00-10:04, Paper WeA02.1 | Add to My Program |
Data-Driven Deep Learning Based Feedback Linearization of Systems with Unknown Dynamics |
|
Goswami, Raktim Gautam | New York University |
Krishnamurthy, Prashanth | NYU Tandon School of Engineering |
Khorrami, Farshad | NYU Tandon School of Engineering |
Keywords: Machine learning, Feedback linearization, Uncertain systems
Abstract: A methodology is developed to learn a feedback linearization (i.e., nonlinear change of coordinates and input transformation) using a data-driven approach for a single input control-affine nonlinear system with unknown dynamics. We employ deep neural networks to learn the feedback law (input transformation) in conjunction with an extension of invertible neural networks to learn the nonlinear change of coordinates (state transformation). We also learn the matrices A and B of the transformed linear system and define loss terms to ensure controllability of the pair (A, B). The efficacy of our approach is demonstrated by simulations on a nonlinear system. Furthermore, we show that state feedback controllers designed using the feedback linearized system yield expected closed-loop behavior when applied to the original nonlinear system, further demonstrating validity of the learned feedback linearization.
|
|
10:04-10:08, Paper WeA02.2 | Add to My Program |
Decentralized Multi-Agent Reinforcement Learning for Continuous-Space Stochastic Games |
|
Altabaa, Awni | Yale University |
Yongacoglu, Bora | Queen's University |
Yuksel, Serdar | Queen's University |
Keywords: Machine learning, Game theory, Markov processes
Abstract: Stochastic games are a popular framework for studying multi-agent reinforcement learning (MARL). Recent advances in MARL have focused primarily on games with finitely many states. In this work, we study multi-agent learning in stochastic games with general state spaces and an information structure in which agents do not observe each other's actions. In this context, we propose a decentralized MARL algorithm and we establish the near-optimality of its policy updates. Furthermore, we study the global policy-updating dynamics for a general class of best-reply based algorithms and derive a closed-form characterization of convergence probabilities over the joint policy space.
|
|
10:08-10:12, Paper WeA02.3 | Add to My Program |
Federated Reinforcement Learning for Generalizable Motion Planning |
|
Yuan, Zhenyuan | Pennsylvania State University |
Xu, Siyuan | Pennsylvania State University |
Zhu, Minghui | Pennsylvania State University |
Keywords: Machine learning, Networked control systems, Robotics
Abstract: This paper considers the problem of learning a control policy that generalize well to novel environments given a set of sample environments. We develop a federated learning framework that enables collaborative learning of multiple learners and a centralized server without sharing their raw data. In each iteration, each learner uploads its local control policy and the corresponding estimated normalized arrival time to the server, which then computes the global optimum among the learners and broadcasts the optimal policy to the learners. Each learner then selects between its local control policy and that from the server for next iteration. By leveraging generalization error, our analysis shows that the proposed framework is able to provide generalization guarantees on arrival time and safety as well as consensus at global optimal value in the limiting case. Monte Carlo simulation is conducted to evaluate the proposed framework.
|
|
10:12-10:16, Paper WeA02.4 | Add to My Program |
Robust Nonlinear Set-Point Control with Reinforcement Learning |
|
Zhang, Ruoqi | Uppsala Univeristy |
Mattsson, Per | Uppsala University |
Wigren, Torbjorn | Uppsala University |
Keywords: Machine learning, Direct adaptive control, Uncertain systems
Abstract: There has recently been an increased interest in reinforcement learning for nonlinear control problems. However standard reinforcement learning algorithms can often struggle even on seemingly simple set-point control problems. This paper argues that three ideas can improve reinforcement learning methods even for highly nonlinear set-point control problems: 1) Make use of a prior feedback controller to aid amplitude exploration. 2) Use integrated errors. 3) Train on model ensembles. Together these ideas lead to more efficient training, and a trained set-point controller that is more robust to modelling errors and thus can be directly deployed to real-world nonlinear systems. The claim is supported by experiments with a real-world nonlinear cascaded tank process and a simulated strongly nonlinear pH-control system.
|
|
10:16-10:20, Paper WeA02.5 | Add to My Program |
Online Learning-Based Predictive Control of Switched Nonlinear Systems with Disturbances |
|
Hu, Cheng | National University of Singapore, |
Wu, Zhe | National University of Singapore |
Keywords: Machine learning, Predictive control for nonlinear systems, Chemical process control
Abstract: This work presents a model predictive control (MPC) scheme using online learning of recurrent neural network (RNN) models to approximate the dynamics of switched nonlinear systems subject to unknown but bounded disturbances, for which the mode transitions follow a prescribed switching schedule. A generalization error bound for online learning RNNs using non-independent and identically distributed (non-i.i.d.) data samples from real-time operation of switched nonlinear systems is first derived. Subsequently, a Lyapunov-based MPC scheme using online learning RNNs is developed to stabilize the switched nonlinear system for each mode and guarantee satisfaction of the scheduled mode transitions, followed by closed-loop stability analysis that accounts for the RNN generalization error. Finally, the effectiveness of the proposed MPC scheme is demonstrated using a chemical process example switched between two modes.
|
|
10:20-10:24, Paper WeA02.6 | Add to My Program |
Optimal Control Via Linearizable Deep Learning |
|
Lima, Vinicius | University of Pennsylvania |
Phan, Dzung | IBM T.J. Watson Research Center |
Nguyen, Lam | IBM Research |
Kalagnanam, Jayant R. | IBM T. J. Watson Research Center |
Keywords: Machine learning, Iterative learning control, Predictive control for nonlinear systems
Abstract: Deep learning models are frequently used to capture relations between inputs and outputs and to predict operation costs in dynamical systems. Computing optimal control policies based on the resulting regression models, however, is a challenging task because of the nonlinearity and nonconvexity of deep learning architectures. To address this issue, we propose in this paper a linearizable approach to design optimal control policies based on deep learning models for handling both continuous and discrete action spaces. When using piecewise linear activation functions, one can construct an equivalent representation of recurrent neural networks in terms of a set of mixed-integer linear constraints. That in turn means that the optimal control problem reduces to a mixed-integer linear program (MILP), which can then be solved using off-the-shelf MILP optimization solvers. Numerical experiments on standard reinforcement learning benchmarks attest to the good performance of the proposed approach.
|
|
10:24-10:28, Paper WeA02.7 | Add to My Program |
Transfer Learning-Based Modeling and Predictive Control of Nonlinear Processes |
|
Xiao, Ming | National University of Singapore |
Hu, Cheng | National University of Singapore, |
Wu, Zhe | National University of Singapore |
Keywords: Machine learning, Neural networks, Predictive control for nonlinear systems
Abstract: This work develops a transfer learning (TL) framework for modeling nonlinear dynamic systems using recurrent neural networks (RNNs). The TL-based RNN models are then incorporated into the design of model predictive control (MPC) systems. Specifically, transfer learning uses a pre-trained model developed based on a source domain as the starting point, and adapts the model to a target domain with similar data distribution. The generalization error for TL-based RNNs (TL-RNNs) that depends on model capacity and discrepancy between source and target domains is first derived to demonstrate the generalization capability on target process. Subsequently, the TL-RNN model is utilized as the prediction model in MPC for the target process. Finally, a chemical process example is used to demonstrate the benefits of transfer learning.
|
|
10:28-10:32, Paper WeA02.8 | Add to My Program |
Personalization of Control Systems by Policy Update with Improving User Trust |
|
Nii, Tomotaka | Keio University |
Inoue, Masaki | Keio University |
Keywords: Machine learning, Optimization algorithms, Adaptive control
Abstract: In this paper, we address the design of personalized control systems, which pursue an individual and private objective defined for each user. To this end, a problem of policy update is formulated where an individual objective function is estimated and the corresponding optimal control law is updated. The novelty of the problem setting is in the presence of a system-user and the policy update driven by his/her rating. The system-user rates on the control system to be updated and the rating is used for estimating his/her objective function. It is assumed that the rating depends not only on his/her private objective but also on his/her trust on the control system. Then, we address the problem of the policy update improving the rating while not impairing the trust. Algorithms of the policy update, which is essentially the objective function estimation, are developed and their convergence analysis is presented. Finally, thorough a numerical experiment, the effectiveness of the algorithms is shown.
|
|
10:32-10:36, Paper WeA02.9 | Add to My Program |
Differentiable Safe Controller Design through Control Barrier Functions |
|
Yang, Shuo | University of Pennsylvania |
Chen, Shaoru | University of Pennsylvania |
Preciado, Victor M. | University of Pennsylvania |
Mangharam, Rahul | University of Pennsylvania |
Keywords: Machine learning, Robotics, Autonomous systems
Abstract: Learning-based controllers, such as neural network (NN) controllers, can show high empirical performance but lack formal safety guarantees. To address this issue, control barrier functions (CBFs) have been applied as a safety filter to monitor and modify the outputs of learning-based controllers in order to guarantee the safety of the closed-loop system. However, such modification can be myopic with unpredictable long-term effects. In this work, we propose a safe-by-construction NN controller which employs differentiable CBF-based safety layers and relies on a set-theoretic parameterization. We compare the performance and computational complexity of the proposed controller and an alternative projection-based safe NN controller in learning-based control. Both methods demonstrate improved closed-loop performance over using CBF as a separate safety filter in numerical experiments.
|
|
10:36-10:40, Paper WeA02.10 | Add to My Program |
Dynamic Covariance Prediction Using Variational Wishart Processes with Uncertain Inputs |
|
Uzzaman, Nahid | Oklahoma State University |
Bai, He | Oklahoma State University |
Keywords: Machine learning, Stochastic systems, Numerical algorithms
Abstract: We consider the problem of forecasting dynamic covariance with uncertain inputs. Various Bayesian approaches, including the variational Wishart process (VWP), were previously used to forecast such covariance with deterministic inputs. However, the VWP framework is insufficient to perform reliable predictions when the input is uncertain. To address this issue, we propose two novel VWP approaches that can model and predict covariance with uncertain inputs. Simulation results show that when the input is uncertain, the proposed novel VWP approaches outperform the original VWP and produce more reliable performance. A comparative discussion between all the approaches is presented based on the simulation results.
|
|
WeA03 Invited Session, Sapphire EF |
Add to My Program |
Modeling, Control and Estimation of Soft Material Systems |
|
|
Chair: Vikas, Vishesh | University of Alabama |
Co-Chair: Chen, Zheng | University of Houston |
Organizer: Tan, Xiaobo | Michigan State University |
Organizer: Vikas, Vishesh | University of Alabama |
|
10:00-10:15, Paper WeA03.1 | Add to My Program |
Physics-Based Modeling of Dielectric Elastomer Enabled Cuff Device (I) |
|
Kaaya, Theophilus | University of Houston |
Venkatraman, Rahul | University of Houston |
Chen, Zheng | University of Houston |
Keywords: Flexible structures, Model Validation, Modeling
Abstract: Venous system disorders such as Orthostatic hypotension, deep vein thrombosis (DVT), and edema affect the lower limbs and are common causes of decreased work performance and quality of life. Solutions such as compression devices, rotation of staff, and regular breaks help improve these problems. Some active compression devices require air compression which needs a pump thus making them bulky. A cuff muscle device that uses a dielectric elastomer as a soft actuator and sensor is proposed to supplement the existing means of reducing the effects of these disorders. A physics-based model of the device is developed by combining the physics of a thin-walled dielectric elastomer vessel with the force interactions between the active vessel and the cylindrical passive elastomer within. The couplings between the two nonlinear elastic models are solved to capture the pressure change experienced by the device under an applied voltage. The model is then validated in the normal frequency range of the device. This model may be used to predict the influence of different model parameters on the performance of the device before the fabrication process reducing the need for multiple prototyping.
|
|
10:15-10:30, Paper WeA03.2 | Add to My Program |
Rapid Design and Production of Soft Actuators Using Dynamic Modeling and Additive Manufacturing for Underwater Soft Actuators |
|
Yin, Alexander | University of Rhode Island |
Shomberg, Russell | University of Rhode Island |
Noel, Jason | University of Rhode Island |
Daeffler, Michael | University of Rhode Island |
Licht, Stephen | University of Rhode Island |
Phillips, Brennan | University of Rhode Island |
Keywords: Robotics, Modeling, Manufacturing systems
Abstract: Soft robotic actuators have repeatedly demon- strated their utility for underwater manipulation, particularly in the deep sea with delicate biological creatures and fragile artifacts. Up to this point, soft robotic actuators and gripping modules have been limited to relatively small prototypes that are on the same scale as a human hand. Scaling soft robotic grippers to larger sizes is a non-trivial task due to two major challenges: design and manufacturing. In this work, we present a complete and streamlined workflow of modeling, manufacturing, and testing scalable soft actuators that are directly produced using additive manufacturing methods and finite element modeling (FEA). The presented workflow is an iterative approach that uses information gathered from the FEA’s simulation to further improve the simplified known initial model. To demonstrate this new workflow, a series of soft actuator designs were modeled, created and tested. Additionally, a more complex theoretical actuator design that has a non-uniform bending geometry is created and modeled. Once the actuator design matches what is desired, additive manufacturing is used to physically create it. Using this full process, an actuator design is easily scaled to almost three times its original length and is manufactured in under 36 hours. The scaled up actuators are arranged in a custom full gripping array to grasp a cylinder underwater in a predictive manner.
|
|
10:30-10:45, Paper WeA03.3 | Add to My Program |
Simultaneous Motion and Stiffness Control for Soft Pneumatic Manipulators Based on a Lagrangian-Based Dynamic Model (I) |
|
Mei, Yu | Michigan State University |
Fairchild, Preston | Michigan State University |
Srivastava, Vaibhav | Michigan State University |
Cao, Changyong | Case Western Reserve University |
Tan, Xiaobo | Michigan State University |
Keywords: Robotics, Control applications, Modeling
Abstract: A soft continuum manipulator with tunable stiffness can not only take advantage of high compliance for safe adaptation in unknown environments, but also circumvent the drawbacks of instability and low loading capability. The high nonlinearity of soft manipulators and the strong coupling between actuation and stiffness-tuning make their simultaneous control challenging. In this work, a novel approach to simultaneous control of actuation and stiffness-tuning is proposed for soft pneumatic manipulators. With piecewise-constant curvature assumption, a Lagrangian-based dynamic model with realistic approximation is used for control design, where the dynamics of stiffness-tunable mechanism is incorporated. An extended Kalman filter (EKF) is proposed to estimate unmeasurable states including the stiffness and the velocity. An Nonlinear Model Predictive Control (NMPC) framework is developed first in the configuration space, and then extended to the task space, for simultaneous motion and stiffness control under inflation and vacuum pressure constraints. Simulation results are presented to support the efficacy of the proposed approach.
|
|
10:45-11:00, Paper WeA03.4 | Add to My Program |
Controlling the Shape of Soft Robots Using the Koopman Operator (I) |
|
Singh, Ajai | Colorado State University |
Sun, Jiefeng | Yale University |
Zhao, Jianguo | Colorado State University |
Keywords: Robotics, Mechatronics
Abstract: Abstract— In nature, animals with soft body parts demonstrate remarkable control over their shape, such as an elephant trunk wrapping around a tree branch to pick it up. However, most research on robotic manipulators focuses on controlling the end effector, partly because the manipulator’s arm is rigidly articulated. With recent advances in soft robotics research, controlling a soft manipulator into many different shapes will significantly improve the robot’s functionality, such as medical robots morphing their shape to navigate the digestive system and deliver drugs to specific locations. However, controlling the shape of soft robots is challenging due to their highly nonlinear dynamics that are computationally intensive. In this paper, we leverage a physics-informed, data-driven approach using the Koopman operator to realize the shape control of soft robots. We simulate the dynamics of a soft manipulator using a physics-based simulator (PyElastica) to generate the input-output data, which is then used to identify an approximated linear model based on the Koopman operator. We then formulate the shapecontrol problem as a convex optimization problem that is computationally efficient. Our linear model is over 12 times faster than the physics-based model in simulating the manipulator’s motion. Further, we can control a soft manipulator into different shapes using model predictive control. We envision that the proposed method can be effectively used to control the shapes of soft robots to interact with uncertain environments or enable shape-morphing robots to fulfill diverse tasks. This paper is complemented with a video.
|
|
11:00-11:15, Paper WeA03.5 | Add to My Program |
Feedback Control for Inflatable Soft Robotic Finger Touch Detection Based on Static Pressure-Resistance Characteristics (I) |
|
Boivin, Megan | University of California Santa Cruz |
Esch, Conrad | University of California, Santa Cruz |
Wehner, Michael | University of Wisconsin, Madison |
Milutinovic, Dejan | University of California, Santa Cruz |
Keywords: Robotics, Mechanical systems/robotics
Abstract: We present a method for feedback controlling a soft finger actuator and detecting “touch”, i.e., contact with an object in the environment. While this work uses our pneumatic elastomeric actuator with a soft bending sensitive resistive sensor along its dorsal surface, the method is suited for many soft actuator designs with a bending sensor. We use a closed- loop reference tracking controller and detect contact from the magnitude of the reference tracking error. This method uses a single bending sensitive sensor and does not require a force sensor collocated with the point of contact. We demonstrate the method’s ability to track a reference, sense and maintain contact with an external object, switch from a state of unimpeded motion to either a state of light touch or to a state of firm touch, and remain robust against orientation and saturation.
|
|
WeA04 Invited Session, Sapphire AB |
Add to My Program |
Advanced Control of Wind Farms and Wind Turbines: Session I: Wind Farm
Modeling, Estimation and Control |
|
|
Chair: Mulders, Sebastiaan Paul | Delft University of Technology |
Co-Chair: Gayme, Dennice | Johns Hopkins University |
Organizer: Mulders, Sebastiaan Paul | Delft University of Technology |
Organizer: Bay, Christopher | National Renewable Energy Laboratory |
Organizer: Fleming, Paul | National Renewable Energy Laboratory |
Organizer: van Wingerden, Jan-Willem | Delft University of Technology |
|
10:00-10:15, Paper WeA04.1 | Add to My Program |
Differentiable Control for Adaptive Wake Steering (I) |
|
Adcock, Christiane | Stanford |
Iaccarino, Gianluca | Stanford University |
King, Jennifer | National Renewable Energy Laboratory |
Keywords: Learning, Modeling, Fault tolerant systems
Abstract: Wake steering yaws upstream wind turbines to deflect their wakes from downstream turbines, increasing the total power produced by the wind farm. Most wake steering methods generate lookup tables offline which map a set of wind farm conditions, such as wind speed, to yaw offset angles for each turbine in a farm. These tables assume all turbines are operational and can be significantly non-optimal when one or more turbines shutdown--as they often do because of low wind speed, routine maintenance, or emergency maintenance. We present a new wake steering method that adapts to turbine shutdown. Using a hybrid model- and learning-based method, differentiable control, we train a neural network to determine yaw offset angles from conditions including turbine status (active/inactive). Unlike the lookup table approach, differentiable control does not solve an optimization problem for each combination of turbine shutdown in a farm; including learning in the method allows it to generalize. We present results for both standard wake steering (all turbines active) and adaptive wake steering (some turbines active). We find that differentiable control has comparable accuracy as and an order of magnitude faster offline compute time than the lookup table approach. Differentiable control enables adaptive wake steering through computationally efficient training and rapid online evaluation.
|
|
10:15-10:30, Paper WeA04.2 | Add to My Program |
Free-Flow Wind Speed Estimation for a Wind Turbine Affected by Wake (I) |
|
Bezerra Rufino Ferreira Paiva, Eduardo | IFP Energies Nouvelles |
Lepreux, Olivier | IFP Lyon |
Bresch-Pietri, Delphine | MINES ParisTech |
Keywords: Delay systems, Distributed parameter systems
Abstract: We present a method to estimate the time-varying free-flow wind speed on a wind farm based on local wind speed measurements taken by a wind turbine inside the wake zone of a turbine array. Our approach relies on a simple modeling of the speed deficit as a 1-D transport equation [1]. We propose to estimate the free-flow wind speed by integrating the error between the local wind measurement and an estimation of it computed with the free-flow estimate. We provide a bound on the estimation error which we formally prove. Finally, we provide numerical simulations to illustrate the interest and the performance of the proposed method.
|
|
10:30-10:45, Paper WeA04.3 | Add to My Program |
Actor Critic Agents for Wind Farm Control (I) |
|
Bizon Monroc, Claire | INRIA |
Busic, Ana | Inria |
Dubuc, Donatien | IFP Energies Nouvelles |
Zhu, Jiamin | IFPEN |
Keywords: Control applications, Machine learning, Smart grid
Abstract: The power output of a wind farm is influenced by wake effects, a phenomenon in which upstream turbines facing the wind create sub-optimal conditions for turbines located downstream. Yaw misaligning strategies have been shown to increase total production. Yet designing efficient methods of cooperative control to find optimal yaw angles is a challenging task. Classical optimization methods become intractable as the size of the farm grows, do not recover from model inaccuracies and ignore the dynamic propagation of the wind inflow in real conditions. Reinforcement learning methods can provide a model-free alternative, but raise issues of scalability when the control is centralized. Existing decentralized RL methods have been shown to significantly increase power production under dynamic conditions, but relied on tabular methods with state and action space discretization. To accelerate convergence, we employ an actor-critic algorithm with linear function approximation for decentralized cooperative yaw control. We validate our method in dynamic simulators for wind farms with up to 32 turbines, and show empirically that, compared to previous tabular algorithms, our method is faster and scales to larger wind farms.
|
|
10:45-11:00, Paper WeA04.4 | Add to My Program |
Yaw-Augmented Control for Wind Farm Power Tracking (I) |
|
Starke, Genevieve | Johns Hopkins University |
Meneveau, Charles | Johns Hopkins University |
King, Jennifer | National Renewable Energy Laboratory |
Gayme, Dennice | Johns Hopkins University |
Keywords: Energy systems, Predictive control for nonlinear systems
Abstract: This paper presents a wind farm control strategy to use wake steering (yaw control) to augment pitch control for tracking a power reference signal. The outer loop controller employs a recently proposed dynamic yaw model with a time-varying graph structure that accounts for changes in the farm power output due to the propagation of wakes and wake interactions from yawing turbines. The inner pitch control loop uses a PI controller combined with a novel power-sharing arrangement that reduces the needed derate at each turbine. A compensation scheme accounts for the slow timescale effects of the yaw control actions within the faster timescale pitch control. The controller is applied to track two power trajectories (typical of secondary frequency regulation signals) using a large-eddy simulation wind farm plant. The results demonstrate that the additional control authority from yaw provides some added benefit in reducing the required turbine derates needed for wind farms to track transient power increases in the proposed setting. However, the benefit decreases and pitch control alone is sufficient when the turbines are derated beyond a certain level. These findings suggest that augmenting pitch control with yaw may provide financial incentives in terms of allowing wind farms to maximize power supply to the bulk power market while still providing regulation services. Further work is needed to analyze the costs and benefits of the additional control complexity versus bandwidth in augmenting pitch control with wake steering in these applications.
|
|
11:00-11:15, Paper WeA04.5 | Add to My Program |
A New Wind Farm Active Power Control Strategy to Boost Tracking Margins in High-Demand Scenarios (I) |
|
Tamaro, Simone | Technical University of Munich |
Bottasso, Carlo Luigi | Technical University of Munich |
Keywords: Optimal control, Aerospace, PID control
Abstract: This paper presents a new active power control algorithm designed to maximize the power reserve of the individual turbines in a farm, in order to improve the tracking accuracy of a power reference signal. The control architecture is based on an open-loop optimal set-point scheduler combined with a feedback corrector, which actively regulate power by both wake steering and induction control. The control architecture is compared with a state-of-the-art PI-based controller by means of high-fidelity LES simulations. The new wind farm controller reduces the occurrence of local saturation events, thereby improving the overall tracking accuracy, and limits fatigue loading in conditions of relatively high-power demand.
|
|
WeA05 Regular Session, Sapphire 411A |
Add to My Program |
Optimization Algorithms I |
|
|
Chair: Kia, Solmaz S. | University of California Irvine (UCI) |
Co-Chair: Boroun, Morteza | University of Arizona |
|
10:00-10:15, Paper WeA05.1 | Add to My Program |
Distributed Optimal Resource Allocation Using Transformed Primal-Dual Method |
|
Kia, Solmaz S. | University of California Irvine (UCI) |
Wei, Jingrong | University of California, Irvine |
Chen, Long | University of California at Irvine |
Keywords: Optimization, Optimization algorithms, Distributed control
Abstract: We consider an in-network optimal resource allocation problem in which a group of agents interacting over a connected graph want to meet a demand while minimizing their collective cost. The contribution of this paper is to design a distributed continuous-time algorithm for this problem inspired by a recently developed first-order transformed primal-dual method. The solution applies to cluster-based setting where each agent may have a set of subagents, and its local cost is the sum of the cost of these subagents. The proposed algorithm guarantees an exponential convergence for strongly convex costs and asymptotic convergence for convex costs. Exponential convergence when the local cost functions are strongly convex is achieved even when the local gradients are only locally Lipschitz. For convex local cost functions, our algorithm guarantees asymptotic convergence to a point in the minimizer set. Through numerical examples, we show that our proposed algorithm delivers a faster convergence compared to existing distributed resource allocation algorithms.
|
|
10:15-10:30, Paper WeA05.2 | Add to My Program |
Accelerated Primal-Dual Scheme for a Class of Stochastic Nonconvex-Concave Saddle Point Problems |
|
Boroun, Morteza | University of Arizona |
Alizadeh, Zeinab | University of Arizona |
Jalilzadeh, Afrooz | University of Arizona |
Keywords: Optimization algorithms, Machine learning, Stochastic systems
Abstract: Stochastic nonconvex-concave min-max saddle point problems appear in many machine learning and control problems including distributionally robust optimization, generative adversarial networks, and adversarial learning. In this paper, we consider a class of nonconvex saddle point problems where the objective function satisfies the Polyak-Łojasiewicz condition with respect to the minimization variable and it is concave with respect to the maximization variable. The existing methods for solving nonconvex-concave saddle point problems often suffer from slow convergence and/or contain multiple loops. Our main contribution lies in proposing a novel single-loop accelerated primal-dual algorithm with new convergence rate results appearing for the first time in the literature, to the best of our knowledge.
|
|
10:30-10:45, Paper WeA05.3 | Add to My Program |
On the Global Exponential Stability of Primal-Dual Dynamics for Convex Problems with Linear Equality Constraints |
|
Ozaslan, Ibrahim Kurban | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Optimization algorithms, Stability of nonlinear systems, Lyapunov methods
Abstract: We examine global exponential stability of the primal-dual gradient flow dynamics for differentiable convex problems with linear equality constraints. We show that if the set of equilibrium points is affine, then, regardless of the initial conditions, trajectories of the gradient flow move in the direction that is perpendicular to equilibrium set. When the objective function is strongly convex, we utilize this structure to show that the primal-dual dynamics are globally exponentially stable even if the constraint matrix is not full-row rank. We also provide an explicit characterization of the exponential convergence rate in terms of the smallest nonzero singular value of the constraint matrix.
|
|
10:45-11:00, Paper WeA05.4 | Add to My Program |
Accelerated Algorithms for a Class of Optimization Problems with Equality and Box Constraints |
|
Parashar, Anjali | Massachusetts Institute of Technology |
Srivastava, Priyank | Massachusetts Institute of Technology |
Annaswamy, Anuradha M. | Massachusetts Inst. of Tech |
Keywords: Optimization, Adaptive control, Optimization algorithms
Abstract: Convex optimization with equality and inequality constraints is a ubiquitous problem in several optimization and control problems in large-scale systems. Recently there has been a lot of interest in establishing accelerated convergence of the loss function. A class of high-order tuners was recently proposed in an effort to lead to accelerated convergence for the case when no constraints are present. In this paper, we propose a new high-order tuner that can accommodate the presence of equality constraints. In order to accommodate the underlying box constraints, time-varying gains are introduced in the high-order tuner which leverage convexity and ensure anytime feasibility of the constraints. Numerical examples are provided to support the theoretical derivations.
|
|
11:00-11:15, Paper WeA05.5 | Add to My Program |
An Event-Triggered Distributed Nonsmooth Resource Allocation Algorithm for Second-Order Multi-Agent Systems |
|
Shi, Xiasheng | China University of Mining and Technology |
Ding, Lifu | Zhejiang University |
Lin, Zhiyun | Southern University of Science and Technology |
Keywords: Optimization algorithms, Distributed control, Adaptive control
Abstract: This brief aims to solve the distributed resource allocation problem for second-order multi-agent systems over an undirected network, where the global objective function is strongly convex but not necessarily Lipschitz continuous for its subgradient. The resource state is subject to a global equality constraint and several local inequality constraints with a convex function. Unlike existing tracking deviation control strategies, a dynamic event-triggered and initialization-free distributed resource allocation algorithm is proposed to reduce the communication burden among agents. The local constraints are solved by an adaptive control approach based on the Karush-Kuhn-Tucker condition. It is shown that the proposed algorithm asymptotically converges to the optimal solution by using the set-valued LaSalle’s invariance principle. Moreover, it is guaranteed that Zeno behavior is ruled out for any agent. Finally, a simulation example shows the effectiveness of the proposed algorithm.
|
|
11:15-11:30, Paper WeA05.6 | Add to My Program |
Zero-Gradient-Sum Algorithm-Based Distributed Optimization in Finite Time for Agents on Signed Networks |
|
Yang, Ying | Northeastern University, China |
Ma, Dan | Northeastern University |
Zhang, Yingwei | Northeastern University |
Keywords: Agents-based systems, Distributed control
Abstract: Based on zero-gradient-sum (ZGS) algorithm, this paper studies the finite-time distributed optimization problem for multi-agent systems (MASs) over signed networks. First, a finite-time distributed control protocol is designed to ensure all agent states to realize bipartite consensus in a finite time under undirected signed graphs. Meanwhile, the optimized solutions of minimizing global convex objective functions under undirected signed graphs also converge to the bipartite consensus values. Moreover, the finite-time upper bound is given, which depends on the initial states and also the choice of the parameters in the proposed protocols. Then, extend the results to the case of signed digraphs. Finally, simulation results show the validity of the proposed methods.
|
|
WeA06 Regular Session, Sapphire 411B |
Add to My Program |
Nonlinear Systems |
|
|
Chair: Ma, Tong | Northeastern University |
Co-Chair: Russo, Giovanni | University of Salerno |
|
10:00-10:15, Paper WeA06.1 | Add to My Program |
High-Gain Output Feedback Control Design for a Class of Uncertain Nonlinear Systems Using Gaussian Processes |
|
Ma, Tong | Northeastern University |
Keywords: Nonlinear output feedback, Observers for nonlinear systems, Learning
Abstract: This paper considers the tracking control problem of a class of uncertain nonlinear systems with partial noisy measurements. To cope with the uncertainties including the unknown nonlinear dynamics and unmeasured state variables, simultaneous estimation of all the hidden states and unknown dynamics are required for the controller design. In this paper, a high-gain observer delivers a good property of disturbance rejection and provides state estimates which serves as the training data for learning the unknown dynamics. Because the measurements are corrupted by noise, this leads to estimation error in the state estimates. Since the Gaussian process (GP) has high flexibility to capture the complex unknown functions by using very few parameters and it inherently handles measurement noise, GP model is employed to learn the unknown dynamics from the state estimates. This provides critical information for the control design such that the unknown dynamics is compensated and a good tracking performance is delivered. In short, the high-gain observer provides state estimates for the GP model to learn the unknown dynamics from noisy measurements, which enables the development of the controller. Comparisons against L1 adaptive control and high-gain output feedback controller without GP learning are carried out.
|
|
10:15-10:30, Paper WeA06.2 | Add to My Program |
On the Design of Multiplex Control to Reject Disturbances in Nonlinear Network Systems Affected by Heterogeneous Delays |
|
Xie, Shihao | University College Dublin |
Russo, Giovanni | University of Salerno |
Keywords: Control of networks, Distributed control, Stability of nonlinear systems
Abstract: We consider the problem of designing control pro- tocols for nonlinear network systems affected by heterogeneous, time-varying delays and disturbances. For these networks, the goal is to reject polynomial disturbances affecting the agents and to guarantee the fulfilment of some desired network behaviour. To satisfy these requirements, we propose an integral control design implemented via a multiplex architecture. We give sufficient conditions for the desired disturbance rejection and stability properties by leveraging tools from contraction theory. We illustrate the effectiveness of the results via a numerical example that involves the control of a multi-terminal high-voltage DC grid.
|
|
10:30-10:45, Paper WeA06.3 | Add to My Program |
Time Delay Based Neural Network Control for Systems with State-Dependent Nonlinearity |
|
Patkar, Abhishek | Massachusetts Institute of Technology |
Meng, Qinghui | Weichai Power Co., Ltd |
Wang, Hanrui | Weichai Power |
Youcef-Toumi, Kamal | Massachusetts Inst. of Tech |
Keywords: Neural networks, Stability of nonlinear systems, Electrical machine control
Abstract: This paper tackles the problem of control for a class of nonlinear systems with state-dependent nonlinearities. A new control algorithm combining Time Delay Control and Neural Networks is proposed for such systems. Assuming all state variables are available, the proposed control algorithm is shown to learn the nonlinearity online, provide closed loop stability and achieve tracking performance better than that of time delay control. The performance of the proposed control algorithm is evaluated and compared to that of Time Delay Control and traditional PI control through simulation studies of an Interior type Permanent Magnet Synchronous Motor (IPMSM).
|
|
10:45-11:00, Paper WeA06.4 | Add to My Program |
Small Gain Theorem and L2 Gain Computation in Large Using Koopman Spectrum |
|
Sutavani, Sarang | Clemson University |
Umathe, Bhagyashree | Clemson University |
Vaidya, Umesh | Clemson University |
Keywords: Nonlinear output feedback, Optimization, LMIs
Abstract: The paper is about L2-gain computation and the small-gain theorem for nonlinear input-output systems. We show that the Koopman operator's spectrum can provide conditions for L2-gain guarantees and small-gain theorem-based stability of interconnection over a large region of the state space. The large region in the state space can be characterized in terms of the region where Koopman eigenfunctions and the solution of the Hamilton Jacobi equation are well defined. The connection of system L2-gain to the spectral properties of the Koopman operator has led to a novel approach, based on the approximation of the Koopman spectrum, for the computation of the L2-gain and stability verification of the interconnected system. We present simulation results including application of the developed framework to a power system example.
|
|
11:00-11:15, Paper WeA06.5 | Add to My Program |
Discrete-Time Transverse Feedback Linearization |
|
D'Souza, Rollen S. | Unaffiliated |
Keywords: Feedback linearization, Algebraic/geometric methods, Stability of nonlinear systems
Abstract: Applications of transverse feedback linearization (TFL) vary from path following mobile robots to vehicle formation control. These applications were, however, restricted to systems adequately modelled in continuous-time. Recent work demonstrated that the established technique fails when applied to a discrete-time system using a zero-order hold. An additional change of coordinates dependent on the sampling period that preserves the required properties was proposed as an alternative. This technique, however, only applies to sampled-data systems. This article instead proposes a direct design approach that starts with a discrete-time system and designs a discrete-time transverse feedback linearizing controller. The discrete-time transverse feedback linearization problem is posed, and resolved for a single-input nonlinear discrete-time system. An example of path following for a forward-Euler discretized, kinematic unicycle model is presented to demonstrate its effectiveness.
|
|
11:15-11:30, Paper WeA06.6 | Add to My Program |
Chance-Constrained State Feedback Control Law Design for Nonlinear Systems under Uncertainties |
|
Yang, Yu | California State University Long Beach |
Dam, Nguyen Cam Thuy | California State University Long Beach |
Keywords: Chemical process control, Randomized algorithms, Predictive control for nonlinear systems
Abstract: A chance-constrained full-state feedback control law is designed to regulate nonlinear systems under uncertainties. The proposed scheme utilizes Monte Carlo sampling to generate multiple scenarios, formulates the optimal control problem as a scenario-based nonlinear optimization, and develops a sequential algorithm to obtain probabilistic feasible solutions. The resulting controller offers three advantages: First, the optimization-based design minimizes the tracking error across considered scenarios. Second, the sampling complexity is determined adaptively and the chance of constraint violation is bounded with a guaranteed confidence interval. Third, the sequential algorithm can reach a probabilistic feasible solution faster than directly using a state-of-the-art solver for the full-scenario optimization problem. Two case studies, including a CSTR and fermentor, are presented to demonstrate the effectiveness of the proposed method.
|
|
WeA07 Regular Session, Aqua 303 |
Add to My Program |
Identification |
|
|
Chair: Chakravorty, Suman | Texas A&M University |
Co-Chair: Rivera, Daniel E. | Arizona State Univ |
|
10:00-10:15, Paper WeA07.1 | Add to My Program |
Temporal Forward-Backward Consistency, Not Residual Error, Measures the Prediction Accuracy of Extended Dynamic Mode Decomposition |
|
Haseli, Masih | University of California, San Diego |
Cortes, Jorge | University of California, San Diego |
Keywords: Identification, Computational methods, Subspace methods
Abstract: Extended Dynamic Mode Decomposition (EDMD) is a popular data-driven method to approximate the action of the Koopman operator on a linear function space spanned by a dictionary of functions. The accuracy of EDMD model critically depends on the quality of the particular dictionary span, specifically on how close it is to being invariant under the Koopman operator. Motivated by the observation that the residual error of EDMD, typically used for dictionary learning, does not encode the quality of the function space and is sensitive to the choice of basis, we introduce the novel concept of consistency index. We show that this measure, based on using EDMD forward and backward in time, enjoys a number of desirable qualities that make it suitable for data-driven modeling of dynamical systems: it measures the quality of the function space, it is invariant under the choice of basis, can be computed in closed form from the data, and provides a tight upper-bound for the relative root mean square error of all function predictions on the entire span of the dictionary.
|
|
10:15-10:30, Paper WeA07.2 | Add to My Program |
The T26.4 Method for Step Response Identification of Overdamped 2nd Order Systems |
|
Messner, William | Carnegie Mellon University |
Keywords: Identification, Control education
Abstract: The T26.4 Method is a new approach to identifying the parameters of overdamped or slightly underdamped 2 nd order LTI systems either graphically or by table look-up. The method computes the ratio of the time at which the step response reaches 26.4% of its final value to the time at which it reaches a specific fraction of its final value (such as 60%, 75%, or 90%). This ratio is the input to a table or graph to determine the values of the poles normalized by the 26.4% time. Unlike the Beta T-star Method, the T26.4 method does not require differentiation of the step response, and thus it is well-suited to system identification from noisy or sparse step response data. This paper explains the significance of the 26.4% value for 2 nd order LTI systems, derives the method, and then shows its application to identifying models of a DC motor from experimental data and a slightly underdamped 2 nd order system from simulated data.
|
|
10:30-10:45, Paper WeA07.3 | Add to My Program |
Idiographic Dynamic Modeling for Behavioral Interventions with Mixed Data Partitioning and Discrete Simultaneous Perturbation Stochastic Approximation |
|
Kha, Rachael | Arizona State University |
Rivera, Daniel E. | Arizona State Univ |
Klasnja, Predrag | University of Michigan |
Hekler, Eric | UC San Diego |
Keywords: Identification, Emerging control applications, Optimization
Abstract: This paper presents the use of discrete simultaneous perturbation stochastic approximation (DSPSA) as a routine method to efficiently determine features and parameters of idiographic (i.e. single subject) dynamic models for personalized behavioral interventions using various partitions of estimation and validation data. DSPSA is demonstrated as a valuable method to search over model features and regressor orders of AutoRegressive with eXogenous input estimated models using participant data from Just Walk (a behavioral intervention to promote physical activity in sedentary adults); results of DSPSA are compared to those of exhaustive search. In Just Walk, DSPSA efficiently and quickly estimates models of walking behavior, which can then be used to develop control systems to optimize the impacts of behavioral interventions. The use of DSPSA to evaluate models using various partitions of individual data into estimation and validation data sets also highlights data partitioning as an important feature of idiographic modeling that should be carefully considered.
|
|
10:45-11:00, Paper WeA07.4 | Add to My Program |
Exponential Resetting and Cyclic Resetting Recursive Least Squares |
|
Lai, Brian | University of Michigan, Ann Arbor |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Identification, Estimation, Filtering
Abstract: We present two extensions of recursive least squares (RLS) with exponential forgetting (EF), namely, exponential resetting (ER) RLS and cyclic resetting (CR) RLS. Both methods guarantee that the covariance matrix is bounded above and below in the absence of persistent excitation. Under zero excitation, ER-RLS guarantees convergence of the covariance matrix P_k to a user-designed positive-definite matrix P_infty. However, ER-RLS is more computationally complex than EF-RLS. In contrast, CR-RLS has the same computational complexity as EF-RLS while guaranteeing that, under zero excitation, the difference between the covariance matrix P_k and P_infty is asymptotically bounded. A numerical example shows that ER-RLS and CR-RLS both perform nearly identically to EF-RLS under persistent excitation while protecting against covariance windup when persistent excitation is lost.
|
|
11:00-11:15, Paper WeA07.5 | Add to My Program |
An Identification Approach for Descriptor Systems |
|
He, Jiabao | KTH Royal Institute of Technology |
Zhang, Xuan | Tsinghua-Berkeley Shenzhen Institute |
Xu, Feng | Tsinghua University |
Wang, Xueqian | Tsinghua University |
Keywords: Identification, Identification for control, Linear systems
Abstract: This paper proposes a novel identification approach for linear time-invariant descriptor systems based on input-state-output data. Under the assumption of regularity, the equivalent inverse form of the descriptor system is firstly introduced. Then two groups of data experiments are designed, and corresponding input, state and output data are collected. Furthermore, the parameters of the inverse form are uniquely identified based on those data sets. Since the inverse form possesses the same state, input, output signals and input-output transfer function as the original descriptor system, the identification is completed. Finally, an electrical circuit system is provided to illustrate the effectiveness of our method.
|
|
11:15-11:30, Paper WeA07.6 | Add to My Program |
The Information-State Based Approach to Linear System Identification |
|
Gul Mohamed, Mohamed Naveed | Texas A&M University |
Goyal, Raman | Texas A&M University |
Chakravorty, Suman | Texas A&M University |
Wang, Ran | Texas A&M University |
Keywords: Identification, Identification for control, Time-varying systems
Abstract: This paper considers the problem of system identification for linear systems. We propose a new system realization approach that uses an ``information-state" as the state vector, where the ``information-state" is composed of a finite number of past inputs and outputs. The system identification algorithm uses input-output data to fit an autoregressive moving average model (ARMA) to represent the current output in terms of finite past inputs and outputs. This information-state-based approach allows us to directly realize a state-space model using the estimated ARMA or time-varying ARMA parameters for linear time-invariant (LTI) or linear time-varying (LTV) systems, respectively. The paper develops the theoretical foundation for using ARMA parameters-based system representation using only the concept of linear observability, details the reasoning for exact output modeling using only the finite history, and shows that there is no need to separate the free and the forced response for identification. The proposed approach is tested on various different systems, and the performance is compared with state-of-the-art system identification techniques.
|
|
WeA08 Invited Session, Aqua 305 |
Add to My Program |
Estimation and Control of Infinite Dimensional Systems I |
|
|
Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Co-Chair: Burns, John A | Virginia Tech |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Burns, John A | Virginia Tech |
|
10:00-10:15, Paper WeA08.1 | Add to My Program |
Optimization of Non-Pharmaceutical Interventions for a Mutating Virus (I) |
|
Weightman, Ryan | Rutgers University–Camden |
Piccoli, Benedetto | Rutgers University - Camden |
Keywords: Optimal control, Optimization, Computational methods
Abstract: International focus on the COVID-19 pandemic has caused a wealth of new mathematical models for capturing the impact of a virus. As COVID-19 seems to be approaching an endemic status, it is becoming increasingly clear that a new variant has a strong probability of becoming dominant in a short period of time from the first appearance. A model with the goal of representing past data and forecasting will need the flexibility to incorporate a time-evolution of variants. In this paper we explore a method for encompassing mutating viruses: coupling Ordinary Differential Equations (ODE) and Markov chains. In this approach, ODEs are used to represent classical compartmental models and Markov chains to govern the mutation of the virus between predetermined variants. This method considers a discrete variant space allowing for more simple parameter tuning to previously recorded data. A cost function is designed in order to study optimal decision making with respect to non-pharmaceutical interventions such as social distancing. This model will serve to highlight the importance of considering variants in the long-term decision making process.
|
|
10:15-10:30, Paper WeA08.2 | Add to My Program |
Parametric and Non-Parametric Estimation of a Random Diffusion Equation-Based Population Model for Deconvolving Blood/Breath Alcohol Concentration from Transdermal Alcohol Biosensor Data with Uncertainty Quantification (I) |
|
Allayioti, Maria | University of Southern California |
Oszkinat, Clemens | University of Southern California |
Saldich, Emily | University of Southern California |
Goldstein, Larry | University of Southern California |
Luczak, Susan | University of Southern California |
Wang, Chunming | Univ. of Southern California |
Rosen, I. Gary | Univ. of Southern California |
Keywords: Uncertain systems, Estimation, Distributed parameter systems
Abstract: A population model for the transdermal transport of ethanol from the blood to an alcohol biosensor on the surface of the skin in the form of a random abstract parabolic hybrid system of coupled ordinary and partial differential equations is developed. Linear semigroup theory in a Gelfand triple of Bochner spaces is used to first formulate the model as an equivalent deterministic system in state space form and to then develop a finite dimensional approximation and convergence theory. Both parametric and non-parametric techniques, the method of moments and kernel density estimation, and clinically collected drinking data are used to estimate the distributions of the model’s random parameters. Numerical results demonstrate the efficacy of using our model to 1) deconvolve an estimate of blood alcohol concentration from biosensor measured transdermal alcohol level and 2) quantify the uncertainty in the estimate.
|
|
10:30-10:45, Paper WeA08.3 | Add to My Program |
Adaptive Alternatives in the Velocity Control of Mean-Field Kuramoto Models (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems
Abstract: This paper presents an alternative control design for the macroscopic model of Kuramoto oscillators. The resulting partial differential equation describing the density of the collective dynamics is a nonlinear version of the continuity equation in which the control signals are coupled to the state. An optimal control for the bilinear system provides optimality of the controllers but it also results in an open-loop policy due to the backward in time integration of adjoint states. To tackle this, an adaptive alternative is considered and which views the control signals corresponding to a target density, as unknown spatiotemporally varying parameters. An adaptive observer is proposed along with the Lyapunov-redesign of the parameter adaptive laws. The stability of the closed-loop density equation using adaptive estimates of the controllers along with tracking convergence are examined.
|
|
10:45-11:00, Paper WeA08.4 | Add to My Program |
H-Infinity Control of a Heated/Cooled Rod under Point Actuation and Point Sensing (I) |
|
Krener, Arthur J | Naval Postgraduate School |
Keywords: Distributed parameter systems, H-infinity control
Abstract: Our long term goal is to extend the H-Infinity paradigm to nonlinear, infinite dimensional systems under point actuation and point sensing. This paper is the first step in this direction. We find the minimum L2 gain from the noisy heat flux at one end of a rod to the temperature at an arbitrary point of the rod that can be achieved by state feedback on the heat flux at the other end of the rod. We do this by completing the square to obtain a Riccati partial differential equation whose solution yields a state feedback control law that achieves a given L2 gain. We solve the Riccati PDE by Fourier series. We iterate this process to find the minimum L2 gain.
|
|
11:00-11:15, Paper WeA08.5 | Add to My Program |
A Cascading Method for Reducing Asymptotic Errors in Feedback Control of Nonlinear Distributed Parameter Systems (I) |
|
Aulisa, Eugenio | Texas Tech University |
Burns, John A | Virginia Tech |
Gilliam, David S. | Texas Tech University |
Keywords: Output regulation, Nonlinear output feedback, Distributed parameter systems
Abstract: This paper presents an error feedback controller for approximate tracking and disturbance rejection for nonlinear distributed parameter systems. The controller is error feedback because the only information available to the controller is the error given as the difference between the reference signal to be tracked and the measured output of the plant. In particular, the controller cannot directly access the output data. Also, the unknown disturbance corrupting the plant is unavailable to the controller. The controller is ``approximate'' in the sense it only guarantees a small tracking error rather than an asymptotic zero tracking error. However, the asymptotic tracking error can be reduced by solving a sequence of controllers, similar to cascade controllers, where the error at one level becomes the target to track at the next level. At each step, the error is reduced geometrically, so achieving the desired tracking level seldom requires more than one or two iterations. We present a numerical example to demonstrate the utility of the method.
|
|
11:15-11:30, Paper WeA08.6 | Add to My Program |
A Note on the Optimality of Balanced Truncation for a Class of Infinite Dimensional Systems (I) |
|
Djouadi, Seddik, M. | University of Tennessee |
Keywords: Distributed parameter systems, Reduced order modeling
Abstract: Balanced truncation is a popular model reduction method that uses balanced realizations for finite dimensional systems. The latter are state space realizations where the controllability and observability gramians are equal to the same diagonal positive matrix. In this paper, a generalization of balanced realization for a class of infinite dimensional LTI systems is employed to perform balanced truncation. It is shown that balanced truncation is optimal in the Hilbert-Schmidt sense if a particular time-varying balanced realization is used for the original LTI system. It appears that the use of time-varying balanced realizations to study the optimality and perform balanced model reduction for LTI systems is novel.
|
|
WeA09 Regular Session, Aqua 307 |
Add to My Program |
Estimation I |
|
|
Chair: Ernst, Eugen | Karlsruhe Institute of Technology |
Co-Chair: Clouatre, Maison | Texas A&M University |
|
10:00-10:15, Paper WeA09.1 | Add to My Program |
Distributed Target Tracking in Multi-Agent Networks Via Sequential Quadratic Alternating Direction Method of Multipliers |
|
Shorinwa, Ola | Stanford University |
Schwager, Mac | Stanford University |
Keywords: Estimation, Optimization, Sensor networks
Abstract: We present a distributed algorithm for multi-agent target tracking, posed as a maximum a-posteriori (MAP) optimization problem. MAP estimation is, in general, a non-convex optimization that depends on each agent's local observation of the target, necessitating a distributed algorithm. In our algorithm, each agent solves a series of local optimization problems to estimate the target's trajectory, while communicating with its one-hop neighbors over a communication network. The agents do not communicate their raw observations, which may be high dimensional (e.g., images), and they do not rely on a central coordinating node or leader, minimizing the communication bandwidth requirements of our approach. We utilize the sequential quadratic programming (SQP) paradigm, with distributed computation of the ensuing sub-problems achieved via the consensus alternating direction method of multipliers (C-ADMM). We empirically demonstrate faster convergence of our algorithm to a locally optimal solution compared to other distributed methods. In addition, our algorithm achieves about the same communication overhead as the best competing distributed algorithm.
|
|
10:15-10:30, Paper WeA09.2 | Add to My Program |
Improved Action Potential Detection for Imaging Techniques by Exploiting Fuzzy C-Means Clustering |
|
Fauser, Moritz | Rheinland-Pfälzische Technische Universitaet Kaiserslautern-Land |
Zhang, Ping | Rheinland-Pfälzische Technische Universitaet Kaiserslautern-Land |
Wadle, Simon | Technische Universität Kaiserslautern |
Hirtz, Jan | Technische Universität Kaiserslautern |
Keywords: Estimation, Pattern recognition and classification, Biological systems
Abstract: In recent years, observing neuronal activity by inferring action potentials (APs) in mammals has attracted much attention. A common way to observe thousands of neurons simultaneously is by using calcium imaging techniques. However, estimating the APs based on the fluorescence signal obtained by the calcium imaging technique is a challenging task due to noise, slow imaging rates and especially the nonlinearity of the calcium binding kinetics. Though the electrical recording technique can measure the APs very precisely, it is rather time consuming in practice. In this paper, an approach is proposed that reconstruct the APs based on the noisy fluorescence signal. For this purpose, at first the forward-backward filtering is applied on the fluorescence signal to reduce the level of noise and to avoid the nonlinear shift in time with respect to the true fluorescence signal. Then, for each local maximum in the filtered fluorescence signal, three characteristic values, namely, the integral, the amplitude and the gradient are extracted to localize the neuronal activity. By exploiting the fuzzy c-means clustering method, the time instants and the number of APs can be estimated. The proposed approach is validated by using the well-established spikefinder challenger data. The comparison shows that the proposed approach outperforms other existing AP estimation approaches.
|
|
10:30-10:45, Paper WeA09.3 | Add to My Program |
The Kernel-SME Filter with Adaptive Kernel Widths for Association-Free Multi-Target Tracking |
|
Ernst, Eugen | Karlsruhe Institute of Technology |
Pfaff, Florian | Karlsruhe Institute of Technology (KIT) |
Hanebeck, Uwe D. | Karlsruhe Institute of Technology (KIT) |
Baum, Marcus | Karlsruhe Institute of Technologie (KIT) |
Keywords: Filtering, Kalman filtering, Estimation
Abstract: Different objectives and paradigms exist for tracking multiple targets when measurements do not contain information about the target identities (IDs). The Symmetric Measurement Equation (SME) filter can be used when one is agnostic to the labels and does not attempt to assign different IDs to the different targets. We present an extension of the Kernel-SME filter that, unlike the original variant, uses adaptive kernel widths that depend on the respective uncertainty. In our evaluation, it outperformed existing SME-based approaches, while it is only second to a more complex global nearest neighbor tracker.
|
|
10:45-11:00, Paper WeA09.4 | Add to My Program |
Closed-Form Hilbert Projection for Quantum State Observers |
|
Clouatre, Maison | Texas A&M University |
Balas, Mark | Texas A&M University |
Thitsa, Makhin | Mercer University |
Keywords: Quantum information and control, Optimization, Estimation
Abstract: Designing an observer of an unknown quantum density operator is difficult because the operator must be Hermitian positive semidefinite with unit trace. In this paper, we derive a closed-form solution for projecting an arbitrary matrix onto the set of valid density operators. This allows us to design linear quantum state observers and retract the observer's state back to this set while retaining the exponential convergence rate of the linear observer. The derived closed-form projection can be used alongside any quantum state estimation technique to produce a valid state estimate without increasing the state estimation error.
|
|
11:00-11:15, Paper WeA09.5 | Add to My Program |
REACTMIN: Reactive Scanning Based Single Particle Tracking Using a Minimum of Light |
|
Vickers, Nicholas A. | Boston University |
Subedi, Sandip | Boston University |
Andersson, Sean B. | Boston University |
Keywords: Biomolecular systems, Estimation
Abstract: Real-time feedback-driven single particle tracking is an emerging method of measuring the state and dynamics of individual molecules and particles at the nanometer scale as they move inside living cells. Two problems limit current performance: the photon budget of fluorescent labels, and the control temporal budget. REACTMIN overcomes these challenges by reactively scanning a minimum of light to track diffusing particles and molecules. It is comprised of a non-holonomic based extremum seeking controller and structured illumination using a laser with a central minimum. The controller allows for estimationless tracking, addressing the control temporal budget, and the structured illumination enables information rich measurements, addressing the photon budget. We use two metrics of performance, Fisher information and tracking duration, to characterize the system and find optimal settings for one of the dominant controller parameters, the steady state orbital radius. We demonstrate a trade-off between tracking duration and localization precision and discuss how to select this parameter in the context of specific experimental aims.
|
|
11:15-11:30, Paper WeA09.6 | Add to My Program |
Digital Twin Design for hMSC Expansion in Hollow-Fiber Bioreactors |
|
Kanwar, Bharat | Georgia Institute of Technology |
Wang, Bryan | Georgia Institute of Technology |
Roy, Krishnendu | Georgia Institute of Technology |
Mazumdar, Anirban | Georgia Institute of Technology |
Balakirsky, Stephen | Georgia Tech Research Institute |
Keywords: Cellular dynamics, Estimation, Process Control
Abstract: Human Mesenchymal Stromal Cells (hMSC) have shown promising pre-clinical results by eliciting immunomodulatory effects to alleviate inflammation. In order to further study these effects, consistent and automated expansion platforms are required. Recent theoretical innovations have shown that model-based automated controls can more effectively regulate key nutrient concentrations. However, this previous work did not account for time-varying cell growth and death which resulted in inconsistent modeling and controller performance. To mitigate these effects, we propose a new model with time-varying parameters to track viable, proliferating, and dead cells and their respective growth rates with algorithms to estimate these parameters as functions of our limited measured states. We then propose an updated control architecture (referred to as smooth-controller) to leverage the additional parameters for improved estimation and control. The control objective is to regulate glucose and lactate to fixed setpoints while minimizing total media usage and large flowrate disturbances. Finally, we demonstrate the new control architecture in hMSC expansion with improved lactate setpoint MSE (58% reduction), improved observer MSE (36% for glucose and 20% for lactate), and reduced process disturbance (1 to 0 lactate spikes). Although the smooth-controller did not improve cell yield (4.91*10^7 compared to 5.08*10^7), it did reduce media usage to match the reduced growth rate thereby increasing cell yield per mL of fed media (6.3*10^4 to 8.6*10^4).
|
|
WeA10 Regular Session, Aqua 309 |
Add to My Program |
Agents-Based Systems I |
|
|
Chair: Ramirez-Neria, Mario | Universidad Iberoamericana Ciudad De Mexico |
Co-Chair: Malikopoulos, Andreas A. | University of Delaware |
|
10:00-10:15, Paper WeA10.1 | Add to My Program |
On the Emergent Hypocycloidal and Epicycloidal Formations in a Swarm of Double Integrator Agents |
|
Fedele, Giuseppe | University of Calabria |
D'Alfonso, Luigi | Università Della Calabria |
Keywords: Agents-based systems, Autonomous systems, Cooperative control
Abstract: In this work, a multi-agent system with emergent hypocycloidal and epicycloidal behaviors is analyzed. Agents are assumed to be modeled as double integrators with an acceleration law designed so that the swarm exhibits cycloidal like trajectories. The resulting control protocol benefits from speeds and positions coupling among neighbors agents to let each agent perform hypocycloidal or epicycloidal curves according to the choice of the controller parameters and the agents initial conditions. As a first step, the model properties are obtained assuming that the communication topology of the multi-agent system is defined by a complete graph. As a further step, the model main features are extended by relaxing the complete connection assumption and replacing it with a more realistic one in which a connected graph is considered. This latter step is achieved by means of a consensus-based estimation of the swarm centroid, so that to virtualize the complete graph starting from a connected one. Simulations have been performed to show the properties of the obtained agents evolution.
|
|
10:15-10:30, Paper WeA10.2 | Add to My Program |
Practical Time-Varying Formation Tracking Control for Multi-Agent Systems |
|
Thakur, Ankush | IIT Mandi |
Jain, Tushar | Indian Institute of Technology Mandi |
Keywords: Agents-based systems, Autonomous systems, Cooperative control
Abstract: This paper introduces the concept of formation switching in time-varying formation tracking (TVFT) control for linear time-invariant (LTI) multi-agent systems (MASs) under the collision avoidance constraints. One of the main challenges in designing a novel formation controller is that the leader agent does not prespecify the desired formation to the follower agents. Also, the former may change the desired formation to any other at the run time according to task requirements and environmental spatial constraints. Based on estimation of the so-called generator matrix and the formation vector, follower agents maintain the time-varying formation while tracking the leader. The exponential stability of the overall proposed distributed control scheme is proved using the Lyapunov stability theory. Finally, the effectiveness of the formation switching in TVFT is demonstrated using a numerical example.
|
|
10:30-10:45, Paper WeA10.3 | Add to My Program |
Resilient Consensus Based on Evidence Theory and Weight Correction |
|
Bonagura, Valeria | Roma Tre University |
Fioravanti, Camilla | University Campus Bio-Medico of Rome |
Oliva, Gabriele | University Campus Bio-Medico of Rome |
Panzieri, Stefano | Univ. "Roma Tre" |
Keywords: Agents-based systems, Autonomous systems, Networked control systems
Abstract: In recent years, the security and resilience of distributed algorithms have become a feature of the utmost importance, especially for applications dealing with sensitive data or critical infrastructures. In this paper, we develop a robust weighted distributed consensus algorithm based on agents’ reputations. By resorting to Evidence Theory, our algorithm is able to evaluate the reputation of each communication link in the graph and to update it over time, following the evolution of each node’s behavior. Moreover, our approach is able to detect the presence of malicious or faulty nodes that vary their propensity to adhere to the correct consensus strategy over time. Finally, the reputation evaluation process is reinforced at each step by a novel weight correction algorithm, which improves the efficacy of recognizing corrupted nodes and is able to reduce their influence on past history. A simulation campaign completes the paper and demonstrates its effectiveness experimentally
|
|
10:45-11:00, Paper WeA10.4 | Add to My Program |
Constraint-Driven Optimal Control for Emergent Swarming and Predator Avoidance |
|
Beaver, Logan E. | Boston University |
Malikopoulos, Andreas A. | University of Delaware |
Keywords: Agents-based systems, Biologically-inspired methods, Decentralized control
Abstract: In this article, we present a constraint-driven optimal control framework that achieves emergent cluster flocking within a constrained 2D environment. We formulate a decentralized optimal control problem that includes safety, flocking, and predator avoidance constraints. We explicitly derive conditions for constraint compatibility and propose an event-driven constraint relaxation scheme. We map this to an equivalent switching system that intuitively describes the behavior of each agent in the system. Instead of minimizing control effort, as it is common in the ecologically-inspired robotics literature, in our approach, we minimize each agent's deviation from their most efficient locomotion speed. Finally, we demonstrate our approach in simulation both with and without the presence of a predator.
|
|
11:00-11:15, Paper WeA10.5 | Add to My Program |
Leader-Follower ADRC Strategy for Omnidirectional Mobile Robots without Time-Derivatives in the Tracking Controller |
|
Ramirez-Neria, Mario | Universidad Iberoamericana Ciudad De Mexico |
Luviano-Juarez, Alberto | UPIITA - IPN Mexico |
Madonski, Rafal | Jinan University |
Ramirez-Juarez, Rodrigo | Facultad De Estudios Superiores Cuautitlan Campo 4-Universidad N |
Lozada-Castillo, Norma Beatriz | ESIME-Zacatenco |
Gao, Zhiqiang | Cleveland State Univ |
Keywords: Agents-based systems, Control applications, Mechatronics
Abstract: In this article, the problem of designing an active disturbance rejection control (ADRC) strategy for omnidirectional mobile robots in leader-follower formation is addressed. The proposed solution utilizes the partially available robot dynamical model and only uses robot position as measurable information. A special extended state observer in error-based form is constructed allowing the controller to be designed without an explicit use of signal time-derivatives, which increases the practical appeal of the introduced control scheme. Experimental results show the effectiveness of the approach in terms of trajectory tracking and disturbance rejection.
|
|
11:15-11:30, Paper WeA10.6 | Add to My Program |
Consensus of Double Integrators in Presence of a Reverse Edge in a Chain: Analysis and Design |
|
Dubey, Avinash Kr | Indian Institute of Technology Bombay |
Mukherjee, Dwaipayan | Indian Institute of Technology Bombay |
Arya, Kavi | Indian Institute of Technology Bombay |
Keywords: Agents-based systems, Cooperative control, Autonomous systems
Abstract: Consensus among double integrators, over a directed network in presence of a cycle, is not guaranteed even if the graph has a spanning tree. Necessary and sufficient condition for consensus in such a set-up shows that a positive lower bound exists on the `velocity coupling factor' when a cycle is present, as opposed to an acyclic graph where any positive coupling factor suffices. This positive lower bound, on account of a reverse edge addition, may require every agent in the network, to increase the coupling factor. Motivated by this, the present paper proposes several approaches to guarantee consensus for chain networks with a single reverse edge (resulting in a directed cycle). Moreover, the proposed strategies can be implemented locally since only the nodes or edge-weights involved in the cycle or adjacent to it need to effect changes to guarantee consensus while the rest of the agents remain unaffected.
|
|
WeA11 Regular Session, Aqua Salon AB |
Add to My Program |
Game Theory I |
|
|
Chair: Marden, Jason R. | University of California, Santa Barbara |
Co-Chair: Liu, Hugh Hong-Tao | Univ. of Toronto |
|
10:00-10:15, Paper WeA11.1 | Add to My Program |
Defending a Target Area with a Slower Defender |
|
Fu, Han | University of Toronto |
Liu, Hugh Hong-Tao | Univ. of Toronto |
Keywords: Game theory, Aerospace, Robust control
Abstract: The target defense game is an abstraction of the counter-UAV mission, where a defender intends to intercept an invading drone before it enters a target area. While most studies on target defense games assume the defender travels faster, defending a target area with a slower defender is a less studied yet challenging problem because capture cannot be guaranteed. This paper identifies two special cases where the defender has a chance to win, where the game region is bounded and where the target area is small. In the former case, the defender traps the invader at the corner. In the latter case, the defender delays the entering permanently by rotating around the target area at a sufficiently large angular speed. In both games, the optimal trajectory has a two-stage structure. Exploiting this feature, a novel method is proposed to solve for the barrier, which gives guidelines on how to deploy the defenders to ensure the target area been protected.
|
|
10:15-10:30, Paper WeA11.2 | Add to My Program |
Tuning Rate of Strategy Revision in Population Games |
|
Park, Shinkyu | KAUST |
Keywords: Game theory, Agents-based systems, Distributed control
Abstract: We investigate a multi-agent decision problem in population games where each agent in a population makes a decision on strategy selection and revision to engage in repeated games with others. The strategy revision is subject to time delays which represent the time it takes for an agent revising its strategy needs to spend before it can adopt a new strategy and return back to the game. We discuss the effect of the time delays on long-term behavior of the agents’ strategy revision. In particular, when the time delays are large, the strategy revision would exhibit oscillation and the agents spend substantial time in “transitioning” between different strategies, which prevents the agents from attaining the Nash equilibrium of the game. As a main contribution of the paper, we propose an algorithm that tunes the rate of the agents’ strategy revision and show such tuning approach ensures convergence to the Nash equilibrium. We validate our analytical results using simulations.
|
|
10:30-10:45, Paper WeA11.3 | Add to My Program |
Differentially Private Games Via Payoff Perturbation |
|
Chen, Yijun | University of Sydney |
Shi, Guodong | The University of Sydney |
Keywords: Game theory, Agents-based systems, Networked control systems
Abstract: In this paper, we study network games where players are involved in information aggregation processes subject to the differential privacy requirement for players’ payoff functions. We propose a Laplace linear-quadratic functional perturbation (LLQFP) mechanism, which perturbs players' payoff functions with linear-quadratic functions whose coefficients are produced from truncated Laplace distributions. For monotone games, we show that the LLQFP mechanism maintains the concavity property of the perturbed payoff functions, and produces a perturbed NE whose distance from the original NE is bounded and adjustable by Laplace parameter tuning. We focus on linear-quadratic games, which is a fundamental type of network games with players' payoffs being linear-quadratic functions, and derive explicit conditions on how the LLQFP mechanism ensures differential privacy with a given privacy budget. Lastly, numerical examples are provided for the verification of the advantages of the LLQFP mechanism.
|
|
10:45-11:00, Paper WeA11.4 | Add to My Program |
On the Adversarial Convex Body Chasing Problem |
|
Guan, Yue | Georgia Institute of Technology |
Pan, Longxu | Georgia Institute of Technology |
Shishika, Daigo | George Mason University |
Tsiotras, Panagiotis | Georgia Institute of Technology |
Keywords: Game theory, Agents-based systems, Optimization
Abstract: In this work, we extend the convex body chasing problem to an adversarial setting, where an agent (the Player) is tasked to chase a sequence of convex bodies generated adversarially by an opponent. The Player aims to minimize the cost associated with its total movements, while the Opponent tries to maximize. The set of feasible convex bodies is finite and known to both agents, which allows us to provide performance guarantees with max-min optimality rather than via the competitive ratio. Under some mild assumptions, we show the continuity of the optimal value function and provide an algorithm to numerically generate the optimal policies with performance guarantees. Finally, the theoretical results are verified through numerical examples.
|
|
11:00-11:15, Paper WeA11.5 | Add to My Program |
Analyzing Pre-Commitment Strategies in General Lotto Games |
|
Paarporn, Keith | University of Colorado, Colorado Springs |
Chandan, Rahul | Amazon.com |
Kovenock, Daniel | Chapman University |
Alizadeh, Mahnoosh | University of California Santa Barbara |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Agents-based systems
Abstract: A salient feature of many optimal decision-making policies in adversarial environments is a level of unpredictability, or randomness, which keeps opponents uncertain about the system's strategies. These considerations, along with feedback from adversarial behaviors, are crucial in ensuring the security of modern infrastructures and complex systems. This paper considers policies that do just the opposite, namely ones that reveal strategic intentions to an opponent before engaging in competition. We consider such scenarios in the context of General Lotto games, which models the competitive allocation of resources between opposing players. Here, we consider a dynamic extension where one of the players has the option to publicly pre-commit assets to a battlefield in the first stage. In response, the opponent decides whether to secure the battlefield by matching the pre-commitment with its own resources, or to withdraw from it entirely. They then engage over the remaining set of battlefields in the second stage. We show that the weaker-resource player can have incentives to pre-commit when the battlefield values are asymmetric across players. Previous work asserts this never holds when the values are symmetric across players. Our analysis demonstrates the viability of alternate strategic mechanisms that a competitor may be able to employ.
|
|
11:15-11:30, Paper WeA11.6 | Add to My Program |
Path Defense in Dynamic Defender-Attacker Blotto Games (dDAB) with Limited Information |
|
Chen, Austin Ku | University of Pennsylvania |
Ferguson, Bryce L. | University of California, Santa Barbara |
Shishika, Daigo | George Mason University |
Dorothy, Michael | Combat Capabilities Development Command Army Research Laboratory |
Marden, Jason R. | University of California, Santa Barbara |
Pappas, George J. | University of Pennsylvania |
Kumar, Vijay | University of Pennsylvania |
Keywords: Game theory, Autonomous robots
Abstract: We consider a path guarding problem in dynamic Defender-Attacker Blotto games (dDAB), where a team of robots must defend a path in a graph against adversarial agents. Multi-robot systems are particularly well suited to this application, as recent work has shown the effectiveness of these systems in related areas such as perimeter defense and surveillance. When designing a defender policy that guarantees the defense of a path, information about the adversary and the environment can be helpful and may reduce the number of resources required by the defender to achieve a sufficient level of security. In this work, we characterize the necessary and sufficient number of assets needed to guarantee the defense of a shortest path between two nodes in dDAB games when the defender can only detect assets within k-hops of a shortest path. By characterizing the relationship between sensing horizon and required resources, we show that increasing the sensing capability of the defender greatly reduces the number of defender assets needed to defend the path.
|
|
WeA12 Invited Session, Aqua Salon C |
Add to My Program |
Advanced Controls in Automotive Systems |
|
|
Chair: Chen, Pingen | Tennessee Technological University |
Co-Chair: Wang, Zejiang | Oak Ridge National Laboratory |
Organizer: Wang, Zejiang | Oak Ridge National Laboratory |
Organizer: Ozkan, Mehmet | Texas Tech University |
Organizer: Chen, Pingen | Tennessee Technological University |
|
10:00-10:15, Paper WeA12.1 | Add to My Program |
A Novel and Elliptical Lattice Design of Flocking Control for Multi-Agent Ground Vehicles (I) |
|
Wang, Gang | Arizona State University |
Liu, Mingzhe | Arizona State University |
Wang, Fengchen | The MathWorks, Inc |
Chen, Yan | Arizona State University |
Keywords: Autonomous vehicles, Agents-based systems, Automotive control
Abstract: Flocking control of multi-agent ground vehicles recently attracted rising attention because of its strength in extending 1D platooning to coordinated 2D movements. However, the uniform interaction ranges and the non-defined orientation of the flocking lattice make flocking control of ground vehicles face some key issues. To achieve cooperative motions of connected and automated vehicles (CAVs), this letter proposed a novel and elliptical lattice to extend the existing flocking theory with a uniform hexagon lattice. The proposed elliptical lattice is designed based on the characteristics of the vehicle heading direction, velocity, minimum safety distance, and lane width to analytically adapt to vehicle driving environments. Moreover, a new flocking control law considering road boundaries’ (permanent) repulsive forces is developed to ensure the desired formation at a steady state. Simulation results show that the proposed elliptical lattice of flocking control can be applied to realize cooperative driving of multi-agent CAVs with the desired formation on the road.
|
|
10:15-10:30, Paper WeA12.2 | Add to My Program |
Paceijka-Like Curve-Based Speed Reference Generators for Electric Vehicles Powered by In-Wheel Motors (I) |
|
Verrelli, Cristiano Maria | Tor Vergata University |
El Arayshi, Mohamed | University of Rome Tor Vergata, Electronic Engineering Departmen |
Keywords: Automotive control, Control applications
Abstract: Recently, motor speed reference generators have been designed for the cruise control of electric vehicles with either centralized electric motors (straight manoeuvres) or in-wheel motors (bend manoeuvres with sufficiently small constant steering angles). Steady-state operation at a safe (though conservative) tire longitudinal slip can be achieved, with no a priori knowledge regarding the occurrence of a specific external condition. Here, an innovative solution is presented. It is based on an ingenious use of Pacejka-like curves - representing the torque current-slip characteristics of the vehicle - to design a new contraction-mapping-based automatic tuning procedure for the longitudinal velocity of the vehicle. Such an innovative procedure overcomes the highly conservative nature of the previous approach in terms of unduly small longitudinal velocities (and yaw rates) under relatively favourable external conditions while guaranteeing a safe operating condition close to the maximum of the torque-slip characteristics with no knowledge of the tire-road adhesion coefficient. Comparative CarSim simulations illustrate the effectiveness of the proposed approach, in the presence of uncertain parameters and complex vehicle dynamics that are neglected at the control design stage.
|
|
10:30-10:45, Paper WeA12.3 | Add to My Program |
Online Parameter Estimation Using Physics-Informed Deep Learning for Vehicle Stability Algorithms (I) |
|
Koysuren, Muhammed Kemal | Bilkent University |
Keles, Ahmet Faruk | Bilkent University |
Cakmakci, Melih | Bilkent University |
Keywords: Identification for control, Control applications, Automotive control
Abstract: Physics-informed deep learning is a popular trend in the modeling and control of dynamical systems. This paper presents a novel method for rapid online identification of vehicle cornering stiffness coefficient, a crucial parameter in vehicle stability control models and control algorithms. The new method enables designers to rapidly identify the vehicle front and rear cornering stiffness parameters so that the controller reference gains can be re-adjusted under varying road and vehicle conditions to improve the reference tracking performance of the control system during operation. The proposed method based on vehicle model-based deep learning is compared to other alternatives such as traditional neural network training and identification, and Pacejka model estimation with regression. Our initial findings show that, in comparison to these classical methods, high fidelity estimations can be done with much smaller data sets simple enough to be obtained from a lane-changing or vehicle overtake maneuver. In order to conduct experiments, and collect sensor data, a custom-built 1:8 scaled test vehicle platform is used real-time wireless networking capabilities. The proposed method is applicable to predict derived vehicle parameters such as the understeering coefficient so it can be used in parallel with conventional MIMO controllers. Our H∞ yaw rate regulation controller test results show that the reference gains updated with the proposed online estimation method improve the tracking performance in both simulations and vehicle experiments.
|
|
10:45-11:00, Paper WeA12.4 | Add to My Program |
Adversarial Learning for Safe Highway Driving Based on Two-Player-Zero-Sum Game (I) |
|
Li, Fangjian | Clemson University |
Zhao, Mengtao | Clemson University |
Wagner, John R. | Clemson University |
Wang, Yue | Clemson University |
Keywords: Autonomous vehicles, Machine learning, Game theory
Abstract: In this paper, we set up a two-player-zero-sum Markov game (TZMG) framework to train a safe driving policy network so that the worst intentions of the neighbor vehicles can be considered. Compared to the conventional policy learning frameworks, the TZMG framework can embed the adversary from the neighbor vehicle throughout its training process. Furthermore, a novel TZMG Q-learning algorithm based on the Wolpertinger policy is proposed to be scalable to multiple adversarial neighbor vehicles. Finally, simulations and humans-in-the-loop experiments are conducted to verify the effectiveness of the TZMG framework and novel algorithm. Compared to the benchmarking safety controllers in the literature, our proposed novel TZMG algorithm can achieve a much lower collision rate when dealing with adversarial neighbor vehicles.
|
|
11:00-11:15, Paper WeA12.5 | Add to My Program |
Safe Lane-Keeping with Feedback Delay: Bifurcation Analysis and Experiments (I) |
|
Vörös, Illés | Budapest University of Technology and Economics |
Takacs, Denes | Budapest University of Technology and Economics |
Orosz, Gabor | University of Michigan |
Keywords: Autonomous vehicles, Delay systems, Automotive control
Abstract: A lane-keeping controller for automobiles is analyzed in this paper, with the consideration of time delay in the feedback loop. Using numerical continuation, unstable periodic orbits are identified inside the linearly stable domain of control gains. Based on the amplitude of these unstable solutions, safe parameter zones can be identified, where the closed loop system is robust against perturbations, i.e., where the basin of attraction of the stable equilibrium is larger. The sensitivity to initial conditions in different regions of linearly stable control gains is demonstrated by a series of real vehicle experiments. Finally, modifications of the control law are proposed that lead to significant improvements in terms of the robustness against perturbations of the controlled vehicle, achieving global stability in the practical sense.
|
|
11:15-11:30, Paper WeA12.6 | Add to My Program |
Energy Impact of Connecting Multiple Signalized Intersections to Energy-Efficient Driving: Simulation and Experimental Results (I) |
|
Han, Jihun | Argonne National Laboratory |
Shen, Daliang | Argonne National Laboratory |
Jeong, Jongryeol | Argonne National Laboratory |
Di Russo, Miriam | Argonne National Laboratory |
Kim, Namdoo | Argonne National Laboratory |
Karbowski, Dominik | Argonne National Laboratory |
Rousseau, Aymeric | Argonne National Laboratory |
Stutenberg, Kevin | Argonne National Laboratory |
Keywords: Autonomous vehicles, Automotive control, Optimal control
Abstract: Vehicle-to-infrastructure (V2I) communication connects vehicles and enables collision-free and energy-efficient driving, such as eco-approaches and departures at signalized intersections. An increased connectivity range can connect multiple signalized intersections and lead to long-term energy-efficient driving using richer information. However, no published studies to date provide insights into the energy-saving potential of increasing the connectivity range. In this paper, we present a V2I-enabled eco-driving control that can perform multiple traffic signal eco-approaches, and we systematically design a large-scale simulation study to quantify the energy impact of the increased V2I range for various scenarios. Simulation results show that the V2I-enabled eco-driving control can reduce energy use by up to 40%, on average, compared to the baseline, depending on road attributes and vehicle powertrain type. We validate these findings by evaluating the controller through a vehicle-in-the-loop (VIL) testing platform.
|
|
WeA13 Regular Session, Aqua Salon D |
Add to My Program |
Distributed Control I |
|
|
Chair: Dogan, Kadriye Merve | Embry-Riddle Aeronautical University |
Co-Chair: Loria, Antonio | CNRS |
|
10:00-10:15, Paper WeA13.1 | Add to My Program |
An Agent-Related Asynchronous Consensus Method for Fast Scheduling of UAV Swarm |
|
Chen, Runfeng | National University of Defense Technology |
Li, Jie | National University of Defense Technology |
Chen, Yiting | National University of Defense Technology |
Huang, Yuchong | National University of Defense Technology |
Keywords: Distributed control, Agents-based systems, Autonomous robots
Abstract: UAV swarm needs careful task and time arrangement to complete complex tasks with spatiotemporal constraints such as search and rescue, which requires distributed scheduling methods. Market-based approach is the optimal choice to satisfy the centerless and self-organized characteristics of the swarm. However, the consensus methods adopted by most market-based algorithms require synchronous communication, which takes time to wait, so scholars resort to asynchronous consensus methods. The existing asynchronous method's communication traffic increases with the number of tasks, and the total number of messages sent can be further reduced. Therefore, this paper proposes a new asynchronous method to further reduce the communication traffic and the number of messages sent required by the market-based approach. Firstly, the timestamp of when a UAV updates its information is used cleverly, which reduces the communication traffic of the original timestamp of when a task's information is updated. And the new timestamp contains more information about the non-winner that is absent in the original timestamp, which facilitates new asynchronous consensus rules to resolve task conflicts faster. Secondly, agent-related asynchronous consensus rules with new timestamps are designed to resolve task conflicts, which effectively solves the problem of information arriving out of order and potentially reduces the number of messages sent. Finally, through a self-developed ad-hoc network simulation system, the swarm scheduling under real networking conditions is simulated. A large number of Monte Carlo simulation experiments show that compared with the most representative asynchronous method Asynchronous Consensus-Based Bundle Algorithm (ACBBA), the number of messages sent and communication traffic can be reduced by 61% and 70% at most, and the scheduling time can be reduced by up to 60%.
|
|
10:15-10:30, Paper WeA13.2 | Add to My Program |
Localization and Tracking Control of Autonomous Vehicles in Time-Varying Bearing Formation |
|
Tang, Zhiqi | Instituto Superior Técnico, Universidade De Lisboa |
Loria, Antonio | CNRS |
Keywords: Distributed control, Agents-based systems, Time-varying systems
Abstract: This letter proposes an observer-based formation tracking control approach for multi-agent velocity-controlled vehicles under the assumption that either relative or global position measurements are unavailable for all the vehicles. It is assumed that only some vehicles (at least one) have access to their own global position, and all vehicles are equipped with sensors capable of sensing the bearings relative to neighboring vehicles. Each vehicle estimates its global position using relative bearing measurements and estimates of neighboring vehicles received over a communications network. Then, a distributed output-feedback observer-based controller is designed relying on bearing measurements and the estimated global positions. In contrast with the literature on bearing-based localization and control, we relax the common assumption of so-called bearing rigidity, and, in addition, we do not assume that the interconnections are constant. To the best of our knowledge, the bearing-based localization-and-tracking control problem under such assumptions remains open. In support of our theoretical findings, some simulation results are presented to illustrate the performance of the proposed observer-based tracking controllers.
|
|
10:30-10:45, Paper WeA13.3 | Add to My Program |
On the Equivalence of Multi-Agent 2D Coverage Control and Leader-Follower Consensus Network |
|
Xu, Xiaotian | University of Maryland College Park |
Davydov, Alexander | University of California, Santa Barbara |
Diaz-Mercado, Yancy | University of Maryland |
Keywords: Distributed control, Autonomous systems, Networked control systems
Abstract: Coverage control algorithms seek to spatially distribute agents in a domain of coverage, e.g., to minimize proximity to all points. Leader-follower consensus network algorithms use local coordination rules to influence the behavior of a multi-agent system (MAS) as a whole through explicit control of a subset of agents (called leaders) and neighbor interactions. In this paper, the equivalence of these two classes of distributed algorithms for swarm robotics, that were once considered inherently different, is established. We present a swarm robotics application, where the real agents (i.e., the robots) in the domain of coverage are followers; and virtual agents (i.e., the leaders) are introduced based on the domain of coverage. The dynamics of followers are shown to be in the form of a weighted, state-dependent consensus protocol and the dynamics of the leaders (dependent on the evolution of the domain) are provided. Formulating a standard coverage algorithm (i.e., Lloyd's algorithm) over 2D polygonal domains as a leader-follower consensus protocol makes the structure of the ensemble-level dynamics for the MAS explicit with respect to neighbor interaction. The resultant weighted graph Laplacian may contribute to the future investigation on the performance guarantees of a MAS tracking a time-varying domain. The equivalence of the two classes of algorithms is validated in simulation.
|
|
10:45-11:00, Paper WeA13.4 | Add to My Program |
Exponential Bipartite Containment Tracking Over Multi-Leader Coopetition Networks |
|
Sekercioglu, Pelin | ONERA, Univ Paris-Saclay |
Panteley, Elena | CNRS |
Sarras, Ioannis | ONERA |
Loria, Antonio | CNRS |
Marzat, Julien | ONERA - the French Aerospace Lab |
Keywords: Distributed control, Control of networks, Lyapunov methods
Abstract: This paper addresses the distributed bipartite containment tracking-control problem for autonomous vehicles steered by multiple leaders. Some leaders are cooperative and others are competitive, so the vehicles form a so-called coopetition network; in which the interaction links may be negative or positive. The presence of cooperative and antagonistic leaders does not enable the system to achieve consensus. Instead, the followers’ states converge to a residual compact set, not predefined, but depending only by the leaders’ states. We establish global exponential stability for this so-called bipartite containment set, and we compute the exact equilibria to which all agents converge inside of it. Our proofs are constructive, that is, we provide strict Lyapunov functions, which also allow us to establish robustness with respect to external disturbances. Numerical simulations illustrate our theoretical findings.
|
|
11:00-11:15, Paper WeA13.5 | Add to My Program |
Distributed Nash Equilibrium Seeking for Single-Integrator Dynamics Subject to Disturbances with Unknown Bounds |
|
He, Xiongnan | Chinese University of Hong Kong |
Huang, Jie | The Chinese University of Hong Kong |
Keywords: Distributed control, Cooperative control, Adaptive control
Abstract: In this paper, we study the problem of Nash equilibrium seeking of N-player games for single integrator dynamics subject to bounded disturbances with unknown bounds. Compared with the existing results, two new features are worth mentioning. First, the communication network among players is jointly strongly connected, which can be disconnected at every time instant. Second, the class of the disturbances contains any bounded time function with the bounds unknown. To achieve our objective, we have proposed a novel approach by integrating the distributed estimator, some nonlinear control technique, and adaptive control technique. Our design is illustrated by the example of a group of velocity-actuated robots in sensor networks.
|
|
11:15-11:30, Paper WeA13.6 | Add to My Program |
An Observer-Based Distributed Adaptive Control Algorithm for Coordination of Multiagent Systems in the Presence of Coupled Dynamics |
|
Aly, Islam | Embry-Riddle Aeronautical University |
Kurttisi, Atahan | Embry-Riddle Aeronautical University |
Dogan, Kadriye Merve | Embry-Riddle Aeronautical University |
Keywords: Distributed control, Cooperative control, Adaptive control
Abstract: In this paper, a distributed adaptive control algorithm is designed for an uncertain multiagent system in the presence of unmeasurable coupled dynamics that adopts user-assigned Laplacian matrix nullspaces. Specifically, we use observer dynamics that help us to guarantee the overall system stability, low-frequency learning methods to deal with high-frequency learning, and a modified Laplacian matrix to coordinate the multiagent system. Our algorithm proposes the coordination of multiagent systems and an asymptotic decoupling approach. An illustrative numerical example is given to demonstrate our theoretical contributions.
|
|
WeA14 Invited Session, Aqua 311A |
Add to My Program |
Predictive Control and Planning Methods for Robotic Systems |
|
|
Chair: Zhang, Fumin | Georgia Institute of Technology |
Co-Chair: Hou, Mengxue | Purdue University |
Organizer: Hou, Mengxue | Purdue University |
Organizer: Zhang, Fumin | Georgia Institute of Technology |
Organizer: Mou, Shaoshuai | Purdue University |
Organizer: Sundaram, Shreyas | Purdue University |
|
10:00-10:15, Paper WeA14.1 | Add to My Program |
Integrated Task and Motion Planning for Process-Aware Source Seeking (I) |
|
Li, Yingke | Georgia Institute of Technology |
Hou, Mengxue | Purdue University |
Zhou, Enlu | Georgia Institute of Technology |
Zhang, Fumin | Georgia Institute of Technology |
Keywords: Robotics, Autonomous robots, Estimation
Abstract: The process-aware source seeking (PASS) problem in flow fields aims to find an informative trajectory to reach an unknown source location while taking the energy consumption in the flow fields into consideration. Taking advantage of the existing methods on flow field partition, this paper formulates this problem as a task and motion planning (TAMP) problem and proposes a bi-level hierarchical planning framework to decouple the planning of inter-region transition and inner-region trajectory by introducing inter-region junctions. An integrated strategy is utilized to enable efficient upper-level planning by investigating the optimal solution of the lower-level planner. The proposed algorithm provides guaranteed convergence of the trajectory, and achieves automatic trade-off between exploration and exploitation, which has been validated by the simulation results.
|
|
10:15-10:30, Paper WeA14.2 | Add to My Program |
Variable Sampling MPC Via Differentiable Time-Warping Function (I) |
|
Lu, Zehui | Purdue University |
Mou, Shaoshuai | Purdue University |
Keywords: Predictive control for nonlinear systems, Optimal control, Energy systems
Abstract: Designing control inputs for a system that involves dynamical responses in multiple timescales is nontrivial. This paper proposes a parameterized time-warping function to enable a non-uniformly sampling along a prediction horizon given some parameters. The horizon should capture the responses under faster dynamics in the near future and preview the impact from slower dynamics in the distant future. Then a variable sampling MPC (VS-MPC) strategy is proposed to jointly determine optimal control and sampling parameters at each timestamp. VS-MPC adapts how it samples along the horizon and determines optimal control accordingly at each timestamp without offline tuning or trial and error. A numerical example of a wind farm battery energy storage system is also provided to demonstrate that VS-MPC outperforms the uniform sampling MPC.
|
|
10:30-10:45, Paper WeA14.3 | Add to My Program |
An Interleaved Algorithm for Integration of Robotic Task and Motion Planning (I) |
|
Hou, Mengxue | Purdue University |
Li, Yingke | Georgia Institute of Technology |
Zhang, Fumin | Georgia Institute of Technology |
Sundaram, Shreyas | Purdue University |
Mou, Shaoshuai | Purdue University |
Keywords: Robotics, Autonomous robots, Predictive control for nonlinear systems
Abstract: We propose an interleaved method for robotic task and motion planning (TAMP) problems, which involves optimizing both continuous and discrete dynamic behaviors. The coupling between the task planning and motion planning results in a large search space, causing challenge for computing the optimal solution. To address this challenge, we develop a novel bi-level algorithm leveraging the Depth First Search (DFS) algorithm and the Monte Carlo Tree Search (MCTS) algorithm to solve the TAMP. Incorporating task completion cost estimation from the motion planning level, we solve the task planning problem in a computationally efficient manner. We prove that our proposed TAMP algorithm is complete, i.e., it always finds the optimal solution if there exists one. Finally, we present simulation results to demonstrate that the proposed algorithm can find the optimal solution of the TAMP problem with lower computation cost than existing algorithms.
|
|
10:45-11:00, Paper WeA14.4 | Add to My Program |
Poisoning Attacks against Data-Driven Predictive Control (I) |
|
Yu, Yue | The University of Texas at Austin |
Zhao, Ruihan | UT Austin |
Chinchali, Sandeep | UT Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Predictive control for nonlinear systems, Optimization, Machine learning
Abstract: Data-driven predictive control (DPC) is a feedback control method for systems with unknown dynamics. It repeatedly optimizes a system's future trajectories based on past input-output data. We develop a numerical method that computes poisoning attacks that inject additive perturbations to the online output data to change the trajectories optimized by DPC. This method is based on implicitly differentiating the solution map of the trajectory optimization in DPC. We demonstrate that the resulting attacks can cause an output tracking error one order of magnitude higher than random perturbations in numerical experiments.
|
|
11:00-11:15, Paper WeA14.5 | Add to My Program |
Motion Tomography Performed by Robotic Fish with Active Heading Control (I) |
|
Zuo, Wenyu | University of Houston |
Zhang, Fumin | Georgia Institute of Technology |
Chen, Zheng | University of Houston |
Keywords: Biologically-inspired methods, Identification, Robotics
Abstract: Water flow plays an important role in the operation of marine robotic vehicles (MRVs). In confined survey areas, accurate perception of the flow field can greatly assist MRVs in path planning and improve energy efficiency. Traditional flow observations rely on data from buoys and satellites, which is expensive and time-consuming. Therefore, predicting the flow field using the vehicle's position and velocity information can significantly enhance work efficiency. Motion tomography (MT) is a time-efficient and convenient technique that estimates the flow field by separating the movement caused by the flow field from the trajectory of the MRV. Additionally, robotic fish are ideal for this flow field sensing task due to their maneuverability and versatility. Using robotic fish to sense the flow field can greatly benefit transportation and environment studies. However, MT ignores the rigid-body dynamics, which can significantly undermine the estimation accuracy. To address this challenge, we add an active heading control (AHC) to moderate the passive heading change caused by the flow field. With AHC's help, the position and direction data collected from a robotic fish can provide a promising estimation of the flow field in both simulations and experiments.
|
|
11:15-11:30, Paper WeA14.6 | Add to My Program |
UrbanFly: Uncertainty-Aware Planning for Navigation Amongst High-Rises with Monocular Visual-Inertial SLAM Maps |
|
S Harithas, Sudarshan | IIIT Hyderabad |
Thatavarthy, Ayyappa Swamy | Robotics Research Center, IIIT Hyderabad |
Singh, Gurkirat | The International Institute of Information Technology - Hyderaba |
Singh, Arun Kumar | University of Tartu |
Krishna, K. Madhava | IIIT-Hyderabad |
Keywords: Autonomous robots, Robotics, Uncertain systems
Abstract: We present UrbanFly: an uncertainty-aware real-time planning framework for quadrotor navigation in urban high-rise environments. A core aspect of UrbanFly is its ability to robustly plan directly on the sparse point clouds generated by a Monocular Visual Inertial SLAM (VI-SLAM) backend. It achieves this by using the sparse point clouds to build an uncertainty-integrated cuboid representation of the environment through a data-driven monocular plane segmentation network. Our chosen world model provides faster distance queries than the more common voxel-grid representation. UrbanFly leverages this capability in two different ways leading to two trajectory optimizers. The first optimizer uses the gradient-free cross-entropy method to compute trajectories that minimize collision probability and smoothness cost. Our second optimizer is a a simplified version of the first and uses a sequential convex programming optimizer initialized based on probabilistic safety estimates on a set of randomly drawn trajectories. Both our trajectory optimizers are made computationally tractable and independent of the nature of underlying uncertainty by embedding the distribution of collision violations in Reproducing Kernel Hilbert Space. Empowered by the algorithmic innovation, UrbanFly outperforms competing baselines in metrics such as collision rate, trajectory length, etc., on a high-fidelity AirSim simulator augmented with synthetic and real-world dataset scenes.
|
|
WeA15 Invited Session, Aqua 311B |
Add to My Program |
Enery Efficiency in Smart Buildings and Cities |
|
|
Chair: Jones, Colin N. | EPFL |
Co-Chair: Stockar, Stephanie | The Ohio State University |
Organizer: Stockar, Stephanie | The Ohio State University |
Organizer: Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Organizer: Jones, Colin N. | EPFL |
|
10:00-10:15, Paper WeA15.1 | Add to My Program |
A General Purpose Real-Time Optimization Strategy Applied to Minimizing Simultaneous Heating and Cooling in Buildings (I) |
|
Salsbury, Timothy | PNNL |
Terrill, Trevor | Texas A&M University |
Goddard, James | Battelle |
Yu, Min Gyung | Pacific Northwest National Laboratoy |
Yoder, Tim | Pacific Northwest National Laboratory |
Duan, Xiaoli | Pacific Northwest National Laboratory |
Keywords: Building and facility automation, Optimization algorithms, Decentralized control
Abstract: Supervisory control strategies in heating, ventilating, and air-conditioning (HVAC) systems hold the potential for significant energy savings but they are difficult to set up and often depend on unreliable sensor measurements. In this paper, we propose a sensor-free approach to minimizing the energy penalties due to simultaneous heating and cooling in variable-air-volume (VAV) systems with reheat. A general-purpose real-time optimizer is described that is easy to set-up and integrate with existing control logic. A cost function is derived that quantifies the simultaneous heating and cooling penalty that does not require any sensor measurements. The optimizer then minimizes this cost function by automatically adjusting the supply temperature setpoint in the air-handling unit. The paper describes the approach and shows results from application from field trials on a 27-zone building.
|
|
10:15-10:30, Paper WeA15.2 | Add to My Program |
Two-Level Decentralized-Centralized Control of Distributed Energy Resources in Grid-Interactive Efficient Buildings (I) |
|
Huo, Xiang | University of Utah |
Dong, Jin | Oak Ridge National Laboratory |
Cui, Borui | Oak Ridge National Lab |
Liu, Boming | Oak Ridge National Laboratory |
Lian, Jianming | Oak Ridge National Laboratory |
Liu, Mingxi | University of Utah |
Keywords: Smart cities/houses, Decentralized control, Smart grid
Abstract: The flexible, efficient, and reliable operation of grid-interactive efficient buildings (GEBs) is increasingly impacted by the growing penetration of distributed energy resources (DERs). Besides, the optimization and control of DERs, buildings, and distribution networks are further complicated by their interconnections. In this paper, we exploit the load-side flexibility and clean energy resources to develop a novel two-level hybrid decentralized-centralized (HDC) algorithm for the coordination and control of DER-connected GEBs. The proposed HDC 1) achieves scalability w.r.t. to a large number of grid-connected buildings and devices, 2) incorporates a two-level design where an aggregator centrally controls at the building level and the system operator coordinates at the distribution network level in a decentralized way, and 3) improves the computing efficiency and communicating resilience with heterogeneous temporal scales. Simulations are conducted based on the prototype of a campus building at the Oak Ridge National Laboratory to show the efficiency and efficacy of the proposed approach.
|
|
10:30-10:45, Paper WeA15.3 | Add to My Program |
LSR-BO: Local Search Region Constrained Bayesian Optimization for Performance Optimization of Vapor Compression Systems (I) |
|
Paulson, Joel | The Ohio State University |
Sorourifar, Farshud | Ohio State University |
Laughman, Christopher R. | Mitsubishi Electric Research Labs |
Chakrabarty, Ankush | Mitsubishi Electric Research Laboratories (MERL) |
Keywords: Machine learning, Control applications, Optimization algorithms
Abstract: Bayesian optimization (BO) has recently been demonstrated as a powerful tool for efficient derivative-free optimization of expensive black-box functions, such as those prevalent in performance optimization of complex energy systems. Classical BO algorithms ignore the relationship between consecutive optimizer candidates, resulting in jumps in the admissible search space which can lead to fail-safe mechanisms being triggered, or undesired transient dynamics that violate operational constraints. In this paper, we propose LSR-BO, a novel global optimization methodology that enforces local search region (LSR) constraints by design, which restricts how much the optimizer candidate can be changed at every iteration. We demonstrate that naively incorporating LSR constraints into BO causes the algorithm to get stuck in local sub-optimal solutions, and overcome this challenge through the development a novel exploration strategy that can gracefully navigate the tradeoff between short-term local, and long-term global, performance improvement. Furthermore, we provide theoretical guarantees on the convergence of LSR-BO. Finally, we verify the effectiveness of our proposed LSR-BO method on an illustrative benchmark and a real-world energy minimization problem for a commercial vapor compression system.
|
|
10:45-11:00, Paper WeA15.4 | Add to My Program |
Uncertainty-Aware Flexibility Envelope Prediction in Buildings with Controller-Agnostic Battery Models (I) |
|
Scharnhorst, Paul | EPFL |
Schubnel, Baptiste | CSEM S.A |
Carrillo, Rafael Eduardo | CSEM S.A |
Alet, Pierre-Jean | CSEM S.A |
Jones, Colin N. | EPFL |
Keywords: Identification for control, Stochastic systems, Smart grid
Abstract: Buildings are a promising source of flexibility for the application of demand response. In this work, we introduce a novel battery model formulation to capture the state evolution of a single building. Being fully data-driven, the battery model identification requires one dataset from a period of nominal controller operation, and one from a period with relative flexibility requests, without making any assumptions on the underlying, but fixed, controller structure. We consider parameter uncertainty in the model formulation and show how to use risk measures to encode risk preferences of the user in robust uncertainty sets. Finally, we demonstrate the uncertainty-aware prediction of flexibility envelopes for a building simulation model from the Python library Energym.
|
|
11:00-11:15, Paper WeA15.5 | Add to My Program |
Thermal Comfort Control on Sustainable Building Via Data-Driven Robust Model Predictive Control |
|
Chen, Wei-Han | Cornell University |
Yang, Shiyu | Cornell University |
You, Fengqi | Cornell University |
Keywords: Process Control, Modeling, Machine learning
Abstract: While implementing renewable energy systems and model predictive control (MPC) could reduce non-renewable energy consumption, one challenge to building climate control using MPC is the weather forecast uncertainty. In this work, we propose a data-driven robust model predictive control (DDRMPC) framework to address climate control of a sustainable building with renewable hybrid energy systems under weather forecast uncertainty. The control and energy system configurations include heating, ventilation, and air conditioning, geothermal heat pump, photovoltaic panel, and electricity storage battery. Historical weather forecast and measurement data are gathered from the weather station to identify the forecast errors and for the use of uncertainty set construction. The data-driven uncertainty sets are constructed with multiple machine learning techniques, including principal component analysis with kernel density estimation, K-means clustering coupled with PCA and KDE, density-based spatial clustering of applications with noise, and the Dirichlet process mixture model. Lastly, a data-driven robust optimization problem is developed to obtain the optimal control inputs for a building with renewable energy systems. A case study on controlling a building with renewable energy systems located on the Cornell University campus is used to demonstrate the advantages of the proposed DDRMPC framework.
|
|
11:15-11:30, Paper WeA15.6 | Add to My Program |
Mixed-Integer Real-Time Control of a Building Energy Supply System |
|
Burda, Artyom | Hannover University of Applied Sciences and Arts |
Bitner, Dimitri | Hannover University of Applied Sciences and Arts |
Bestehorn, Felix | Technical University of Braunschweig |
Kirches, Christian | Technical University of Braunschweig |
Grotjahn, Martin | Hannover University of Applied Science and Arts, Faculty of Mech |
Keywords: Predictive control for nonlinear systems, Energy systems, Control applications
Abstract: We present a methodology based on mixed-integer nonlinear model predictive control for a real-time building energy management system in application to a single-family house with a combined heat and power (CHP) unit. The developed strategy successfully deals with the switching behavior of the system components as well as minimum admissible operating time constraints by use of a special switch-cost-aware rounding procedure. The quality of the presented solution is evaluated in comparison to the globally optimal dynamic programming method and conventional rule-based control strategy. Based on a real-world scenario, we show that our approach is more than real-time capable while maintaining high correspondence with the globally optimal solution. We achieve an average optimality gap of 2.5% compared to 20% for a conventional control approach, and are faster and more scalable than a dynamic programming approach.
|
|
WeA16 Invited Session, Aqua 313 |
Add to My Program |
Learning and Stochastic Optimal Control |
|
|
Chair: Halder, Abhishek | University of California, Santa Cruz |
Co-Chair: Mesbah, Ali | University of California, Berkeley |
Organizer: Halder, Abhishek | University of California, Santa Cruz |
Organizer: Mesbah, Ali | University of California, Berkeley |
|
10:00-10:15, Paper WeA16.1 | Add to My Program |
Stochastic Model Predictive Control Utilizing Bayesian Neural Networks (I) |
|
Pohlodek, Johannes | TU Darmstadt |
Alsmeier, Hendrik | TU Darmstadt |
Morabito, Bruno | OVG University Magdeburg |
Savchenko, Anton | Technical University of Darmstadt |
Findeisen, Rolf | TU Darmstadt |
Keywords: Process Control, Predictive control for nonlinear systems, Chemical process control
Abstract: Integrating measurements and historical data can enhance control systems through learning-based techniques, but ensuring performance and safety is challenging. Robust model predictive control strategies, like stochastic model predictive control, can address this by accounting for uncertainty. Gaussian processes are often used but have limitations with larger models and data sets. We explore Bayesian neural networks for stochastic learning-assisted control, comparing their performance to Gaussian processes on a wastewater treatment plant model. Results show Bayesian neural networks achieve similar performance, highlighting their potential as an alternative for control designs, particularly when handling extensive data sets.
|
|
10:15-10:30, Paper WeA16.2 | Add to My Program |
A Physics-Informed Deep Learning Approach for Minimum Effort Stochastic Control of Colloidal Self-Assembly (I) |
|
Nodozi, Iman | University of California, Santa Cruz |
O'Leary, Jared | University of California, Berkeley |
Mesbah, Ali | University of California, Berkeley |
Halder, Abhishek | University of California, Santa Cruz |
Keywords: Stochastic optimal control, Uncertain systems, Machine learning
Abstract: We propose formulating the finite-horizon stochastic optimal control problem for colloidal self-assembly in the space of probability density functions (PDFs) of the underlying state variables (namely, order parameters). The control objective is formulated in terms of steering the state PDFs from a prescribed initial probability measure towards a prescribed terminal probability measure with minimum control effort. For specificity, we use a univariate stochastic state model from the literature. Both the analysis and the computational steps for control synthesis as developed in this paper generalize for multivariate stochastic state dynamics given by generic nonlinear in state and non-affine in control models. We derive the conditions of optimality for the associated optimal control problem. This derivation yields a system of three coupled partial differential equations together with the boundary conditions at the initial and terminal times. The resulting system is a generalized instance of the so-called Schr"{o}dinger bridge problem. We then determine the optimal control policy by training a physics-informed deep neural network, where the "physics" are the derived conditions of optimality. The performance of the proposed solution is demonstrated via numerical simulations on a benchmark colloidal self-assembly problem.
|
|
10:30-10:45, Paper WeA16.3 | Add to My Program |
Minimal Entropy Production in Anisotropic Temperature Fields (I) |
|
Movilla Miangolarra, Olga | University of Calfornia, Irvine |
Taghvaei, Amirhossein | University of Washington Seattle |
Georgiou, Tryphon T. | University of California, Irvine |
Keywords: Optimal control, Stochastic optimal control, Control applications
Abstract: Anisotropy of temperature fields, chemical potentials and ion concentration gradients provide the fuel that feeds dynamical processes that sustain life. Dynamical flows in respective environments incur losses manifested as entropy production. In this work we consider a rudimentary model of an overdamped stochastic thermodynamic system in an anisotropic temperature heat bath, and analyze the problem to minimize entropy production while driving the system between thermodynamic states in finite time. It is noted that entropy production in a fully isotropic temperature field, can be expressed as the Wasserstein-2 length of the path traversed by the thermodynamic state of the system. In the presence of an anisotropic temperature field, the mechanism of entropy production is substantially more complicated as, besides dissipation, it entails seepage of energy between the ambient heat sources by way of the system dynamics. We show that, in this case, the entropy production can be expressed as the solution of a suitably constrained and generalized Optimal Mass Transport (OMT) problem. In contrast to the situation in standard OMT, entropy production may not be identically zero, even when the thermodynamic state remains unchanged. Physically, this is due to the fact that maintaining a Non-Equilibrium Steady State (NESS), incurs an intrinsic entropic cost. As already noted, NESSs are the hallmark of life and living systems by necessity operate away from equilibrium. Thus our problem of minimizing entropy production appears of central importance in understanding biological processes, such as molecular motors and motor proteins, and on how such processes may have evolved to optimize for available usage of resources.
|
|
10:45-11:00, Paper WeA16.4 | Add to My Program |
Duality-Based Stochastic Policy Optimization for Estimation with Unknown Noise Covariances (I) |
|
Talebi, Shahriar | University of Washington |
Taghvaei, Amirhossein | University of Washington Seattle |
Mesbahi, Mehran | University of Washington |
Keywords: Machine learning, Estimation, Iterative learning control
Abstract: Duality of control and estimation allows mapping recent advances in data-guided control to the estimation setup. This paper formalizes and utilizes such a mapping to consider learning the optimal (steady-state) Kalman gain when process and measurement noise statistics are unknown. Specifically, building on the duality between synthesizing optimal control and estimation gains, the filter design problem is formalized as direct policy learning. In this direction, the duality is used to extend existing theoretical guarantees of direct policy updates for Linear Quadratic Regulator (LQR) to establish global convergence of the Gradient Descent (GD) algorithm for the estimation problem–while addressing subtle differences between the two synthesis problems. Subsequently, a Stochastic Gradient Descent (SGD) approach is adopted to learn the optimal Kalman gain without the knowledge of noise covariances. The results are illustrated via several numerical examples.
|
|
11:00-11:15, Paper WeA16.5 | Add to My Program |
An Optimal Control Approach to Particle Filtering on Lie Groups (I) |
|
Yuan, Bo | Georgia Institute of Technology |
Zhang, Qinsheng | Georgia Institution of Technology |
Chen, Yongxin | Georgia Institute of Technology |
Keywords: Filtering, Stochastic optimal control
Abstract: We study the filtering problem over a Lie group that plays an important role in robotics and aerospace applications. We present a new particle filtering algorithm based on stochastic control. In particular, our algorithm is based on a duality between smoothing and optimal control. Leveraging this duality, we reformulate the smoothing problem into an optimal control problem, and by approximately solving it (using, e.g., iLQR) we establish a superior proposal for particle smoothing. Combining it with a suitable designed sliding window mechanism, we obtain a particle filtering algorithm that suffers less from sample degeneracy compared with existing methods. Finally, we exemplify our algorithm in a filtering problem over SO(3) for satellite attitude estimation.
|
|
11:15-11:30, Paper WeA16.6 | Add to My Program |
Constrained Policy Optimization for Stochastic Optimal Control under Nonstationary Uncertainties (I) |
|
Shin, Sungho | Argonne National Laboratory |
Pacaud, Francois | Argonne National Laboratory |
Constantinescu, Emil Mihai | Argonne National Laboratory |
Anitescu, Mihai | Argonne National Laboratory |
Keywords: Stochastic optimal control, Optimization algorithms, Learning
Abstract: This article presents a constrained policy optimization approach for the optimal control of systems under nonstationary uncertainties. We introduce an assumption that we call Markov embeddability that allows us to cast the stochastic optimal control problem as a policy optimization problem over the augmented state space. Then, the infinite-dimensional policy optimization problem is approximated as a finite-dimensional nonlinear program by applying function approximation, deterministic sampling, and temporal truncation. The approximated problem is solved by using automatic differentiation and condensed-space interior-point methods. We formulate several conceptual and practical open questions regarding the asymptotic exactness of the approximation and the solution strategies for the approximated problem. As proof of concept, we present numerical examples demonstrating the performance of the proposed method.
|
|
WeA17 Tutorial Session, Aqua 314 |
Add to My Program |
Combining Physics and Machine Learning Methods to Accelerate Innovation in
Sustainability: A Control Perspective |
|
|
Chair: Onori, Simona | Stanford Univeristy |
Co-Chair: Pozzato, Gabriele | Stanford University |
Organizer: Onori, Simona | Stanford Univeristy |
|
10:00-10:30, Paper WeA17.1 | Add to My Program |
Combining Physics-Based and Machine Learning Methods to Accelerate Innovation in Sustainable Transportation and Beyond: A Control Perspective (I) |
|
Pozzato, Gabriele | Stanford University |
Onori, Simona | Stanford Univeristy |
Keywords: Modeling, Energy systems, Machine learning
Abstract: Lithium-ion batteries are playing a key role in the sustainable energy transition. To fully exploit the potential of this technology, a variety of modeling, estimation, and prediction problems need to be addressed to enhance its design and optimize its utilization. Batteries are complex electrochemical systems whose behavior drastically changes as a function of aging, temperature, C-rate, and state of charge, posing unique modeling and control research questions. In this tutorial paper, we provide insights into three battery modeling methodologies, namely first principle, machine learning, and hybrid modeling. Each approach has its own strengths and weaknesses, and by means of three case studies we describe main characteristics and challenges of each of the three methods.
|
|
10:30-10:45, Paper WeA17.2 | Add to My Program |
Machine Learning in Lithium-Sulfur Battery Modeling and Control: Key Challenges and Opportunities (I) |
|
Fathy, Hosam K. | University of Maryland |
Keywords: Automotive control
Abstract: This talk is motivated by the potential of both solid-state and liquid electrolyte lithium-sulfur batteries to provide significant performance advantages over state-of-the-art lithium-ion batteries, especially in terms of specific energy. The talk provides a brief introduction to the lithium-sulfur chemistry, focusing on the modeling, estimation, and control challenges associated with this chemistry. The talk then surveys some of the key challenges and opportunities associated with the application of machine learning methods, especially the deep learning of battery dynamics, to this chemistry. Areas of overlap and potential synergy between machine learning and physics-based modeling approaches are particularly emphasized in this discussion.
|
|
10:45-11:00, Paper WeA17.3 | Add to My Program |
Physics-Constrained Learning with Application to Fuel Cells and Batteries (I) |
|
Siegel, Jason B. | University of Michigan |
|
11:00-11:15, Paper WeA17.4 | Add to My Program |
What Is the Meaning of (battery) Life? (I) |
|
Paxton, William | VW |
|
11:15-11:30, Paper WeA17.5 | Add to My Program |
Integrating Physics and Machine Learning for Battery Management in the Cloud (I) |
|
Li, Weihan | RWTH Aachen University |
|
WeB01 RI Session, Sapphire MN |
Add to My Program |
Advances in Mechatronics (RI) |
|
|
Chair: Ho, Duc Tho | Nagaoka University of Technology |
Co-Chair: Chen, Yan | Arizona State University |
|
14:00-14:04, Paper WeB01.1 | Add to My Program |
Development and Application of a Novel High-Order Fully Actuated System Approach: Part I. 3-DOF Quadrotor Control (I) |
|
Lu, Shi | Arizona State University |
Tsakalis, Kostas | Arizona State Univ |
Chen, Yan | Arizona State University |
Keywords: Mechatronics
Abstract: The quadrotor hierarchical control design (position-attitude) based on the state-space modeling has been widely applied in the past. Although the state-space representation, based on a group of first-order differential equations, is effective in modeling many dynamic systems, inherent high-order dynamics and control of quadrotor systems may not be properly handled by the state-space modeling. This letter proposes a modified high-order fully actuated (HOFA) theory for a group of high-order dynamic systems, including the quadrotor system, without relying on pseudo strict-feedback forms required by the original HOFA approach. Hence, the quadrotor model can be essentially converted into two HOFA subsystems. A nonlinear 3-DOF quadrotor modeling and control is applied as an example to demonstrate the effectiveness of the proposed approach, which can achieve arbitrarily assignable eigenstructure like a stabilized linear system.
|
|
14:04-14:08, Paper WeB01.2 | Add to My Program |
Development and Application of a Novel High-Order Fully Actuated System Approach: Part II. 6-DOF Quadrotor Control (I) |
|
Lu, Shi | Arizona State University |
Tsakalis, Kostas | Arizona State Univ |
Chen, Yan | Arizona State University |
Keywords: Modeling, Flight control, Linear systems
Abstract: Traditional cascade controller design based on state-space modeling has been widely applied for quadrotor systems. The state-space representation can effectively model many dynamic systems. Yet, a group of first-order differential equations may not be the most suitable way to model and control inherent high-order dynamic systems, such as quadrotors. This paper proposes a modified high-order fully actuated (HOFA) approach with recursive actions based on a mixed-order quadrotor model. Unlike the existing HOFA approach developed for generalized strict-feedback systems, the modified HOFA method does not require the system in a strict-feedback form. Hence, the 6-DOF quadrotor model can be essentially converted into three HOFA subsystems. Based on the obtained HOFA systems, the control design of 6-DOF nonlinear quadrotor systems can be readily achieved like linear systems with arbitrarily assignable eigenstructure. Simulation and experimental results are shown to verify the effectiveness of the proposed HOFA modeling and control approach.
|
|
14:08-14:12, Paper WeB01.3 | Add to My Program |
Input Shaping Control of an Overhead Crane with Time-Varying Cable Length Using a Generalized Input Shaper |
|
Ho, Duc Tho | Nagaoka University of Technology |
Terashima, Kazuhiko | Toyohashi Univ. of Tech |
Miyoshi, Takanori | Nagaoka Univ. of Tech |
Keywords: Mechatronics, Control applications, Mechanical systems/robotics
Abstract: In this brief, a new input shaping-based control law is proposed for an overhead crane with time-varying cable length. The proposed approach can accommodate a generalized input shaper and general hoisting motions. It will be shown that a non-robust input shaper can be used within the established scheme to completely suppress residual vibration at the completion of a maneuver, outperforming the standard (robust) input shaping controllers. Simulation results are provided.
|
|
14:12-14:16, Paper WeB01.4 | Add to My Program |
Sequence-To-Sequence LSTM-Based Dynamic System Identification of Piezo-Electric Actuators |
|
Yin, Ruocheng | Iowa State University |
Ren, Juan | Iowa State University |
Keywords: Mechatronics, Identification, Machine learning
Abstract: During the past few year, recurrent neural network (RNN) has been proposed to model the nonlinear dynamics of various dynamic systems, such as nano positioning systems (e.g, piezo electric actuators (PEAs)). Although high modeling accuracy has been demonstrated using RNNs, it has been found that the conventional RNNs (such as vanilla RNN) are susceptible to gradient vanishing or exploding issue and hence difficult to train. Deep RNNs, such as Long short-term memory (LSTM), have been proposed to address these issues. However, due to the conventional training data construction, the training is susceptible to overfitting and the computation is extensive. In this paper, we propose a new type of LSTM in the application of PEA system identification: a sequence-to- sequence learning approach (namely, LSTMseq2seq). The structure of LSTMseq2seq and its training data construction are presented in detail. The efficacy of LSTMseq2seq in terms of modeling accuracy and computation speed is demonstrated by applying it for PEA system identification and comparing its performance with that of vanilla RNN.
|
|
14:16-14:20, Paper WeB01.5 | Add to My Program |
Feedforward Compensation of Scan-Induced Disturbances for a High-Precision Robotic 3D Measurement System |
|
Wertjanz, Daniel | TU Wien |
Kern, Thomas | TU Wien |
Csencsics, Ernst | Vienna University of Technology |
Schitter, Georg | Vienna University of Technology |
Keywords: Mechatronics, Mechanical systems/robotics, Closed-loop identification
Abstract: This paper proposes a feedforward compensation approach of scan-induced disturbances to improve the uncertainty of an active-sample tracking 3D measurement module. The measurement module acts as a robotic endeffector and is designed for precise robotic measurement applications directly in the vibration-prone environment of an industrial production line. By means of a feedback control-induced stiff link, a constant position of the electromagnetically levitated measurement platform (MP) is maintained with respect to the sample surface under test. Precise 3D imaging is enabled by scanning the measuring light spot of a 1D confocal chromatic sensor with a 2D fast steering mirror (FSM). Disturbances are caused due to scanning-induced reaction forces on the MP, impairing the system’s sample-tracking and 3D measurement performance. Based on the identified disturbance dynamics, a tailored feedforward control is designed to compensate the causing reaction forces. To experimentally evaluate the system performance, 3D measurements at the maximum frame of 1 fps are performed with disabled and enabled feedforward control. Evaluating the experimental results, the sample-tracking error in the MP’s translational degree of freedom z is significantly reduced by a factor of 7 down to 42 nm rms, being close to the MP’s static positioning noise. The reduced sample-tracking error further enables a higher 3D measurement performance, reducing the structural height uncertainty by 36% down to 180 nm rms.
|
|
14:20-14:24, Paper WeB01.6 | Add to My Program |
Mitigating Non-Linear DAC Glitches Using Dither in Closed-Loop Nano-Positioning Applications |
|
Faza, Ahmad M.A. | University of Stavanger |
Leth, John | Aalborg University |
Eielsen, Arnfinn A | University of Stavanger |
Keywords: Mechatronics, Modeling, Switched systems
Abstract: Digital-to-analog conversion is essential in digital signal processing applications, including closed-loop control schemes. Noise and distortion in digital-to-analog converters result in reduced performance for high-precision mechatronics such as nano-positioning. Glitches are common in practical switched systems such as digital-to-analog converters; observed as an output disturbance. Due to the wide-bandwidth, impulse-like behavior, control law bandwidth is generally too low to provide adequate attenuation; deteriorating open and closed-loop performance. This article demonstrates how large-amplitude high-frequency periodic dither mitigates the effect of glitches in a nano-positioning system under closed-loop control. Simulations are performed using a model that includes significant non-linearities with a response fitted to an off-the-shelf commercial device, as well as using standard linear time-invariant models for other system components fitted to the responses of common, commercially available devices. The results highlight the significance of reconstruction filter design when applying dithering in this setting.
|
|
14:24-14:28, Paper WeB01.7 | Add to My Program |
A Novel Control Design for High-Speed Atomic Force Microscopy |
|
Gorugantu, Ram Sai | ASML |
Salapaka, Srinivasa M. | University of Illinois |
Keywords: Mechatronics, Robust control, Control applications
Abstract: In this paper, we propose a novel transform-based imaging mode for Atomic Force Microscopy (AFM), where the cantilever oscillations are made to track the output of a mock-model system. The states of the resulting tracking error dynamics is appended by another set of suitably designed states, which enable a specific time-varying coordinate transformation, which in turn results in dynamic models that are very conducive to linear control synthesis design methods. The proposed imaging mode enables higher throughput in AFM imaging without the need for significantly high resonant frequency AFM cantilever probes. This method overcomes the fundamental limitation of nonlinear input-output relationship in Amplitude Modulation AFM (AM-AFM) imaging mode by applying an appropriately chosen real-time transform on the output signal. In combination with model-based reference generation, the proposed real-time transform yields linear dynamical input-output characteristics. Simulations on detailed AFM models with H ∞-optimal control designs show the efficacy of the proposed imaging mode for feature bandwidths higher than 5% of the cantilever resonant frequency.
|
|
14:28-14:32, Paper WeB01.8 | Add to My Program |
Point-To-Point Motion Trajectory Generation for Uncertain Systems: A Closed-Form Solution |
|
Al-Rawashdeh, Yazan Mohammad | Memorial University of Newfoundland |
Al Janaideh, Mohammad | University of Guelph |
Heertjes, Marcel | Eindhoven University of Technology |
Keywords: Mechatronics
Abstract: We present a point-to-point trajectory generation framework that helps in enhancing the positioning of systems having uncertain lightly-damped vibration modes. To avoid exciting these modes, their uncertainties are matched with frequency bands and then used to specify the rejected frequencies under a k-cascaded second-order notch filter. To facilitate trajectory generation, the impulse response of this filter is altered by composing it off-line with a certain polynomial function of even degree in the time domain followed by a transformation. This will result in a unit-pulse acceleration signal with attenuated frequency contents in the rejected frequency bands, and therefore a reduced excitation of the lightly-damped modes is achieved. The resulting off-line generated unit-pulse acceleration signal is used as a template to ease real-time trajectory extrapolation using simple calculations due to the closed-form solutions provided. Therefore, real-time computational overhead is significantly reduced. Comparisons between the proposed method and frequency-shaped polynomials, finite-impulse response and input shaping are provided. The effectiveness of the proposed framework is illustrated through a numeric simulation.
|
|
14:32-14:36, Paper WeB01.9 | Add to My Program |
Contactless Suspension of a Silicon Disk |
|
Pyle, Kenneth | University of California, Los Angeles |
M'Closkey, Robert | University of California, Los Angeles |
Keywords: MEMs and Nano systems, Mechatronics, Manufacturing systems
Abstract: A system to suspend a silicon disk between two sets of stator electrodes is reported. Electrode pairs are used for both control and sensing by exerting electrostatic forces on the disk and measuring differential capacitances related to the disk's position. The disk is a six degree-of-freedom system, however, lateral and yaw motion are not measurable by the electrode arrangement so only the disk's vertical position, roll, and pitch are regulated. Two separate control strategies are pursued --decentralized feedback around the electrode-disk gaps and feedback around a decoupled coordinate frame related to the disk's controllable degrees-of-freedom. Experimental frequency responses obtained from closed-loop results of the suspended disk are reported and compared to analytical models.
|
|
14:36-14:40, Paper WeB01.10 | Add to My Program |
Learning Object Manipulation with Under-Actuated Impulse Generator Arrays |
|
Kong, Chuizheng | Mitsubishi Electric Research Laboratories |
Yerazunis, William | Mitsubishi Electric Research |
Nikovski, Daniel | Mitsubishi Electric Research Labs |
Keywords: Stochastic optimal control, Mechatronics, Robotics
Abstract: For more than half a century, vibratory bowl feeders have een the standard in automated assembly for singulation, orientation, and manipulation of small parts. Unfortunately, these feeders are expensive, noisy, and highly specialized on single part design bases. We consider an alternative device and learning control method for singulation, orientation, and manipulation by means of seven fixed-position variable-energy solenoid impulse actuators located beneath a semi-rigid part supporting surface. Using computer vision to provide part pose information, we tested various machine learning (ML) algorithms to generate a control policy that selects the optimal actuator and actuation energy. Our manipulation test object is a 6-sided craps-style die. Using the most suitable ML algorithm, we were able to flip the die to any desired face 30.4% of the time with a single impulse, and 51.3% with two chosen impulses, versus a random policy succeeding 5.1%% of the time (that is, a randomly chosen impulse delivered by a randomly chosen solenoid).
|
|
WeB02 RI Session, Sapphire IJ |
Add to My Program |
Optimization (RI) |
|
|
Chair: Kim, Hunmin | Mercer University |
Co-Chair: Ghaffari, Azad | Wayne State University |
|
14:00-14:04, Paper WeB02.1 | Add to My Program |
GPU Accelerated Batch Trajectory Optimization for Autonomous Navigation |
|
Rastgar, Fatemeh | University of Tartu |
Masnavi, Houman | University of Tartu |
Kruusamäe, Karl | University of Tartu |
Aabloo, Alvo | University of Tartu |
Singh, Arun Kumar | University of Tartu |
Keywords: Optimization algorithms, Autonomous robots, Robotics
Abstract: Trajectory optimizations encountered in mobile robot navigation are non-convex, and thus the solution process is prone to get stuck at poor local optima, resulting in collisions with the environment. A conceptually simple workaround is to simply run the optimizer from several initializations in parallel and choose the best solution. But realizing this simple trick with off-the-shelf optimizers is challenging since they are not customized for parallel/batch operation. We fill this gap by proposing a novel batchable and GPU accelerated trajectory optimizer for autonomous navigation. Our batch optimizer can run several hundred instances of the problem in parallel in real time. We improve the state-of-the-art in the following respects. First, we show that parallel initialization naturally discovers a distribution of locally optimal trajectories residing in different homotopies. Second, we improve the navigation quality (success rate, tracking) compared to the baseline approach that relies on computing a single locally optimal trajectory at each control loop. Finally, we show that when initialized with trajectory samples from a Gaussian distribution, our batch optimizer outperforms the state-of-the-art cross-entropy method in solution quality. textbf{Codes:} url{https://tinyurl.com/a3b99m8}, Video: url{https://www.youtube.com/watch?v=ZlWJk-w03d8}
|
|
14:04-14:08, Paper WeB02.2 | Add to My Program |
Multi-Objective Trajectory Planning for Unmanned Aerial Vehicles Using CLF-CBF-Based Quadratic Programs |
|
Khan, Sufyan Hafeez | Wayne State University |
Ghaffari, Azad | Wayne State University |
Keywords: Optimization, Autonomous systems
Abstract: Control barrier function-based quadratic programs (CBF-based QP) provide an avenue for agile and numerically efficient obstacle avoidance algorithms. However, the CBF-based QP methods may lead to lengthy detours and undesirable transient tracking performance without proper trajectory planning. This paper expands the CBF-based QP concept to create a modified safe reference trajectory with a prescribed avoidance radius and direction, where the modified reference shadows the actual reference during the avoidance maneuver. We use a control Lyapunov function (CLF) to match the modified reference with the actual reference and three CBFs to formulate safety and performance objectives to maintain distance, adjust velocity, and determine the direction of the avoidance maneuver. These formulations produce constraints that are synthesized by means of a quadratic program. The QP generates a desirable velocity profile for the safe reference trajectory. Numerical simulations verify the effectiveness of the proposed trajectory planning method.
|
|
14:08-14:12, Paper WeB02.3 | Add to My Program |
Safety Index Synthesis Via Sum-Of-Squares Programming |
|
Zhao, Weiye | Carnegie Mellon University |
He, Tairan | Carnegie Mellon University |
Wei, Tianhao | Carnegie Mellon University |
Liu, Simin | Carnegie Mellon University |
Liu, Changliu | Carnegie Mellon University |
Keywords: Optimization, Constrained control, Robotics
Abstract: Control systems often need to satisfy strict safety requirements. Safety index provides a handy way to evaluate the safety level of the system and derive the resulting safe control policies. However, designing safety index functions under control limits is difficult and requires a great amount of expert knowledge. This paper proposes a framework for synthesizing the safety index for general control systems using sum-of-squares programming. Our approach is to show that ensuring the non-emptiness of safe control on the safe set boundary is equivalent to a local manifold positiveness problem. We then prove that this problem is equivalent to sum-of-squares programming via the Positivstellensatz of algebraic geometry. We validate the proposed method on robot arms with different degrees of freedom and ground vehicles. The results show that the synthesized safety index guarantees safety and our method is effective even in high-dimensional robot systems.
|
|
14:12-14:16, Paper WeB02.4 | Add to My Program |
Photovoltaic Inverter Efficiency and Lifetime Trade-Off Using Model Based Real-Time Optimization |
|
Ingalalli, Aravind | Siemens |
Robinson, Jonathan | Siemens |
Nandola, Nareshkumar | Siemens Technology |
Keywords: Optimization, Control applications, Power electronics
Abstract: Lifetime of the photovoltaic (PV) inverters is influenced by its power profile. The reliability of such PV inverters is affected by the thermal fatigue cycles witnessed by the underlying components. However, there is a trade-off between the inverter efficiency and the fatigue witnessed by its components. With a systematic formulation of this trade-off, a real-time nonlinear optimization problem is formulated to generate the appropriate reactive power set-points to the PV inverter controller. The proposed approach improves the lifetime of the inverters while keeping its efficiency above desired threshold value. The time domain loss and damage models that uses PV power profile as an input are critical to the proposed optimization framework. The proposed framework provides an option to the customer to operate the PV inverter with an objective of lifetime improvement under the acceptable losses by flattening the component thermal fatigue cycles. The framework is evaluated using the PV power profile of a 10 kVA PV inverter with various simulated case studies.
|
|
14:16-14:20, Paper WeB02.5 | Add to My Program |
Formal Comparison of Simultaneous Perturbation Stochastic Approximation and Random Direction Stochastic Approximation |
|
Peng, Ducheng | Johns Hopkins University |
Chen, Yiwen | Johns Hopkins University |
Spall, James C. | Johns Hopkins Univ |
Keywords: Optimization, Estimation, Computational methods
Abstract: Stochastic approximation (SA) algorithms can be used in system optimization problems when only noisy measurements of a system are available. This paper formally compares the performance of two popular SA algorithms in a multivariate Kiefer-Wolfowitz setting of simultaneous-perturbation SA (SPSA) and the random-directions SA (RDSA). This paper provides sufficient conditions to demonstrate which algorithm has the smaller asymptotic mean squared error (MSE) and numerically presents comparison of SPSA and RDSA in a test function and a model-free control system. The theory and supporting numerics indicate that SPSA has better efficiency (lower MSE) across a broad range of problem settings.
|
|
14:20-14:24, Paper WeB02.6 | Add to My Program |
From Open Loop to Real-Time Recipe Optimization for Complex Industrial Batch Processes |
|
Kong, Lingxun | Dow |
Castillo, Ivan | The Dow Chemical Company |
Peng, You | Massachusetts Institute of Technology |
Rendall, Ricardo | Dow Inc |
Wang, Zhenyu | Dow Chemical |
Trahan, Daniel | The Dow Chemical Company |
Bentley, David | Dow Inc |
Keywords: Optimization, Optimal control, Learning
Abstract: We propose a fundamental-model-based optimization framework for open-loop and real-time batch recipe optimization using mathematical programing (MP) and reinforcement learning (RL). The dynamic fundamental model gives rise to a complex nonlinear differential algebraic equations (DAE) system. We introduce a decomposition-based initialization algorithm for solving the large-scale nonlinear program (NLP) resulting from the discretization of the DAE system. The proposed MP and RL-based approaches are implemented to optimize the recipe of a semi-batch process in the Dow Chemical Company. For open-loop optimization, we find the optimal profiles of two input variables and the batch length that maximize average profit. For real time optimization, we train the RL agent using the fundamental model with uncertainties. The trained agent can interact with the actual process and provide control actions in real time.
|
|
14:24-14:28, Paper WeB02.7 | Add to My Program |
Sequential Sum-Of-Squares Programming for Analysis of Nonlinear Systems |
|
Cunis, Torbjørn | University of Stuttgart |
Legat, Benoît | UCLouvain |
Keywords: Optimization algorithms, Variational methods, Lyapunov methods
Abstract: Numerous interesting properties in nonlinear systems analysis can be written as polynomial optimization problems with nonconvex sum-of-squares problems. To solve those problems efficiently, we propose a sequential approach of local linearizations leading to tractable, convex sum-of-squares problems. We prove local convergence under the assumption of strong regularity, a common condition in variational analysis. The new approach is applied to estimate the region of attraction of a polynomial aircraft model, where it greatly outperforms previous methods for nonconvex sum-of-squares problems.
|
|
14:28-14:32, Paper WeB02.8 | Add to My Program |
Guaranteed Privacy of Distributed Nonconvex Optimization Via Mixed-Monotone Functional Perturbations |
|
Khajenejad, Mohammad | University of California, San Diego |
Martinez, Sonia | University of California at San Diego |
Keywords: Optimization, Optimization algorithms
Abstract: In this paper, we introduce a new notion of guaranteed privacy for distributed nonconvex optimization algorithms. In particular, leveraging mixed-monotone inclusion functions, we propose a privacy-preserving mechanism which is based on deterministic, but unknown affine perturbations of the local objective functions. The design requires a robust optimization method to characterize the best accuracy that can be achieved by an optimal perturbation. This is used to guide the refinement of a guaranteed-private perturbation mechanism that can achieve a quantifiable accuracy via a theoretical upper bound that is independent of the chosen optimization algorithm. Finally, simulation results illustrate that our approach outperforms a benchmark differentially private distributed optimization algorithm in the literature.
|
|
14:32-14:36, Paper WeB02.9 | Add to My Program |
Robust Control Co-Design Using Tube-Based Model Predictive Control |
|
Tsai, Ying-Kuan | Texas A&M University |
Malak, Richard | Texas A&M University |
Keywords: Optimization, Uncertain systems, Computational methods
Abstract: Control co-design (CCD) has received much attention since it can achieve superior system performance by optimizing physical and control systems simultaneously. Despite many successful examples from diverse engineering fields using CCD, a lack of attention toward accounting for uncertainty hinders application to real-world systems. This paper aims to solve CCD problems under uncertainty by proposing a robust CCD formulation and algorithm. A robust feedback controller using tube-based model predictive control (tube-based MPC) approach is incorporated into a bi-level optimization architecture. The use of the set invariance theory and an approximation algorithm helps identify the set of all possible states due to disturbances, this set is known as a tube, and quantify system robustness by calculating the tube size. It enables designers to make performance-robustness tradeoffs with the approximate Pareto fronts. A numerical example and a simplified model of the satellite attitude control system are used to demonstrate the proposed method. Results show that the CCD solutions dominate most of the solutions from the traditional sequential design and control design only. This study will be extended into nonlinear applications in our future work.
|
|
14:36-14:40, Paper WeB02.10 | Add to My Program |
RRT Guided Model Predictive Path Integral Method |
|
Tao, Chuyuan | University of Illinois Urbana-Champaign |
Kim, Hunmin | Mercer University |
Hovakimyan, Naira | University of Illinois at Urbana-Champaign |
Keywords: Optimization algorithms, Optimal control
Abstract: This work presents an optimal sampling-based method to solve the real-time motion planning problem in static and dynamic environments, exploiting the Rapid-exploring Random Trees (RRT) algorithm and the Model Predictive Path Integral (MPPI) algorithm. The RRT algorithm provides a nominal mean value of the random control distribution in the MPPI algorithm, resulting in satisfactory control performance in static and dynamic environments without a need for fine parameter tuning. We also discuss the importance of choosing the right mean of the MPPI algorithm, which balances exploration and optimality gap, given a fixed sample size. In particular, a sufficiently large mean is required to explore the state space enough, and a sufficiently small mean is required to guarantee that the samples reconstruct the optimal controls. The proposed methodology automates the procedure of choosing the right mean by incorporating the RRT algorithm. The simulations demonstrate that the proposed algorithm can solve the motion planning problem in real time for static or dynamic environments.
|
|
WeB03 Regular Session, Sapphire EF |
Add to My Program |
Robotics I |
|
|
Chair: Saldarriaga, Carlos | Escuela Superior Politécnica Del Litoral |
Co-Chair: Cai, Mingyu | Lehigh University |
|
14:00-14:15, Paper WeB03.1 | Add to My Program |
Decision Making of Ball-Batting Robots Based on Deep Reinforcement Learning |
|
Hsiao, Tesheng | National Yang Ming Chiao Tung University |
Kao, Hsuan-Che | National Yang Ming Chiao Tung University |
Keywords: Mechanical systems/robotics, Control applications, Vision-based control
Abstract: Ball-batting is a challenging task because it requires excellent eye-hand coordination and instantaneous decision making. Moreover, as a winning strategy, the task of ball-batting concerns not only about “hitting a flying ball with a bat”, but about “sending the rebounding ball to a prespecified location”. Therefore, the decisions on when and where to hit the ball and what the velocity of the bat is at the impact time are crucial for a successful ball-batting. Making such decisions should consider the flying and rebounding behavior of the ball and is very complicated. In this paper, we apply the deep reinforcement learning (DRL) method to train the ball-batting robot developed by the authors for making timely and appropriate batting decisions. A simulated environment consisting of a physical flying model and a neural network rebounding model is constructed for efficient training. Then experiments in the real world are conducted and the results show that after being trained by DRL, the robot can hit the incoming ball in all tests and send the rebounding ball to the target location with a successful rate of 58.8%.
|
|
14:15-14:30, Paper WeB03.2 | Add to My Program |
Controlling a Double-Pendulum Crane by Combining Reinforcement Learning and Conventional Control |
|
Eaglin, Gerald | University of Louisiana at Lafayette |
Poche, Thomas | University of Louisiana at Lafayette |
Vaughan, Joshua | Oak Ridge National Laboratory |
Keywords: Mechanical systems/robotics, Learning, Optimal control
Abstract: Controlling oscillation is vital for applications in which flexible systems are employed. Many existing control methods rely on knowledge of the system dynamics to mitigate unwanted vibration. However, model-free methods can also be employed to control vibration. One method for model-free control is reinforcement learning (RL). Although the RL agent does not require information about the system to learn a control policy, domain knowledge of dynamics and control can be used to augment the agent and aid in generating an effective control policy. This work analyzes the effectiveness of training RL controllers that operate in combination with conventional controllers. Agents were trained in simulation using a model of a small-scale double-pendulum crane. The effect of the conventional control component on training as well as sensitivity to modeling error are analyzed. Agent transferability is investigated by implementing the simulation-trained controllers on a physical small-scale double-pendulum crane.
|
|
14:30-14:45, Paper WeB03.3 | Add to My Program |
On Mapping Stiffness and Damping in Robotic Impedance Control: A Spatial Validation of Coupling |
|
Patiño Miñán, José Johil | Escuela Superior Politécnica Del Litoral |
Saldarriaga, Carlos | Escuela Superior Politécnica Del Litoral |
Keywords: Mechanical systems/robotics, Modeling, Simulation
Abstract: In order to calculate the necessary joint torques and end-effector forces in accordance with a desired dynamic behavior for assembly and manipulation tasks in the field of robotics, robotic impedance control has been extensively employed. Current complete congruent mapping equations have not been validated through actual robotic manipulators in impedance control tasks. In this paper, we show and validate from fundamental principles, with the use of a 6 DoF robotic manipulator, the transformation equations of the stiffness and damping matrices from Cartesian to joint space without losing generality. Our findings highlight the significance of the Cartesian damping coupling term, which is typically absent from the existent robotics literature, in the stiffness matrices after mapping them to the joint space.
|
|
14:45-15:00, Paper WeB03.4 | Add to My Program |
Adaptive Attitude Control for Foldable Quadrotors |
|
Patnaik, Karishma | Arizona State University |
Zhang, Wenlong | Arizona State University |
Keywords: Robotics, Flexible structures, Emerging control applications
Abstract: Recent quadrotors have transcended conventional designs, emphasizing more on foldable and reconfigurable bodies. The state of the art still focuses on the mechanical feasibility of such designs with limited discussions on the tracking performance of the vehicle during configuration switching. In this article, we first present a common framework to analyse the attitude errors of a folding quadrotor via the theory of switched systems. We then employ this framework to investigate the attitude tracking performance for two case scenarios - one with a conventional geometric controller for precisely known system dynamics; and second, with our proposed morphology-aware adaptive controller that accounts for any modeling uncertainties. Finally, we cater to the desired switching requirements from our stability analysis by exploiting the trajectory planner to obtain superior tracking performance while switching. Simulation results are presented that validate the proposed control and planning framework for a foldable quadrotor's flight through a passageway.
|
|
15:00-15:15, Paper WeB03.5 | Add to My Program |
Accelerated Learning and Control of Robots with Uncertain Kinematics and Unknown Disturbances |
|
Yilmaz, Cemal Tugrul | UC San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Robust adaptive control, Robotics, Neural networks
Abstract: This paper develops accelerated model-free robust adaptive control algorithms for unknown static maps which correspond to forward kinematics in robotics. The technique is based on the estimation of unknown Jacobian matrix by using a neural-network based approximation and use of monotonically increasing gain functions in the controller and update law. The introduced algorithms provide robustness against the unknown environmental disturbances and achieve asymptotic, exponential and prescribed-time reference trajectory tracking. The fixed-time stabilization in prescribed time is the strongest notion among them that allows user to predefine a terminal time irrespective of initial condition and system parameters. A formal stability analysis for each algorithm is presented and theoretical results are validated through numerical simulations conducted on a single-section three-actuator continuum robot.
|
|
15:15-15:30, Paper WeB03.6 | Add to My Program |
Vessel Inspection In-The-Wild: Practical Planning in Large-Scale Industrial Environments |
|
Hansen, Jakob Grimm | Aarhus University |
Heiß, Micha | Aarhus University |
Li, Dengyun | University of Twente |
Kozłowski, Michal | Aarhus University |
Kayacan, Erdal | Aarhus University |
Keywords: Robotics, Autonomous robots, Autonomous systems
Abstract: In this paper, a novel strategy for practical inspection planning in dry docks using unmanned aerial vehicles (UAVs) is presented. Planning is a fundamental prerequisite for accurate navigation and control of the UAV. The proposed method utilises the random sample consensus (RANSAC) algorithm to extract plane models from a voxel grid representation of the environment. For high-level planning, semantic knowledge of the environment is leveraged in a novel manner to exploit of structured obstacles, such as straight walls and orthogonal corners. In order to deal with lower-level navigation, the approach incorporates a simple graph-based local replanner to generate paths that avoid obstacles in the environment. The proposed method is compared with state-of-the-art graph-based planner in simulation and subsequently evaluated in a real environment. The paper maintains the use case of UAV vessel inspection and presents exhaustive simulation and field testing, which demonstrate the viability of the proposed approach in a fully working large-scale industrial environment.
|
|
WeB04 Invited Session, Sapphire AB |
Add to My Program |
Advanced Control of Wind Farms and Wind Turbines: Session II: Wind Farm
Wake Control |
|
|
Chair: Mulders, Sebastiaan Paul | Delft University of Technology |
Co-Chair: Zare, Armin | University of Texas at Dallas |
Organizer: Mulders, Sebastiaan Paul | Delft University of Technology |
Organizer: Bay, Christopher | National Renewable Energy Laboratory |
Organizer: Fleming, Paul | National Renewable Energy Laboratory |
Organizer: van Wingerden, Jan-Willem | Delft University of Technology |
|
14:00-14:15, Paper WeB04.1 | Add to My Program |
Robustness of Two-Dimensional Stochastic Dynamical Wake Models for Yawed Wind Turbines (I) |
|
Rodrigues, Mireille | The University of Texas at Dallas |
Burgess, Nicolas | University of Texas at Dallas |
Bhatt, Aditya | The University of Texas at Dallas |
Leonardi, Stefano | The University of Texas at Dallas |
Zare, Armin | University of Texas at Dallas |
Keywords: Distributed parameter systems, Linear systems, Energy systems
Abstract: We develop stochastic dynamical reduced-order models of wind farm turbulence that capture the effects of yaw misalignment due to control or atmospheric variability on turbine wakes and their interactions. Our models are based on the stochastically forced linearized Navier-Stokes equations around analytical descriptions of the wake velocity provided by low-fidelity engineering wake models. The power-spectral density of the source of additive stochastic excitation is identified via convex optimization to ensure statistical consistency with high-fidelity models while preserving model parsimony. We demonstrate the utility of our approach in capturing turbulence intensity variations in accordance with large-eddy simulations of the flow over a cascade of wind turbines. While our models are developed to match velocity correlations from sensors that are placed directly behind perfectly aligned wind turbine rotors, their predictions maintain a desirable level of accuracy even when the turbines are yawed.
|
|
14:15-14:30, Paper WeB04.2 | Add to My Program |
Enhancing Wake Mixing in Wind Farms by Multi-Sine Signals in the Helix Approach (I) |
|
Huang, Lu-Jan | Delft University of Technology |
Mulders, Sebastiaan Paul | Delft University of Technology |
Taschner, Emanuel | TU Delft |
van Wingerden, Jan-Willem | Delft University of Technology |
Keywords: Emerging control applications, Simulation, Electrical machine control
Abstract: In most current offshore wind farms, the turbines are controlled greedily, neglecting any coupling by wake effects with other turbines. By neglecting these effects of aerodynamic interactions, the power production performance is substantially reduced. Besides the well-known wake steering and dynamic induction control wake control strategies, a novel wind farm flow control strategy called the Helix approach has been recently proposed to mitigate the impacts of wake effects and optimize wind farm performance. The Helix approach adopts the individual pitch control (IPC) technique to dynamically deform the wake into the helical shape, which induces wake instability and thereby stimulates wake recovery. The first results employing a single-harmonic signal have demonstrated promising enhancement in wake recovery effects. However, more complex signals to potentially improve the effectiveness of the Helix approach have never been studied. This paper explores the potential of using higher-harmonic signals in the Helix approach to further enhance wake mixing. The aeroelastic simulator, OpenFAST, with its recently developed free vortex wake codes is adopted to simulate the dynamic wake evolution. A Fourier stability analysis is used to quantitatively identify the wake breakdown position. Results show that the wake breaks down at 1.75 rotor diameter (D) from the rotor using optimized multi-sine signals, which is a significant improvement over the breakdown distance at 2.50 D resulting from the conventional single-sine Helix. The earlier wake breakdown indicates faster wake recovery and is to be validated by future higher-fidelity simulation studies.
|
|
14:30-14:45, Paper WeB04.3 | Add to My Program |
Enhanced Wake Mixing in Wind Farms Using the Helix Approach: A Loads Sensitivity Study (I) |
|
van Vondelen, Aemilius | TU Delft |
Navalkar, Sachin | Delft University of Technology |
Kerssemakers, Daan | TU Delft |
van Wingerden, Jan-Willem | Delft University of Technology |
Keywords: Energy systems, Simulation, Control applications
Abstract: The Helix approach is a control technology that reduces the wake effect in wind farms by accelerating wake mixing through individual pitch control, resulting in significant AEP gain. However, this study found that the controller may increase pitch bearing damage and loads on some turbine components, depending on its settings. Using a modified version of NREL's Reference OpenSource Controller in OpenFAST, this study analysed the sensitivity of loads and pitch bearing damage to different Helix controller settings on the IEA-15MW reference offshore wind turbine. Results showed that loads increased with the amplitude of the excitation signal but were less affected by its frequency. Additionally, more pitch bearing damage was observed in the counterclockwise Helix direction, while slightly higher loads were observed in the clockwise direction when using the same excitation signal amplitude and frequency for both directions.
|
|
14:45-15:00, Paper WeB04.4 | Add to My Program |
Wind Tunnel Testing of Wake Tracking Methods Using a Model Turbine and Tailored Inflow Patterns Resembling a Meandering Wake (I) |
|
Onnen, David | ForWind - Center for Wind Energy Research, University of Oldenbu |
Petrović, Vlaho | Universität Oldenburg |
Neuhaus, Lars | University of Oldenburg, ForWind - Institute of Physics |
Langidis, Apostolos | University of Oldenburg |
Kühn, Martin | University of Oldenburg |
Keywords: Estimation, Energy systems, Kalman filtering
Abstract: Wind farm control can be enhanced by feedback information about the current flow situation in the farm. Especially for closed-loop wake-steering control, the position of a wind turbine's wake with respect to the next turbine is valuable. Depending on the context, e.g. installed sensory or required time- and wake position resolution, different methods for wake tracking exist. With increasing fidelity, not only time-averaged wakes but also instantaneous wake conditions, arising from the meandering nature of a wind turbine wake, are considered for control. In this work, two methods for the dynamic estimation of the wake centre position are experimentally tested. One of them is based on wind turbine blade loads, the other on a fixed-beam staring lidar. Both methods use an Extended Kalman Filter with a dynamic model that takes the meandering nature of a wind turbine wake into account. In the atmospheric boundary layer, turbulence patterns of multiple rotor diameters are driving the downstream meandering of the wake. In wind tunnel setups with large model turbines, these spatial scale ratios are hardly reachable, thus meandering conditions were difficult to reproduce up to now. In this work, the inflow conditions are generated in a wind tunnel with an active grid, which imprints both a wind speed deficit and additional turbulence in a meandering frame of reference. The paper describes the creation and characterization of such inflow, while not claiming for an exact representation of a meandering wake in an atmospheric boundary layer. Inflow conditions of different meandering dynamics are created and their impact on the tracking methods is analyzed. The experiments show that the generated inflow conditions both resemble a wake and induce an expected response of the model turbine. The results from the wake tracking methods are comparable with the available literature, thus further indicate the potential of the setup for control-oriented experiments in the context of meandering wakes, turbine-turbine interaction and load alleviation.
|
|
15:00-15:15, Paper WeB04.5 | Add to My Program |
Radar Based Wake Control for Reducing the Levelized Cost of Energy in Offshore Wind Farms (I) |
|
D'Amato, Fernando | GE Research |
Boutselis, George I. | Georgia Institute of Technology |
Bonanni, Pierino | GE Global Research |
Szczepanski, Walter | Helios Remote Sensing Systems, Inc |
Lopez-Negrete, Rodrigo | General Electric Global Research |
Keywords: Control applications, Energy systems, Sensor networks
Abstract: Wake controls in wind farms has evolved significantly in the last twenty years, motivated mainly by its potential to increase annual energy production (AEP) through reduction of wake losses. Engineering models that characterize the wakes in the farm have enhanced fidelity and computational efficiency. Computational environments have been developed to adjust turbine control settings based on these models to reduce the impact of wakes. Several experimental campaigns have been carried out to validate the computational predictions. Yet, experimental results have typically shown lower AEP gains than expected. The variability in wake characteristics and the inability to calculate them online are key factors limiting the practical value of existing wake control solutions. This work presents a wake control approach that proposes new sensors to measure the wakes online and uses accurate wake characteristics to enable further energy capture in offshore wind farms. A network of low-cost radar sensors is specifically designed to detect wakes in wind farms. A model-based estimation approach is developed to reduce the online wake uncertainty. Then, a model-based optimization framework is used to calculate AEP gains achieved by steering wakes via yaw actuation. The feasibility of the proposed approach is assessed by quantifying the changes in the levelized cost of energy (LCOE) resulting from the additional AEP gains and the extra cost of the new sensors.
|
|
WeB05 Regular Session, Sapphire 411A |
Add to My Program |
Optimization Algorithms II |
|
|
Chair: Kia, Solmaz S. | University of California Irvine (UCI) |
Co-Chair: Hale, Matthew | University of Florida |
|
14:00-14:15, Paper WeB05.1 | Add to My Program |
Noise Amplifiation of Momentum-Based Optimization Algorithms |
|
Mohammadi, Hesameddin | University of Southern California |
Razaviyayn, Meisam | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Optimization algorithms, Optimization, Stability of linear systems
Abstract: We study momentum-based first-order optimization algorithms in which the iterations utilize information from the two previous steps and are subject to additive white noise. For strongly convex quadratic problems, we utilize Jury stability criterion to provide a novel geometric characterization of linear convergence and exploit this insight to derive alternative proofs of standard convergence results and identify fundamental performance tradeoffs. We use the steady-state variance of the error in the optimization variable to quantify noise amplification and establish analytical lower bounds on the product between the settling time and the smallest/largest achievable noise amplification that scale quadratically with the condition number. This extends the prior work [1], where only the special cases of Polyak's heavy-ball and Nesterov's accelerated algorithms were studied. We also use this geometric characterization to introduce a parameterized family of algorithms that strikes a balance between noise amplification and settling time while preserving order-wise Pareto optimality.
|
|
14:15-14:30, Paper WeB05.2 | Add to My Program |
An Almost Sure Convergence Analysis of Zeroth-Order Mirror Descent Algorithm |
|
Paul, Anik Kumar | JADAVPUR UNIVERSITY |
Mahindrakar, Arun D. | Indian Institute of Technology Madras |
Kalaimani, Rachel Kalpana | Indian Institute of Technology Madras |
Keywords: Optimization algorithms, Optimization
Abstract: In this paper, we study the almost sure convergence analysis of zeroth-order mirror descent algorithm. The algorithm admits non-smooth convex function with an estimate of gradient using Nesterov's Gausssian Approximation technique. We establish that under suitable condition of step-size iterates of the algorithm converge to a neighborhood of optimal function value almost surely. We extend the analysis in distributed optimization. We validate the almost sure convergence on a non-smooth optimization problem.
|
|
14:30-14:45, Paper WeB05.3 | Add to My Program |
Federated Learning Using Variance Reduced Stochastic Gradient for Probabilistically Activated Agents |
|
Rostami, Mohammadreza | University of California, Irvine |
Kia, Solmaz S. | University of California Irvine (UCI) |
Keywords: Learning, Machine learning, Optimization algorithms
Abstract: This paper proposes an algorithm for Federated Learning (FL) with a two-layer structure that achieves both variance reduction and a faster convergence rate to an optimal solution in the setting where each agent has an arbitrary probability of selection in each iteration. The first layer of our algorithm corresponds to the model parameter propagation across agents done by the server. In the second layer, each agent does its local update with a stochastic and variance-reduced technique called Stochastic Variance Reduced Gradient (SVRG). We leverage the concept of variance reduction from stochastic optimization when the agents want to do their local update step to reduce the variance caused by stochastic gradient descent (SGD). The special attention in this paper is on FL operation where the agents' participation in the update process in each round is probabilistic and non-uniform. We provide a convergence bound for our algorithm which improves the rate from O(1/K^(1/2))to O(1/K) by using a constant step-size when the cost is strongly convex. For non-convex costs, we establish a O(1/K^(2/3)) to a stationary point using a vanishing stepsize. We demonstrate the performance of our algorithm using numerical~simulations.
|
|
14:45-15:00, Paper WeB05.4 | Add to My Program |
On the Stability Analysis of Open Federated Learning Systems |
|
Sun, Youbang | Northeastern University |
Fernando, Heshan | Rensselaer Polytechnich Institute |
Chen, Tianyi | Rensselaer Polytechnic Institute |
Shahrampour, Shahin | Northeastern University |
Keywords: Optimization algorithms, Optimization, Machine learning
Abstract: We consider the open federated learning (FL) systems, where clients may join and/or leave the system during the FL process. Given the variability of the number of present clients, convergence to a fixed model cannot be guaranteed in open systems. Instead, we resort to a new performance metric that we term the stability of open FL systems, which quantifies the magnitude of the learned model in open systems. Under the assumption that local clients' functions are strongly convex and smooth, we theoretically quantify the radius of stability for two FL algorithms, namely local SGD and local Adam. We observe that this radius relies on several key parameters, including the function condition number as well as the variance of the stochastic gradient. Our theoretical results are further verified by numerical simulations on synthetic data.
|
|
15:00-15:15, Paper WeB05.5 | Add to My Program |
A Totally Asynchronous Block-Based Heavy Ball Algorithm for Convex Optimization |
|
Hustig-Schultz, Dawn | University of California, Santa Cruz |
Hendrickson, Katherine | University of Florida |
Hale, Matthew | University of Florida |
Sanfelice, Ricardo G. | University of California at Santa Cruz |
Keywords: Optimization algorithms, Stability of hybrid systems
Abstract: We present a totally asynchronous multiagent algorithm, based on the heavy ball method, that guarantees fast convergence to the minimizer of a twice continuously differentiable, convex objective function over a convex constraint set. The algorithm is parallelized in the sense that the decision variable is partitioned into blocks, each of which is updated by only a single agent. We consider two types of asynchrony: in agents’ computations and in communications between agents, both under arbitrarily long delays. We show that, for certain values of the step size and other parameters, the heavy ball algorithm exponentially converges to a minimizer, even under total asynchrony. Numerical results validate these findings and demonstrate significantly faster convergence than a comparable gradient descent algorithm.
|
|
15:15-15:30, Paper WeB05.6 | Add to My Program |
Online Convex Optimization with Long Term Constraints for Predictable Sequences |
|
Muthirayan, Deepan | University of California at Irvine |
Yuan, Jianjun | University of Minnesota |
Khargonekar, Pramod | Univ. of California, Irvine |
Keywords: Optimization algorithms
Abstract: In this paper, we investigate the framework of Online Convex Optimization (OCO) for online learning. OCO offers a very powerful online learning framework for many applications. In this context, we study a specific framework of OCO called {it OCO with {long-term} constraints}. {Long-term} constraints are introduced typically as an alternative to reduce the complexity of projection at every update step in online optimization. While many algorithmic advances have been made towards online optimization with {long-term} constraints, these algorithms typically assume that the sequence of cost functions over a certain T finite steps that determine the cost to the online learner are adversarially generated. In many circumstances, the sequence of cost functions may not be unrelated, and thus predictable from those observed till a point of time. In this paper, we study the setting where the sequences are predictable. We present a novel algorithm for online optimization with {long-term} constraints that can leverage such predictability for linear cost functions. We show that, with a predictor that can supply the gradient information of the next function in the sequence, our algorithm can achieve an overall regret and constraint violation rate that is strictly less than the rate that is achievable without prediction.
|
|
WeB06 Regular Session, Sapphire 411B |
Add to My Program |
Nonholonomic Systems |
|
|
Chair: Taha, Haithem | University of California, Irvine |
Co-Chair: Chen, Tan | Michigan Technological University |
|
14:00-14:15, Paper WeB06.1 | Add to My Program |
Singularly Perturbed Averaging with Application to Bio-Inspired 3D Source Seeking |
|
Abdelgalil, Mahmoud | University of California, Irvine |
Eldesoukey, Asmaa | University of California at Irvine |
Taha, Haithem | University of California, Irvine |
Keywords: Nonholonomic systems, Adaptive control, Biologically-inspired methods
Abstract: We propose a novel 3D source seeking algorithm for rigid bodies with a non-collocated sensor inspired by the chemotactic navigation strategy of sea urchin sperm known as helical klinotaxis. We work directly with the rotation group text{SO}(3) without parameterization in representing the attitude of a rigid body. As a consequence, the proposed algorithm does not require attitude feedback for implementation as opposed to all previous work on 3D source seeking. The stability of the proposed algorithm is proven using a careful application of singular perturbation and second order averaging.
|
|
14:15-14:30, Paper WeB06.2 | Add to My Program |
Computationally Efficient Tracking Control of Differential Drive Wheeled Mobile Robots |
|
Zhang, Boyang | Duke University |
Gavin, Henri P. | Duke University |
Keywords: Nonholonomic systems, Constrained control, Mechanical systems/robotics
Abstract: This paper presents a computationally efficient, heuristic-free, and nonlinear feedback control framework for the tracking control of the position and orientation of a differential drive wheeled mobile robot (WMR) subjected to high-speed maneuvers and external disturbances. We synthesize the control law by an extension of Gauss’s principle of least constraint with dynamic incorporation of holonomic and nonholonomic equality constraints and with coordinate transformation. The command control actions for the WMR’s constrained dynamics result from solving a linear matrix equation (a Karush-Kuhn-Tucker system) at each point in time. No dynamics linearization or iterative solution is involved in the framework. Numerical experiments of a high-speed, differential drive WMR under sinusoidal and Gaussian external disturbances are presented to showcase the effectiveness of the proposed method.
|
|
14:30-14:45, Paper WeB06.3 | Add to My Program |
Global and Almost-Global Controllability of Underactuated Mechanical Systems by Using Time-Reversal Symmetry |
|
Chen, Tan | Michigan Technological University |
Goodwine, Bill | University of Notre Dame |
Keywords: Nonholonomic systems, Lyapunov methods, Robotics
Abstract: This paper uses time-reversal symmetry (T-symmetry), which is inherent in many mechanical systems, to establish global controllability results for a class of underactuated mechanical systems, i.e., the systems with fewer actuators than the number of degrees of freedom (DoF). The idea is to find a control law guaranteeing a globally asymptotically stabilizable equilibrium (GAS) state, at which small-time local controllability (STLC) is also granted. Then, such equilibrium state can be used as a connection to design a global controller by using the time-reversal symmetry, i.e., the system can be driven from any initial state to any target state within finite time via the connection. By using the same line of reasoning, an UMS with an almost GAS equilibrium state proves to be almost-globally controllable. It shows that underactuated pendula with one degree of unactuation are almost-globally controllable (except a set of Lebesgue measure zero), for which the connection state is when all the links are at the downmost positions.
|
|
14:45-15:00, Paper WeB06.4 | Add to My Program |
Optimal Control of Nonholonomic Systems Via Magnetic Fields |
|
Oprea, Maria | Cornell University |
Ruth, Maximilian | Cornell University |
Kassabova, Dora | Cornell University |
Clark, William | Cornell University |
Keywords: Nonholonomic systems, Optimal control
Abstract: Geometric optimal control utilizes tools from differential geometry to analyze the structure of a problem to determine the control and state trajectories to reach a desired outcome while minimizing some cost function. For a controlled mechanical system, the control usually manifests as an external force which, if conservative, can be added to the Hamiltonian. In this work, we focus on mechanical systems with controls added to the symplectic form rather than the Hamiltonian. In practice, this translates to controlling the magnetic field for an electrically charged system. We develop a basic theory deriving necessary conditions for optimality of such a system subjected to nonholonomic constraints. We consider the representative example of a magnetically charged Chaplygin Sleigh, whose resulting optimal control problem is completely integrable.
|
|
15:00-15:15, Paper WeB06.5 | Add to My Program |
Contracting Forced Lagrangian and Contact Lagrangian Systems: Application to Nonholonomic Systems with Symmetries |
|
Anahory Simões, Alexandre | IE University |
Colombo, Leonardo Jesus | Spanish National Research Council |
Keywords: Algebraic/geometric methods, Nonholonomic systems, Autonomous systems
Abstract: In this paper we address the problem of identifying contracting systems among dynamical systems appearing in mechanics. First, we introduce a sufficient condition to identify contracting systems in a general Riemannian manifold. Then, we apply this technique to establish that a particular type of dissipative forced mechanical system is contracting, while stating immediate consequences of this fact for its stability. Finally, we use the previous results to study the stability of particular types of Contact and Nonholonomic systems.
|
|
15:15-15:30, Paper WeB06.6 | Add to My Program |
Homotopy Method for Optimal Motion Planning with Homotopy Class Constraints |
|
He, Wenbo | Washington University in St. Louis |
Huang, Yunshen | Washington University in St. Louis |
Wang, Jie | Washington University in St. Louis |
Zeng, Shen | Washington University in St. Louis |
Keywords: Computational methods, Nonholonomic systems, Optimal control
Abstract: Optimal motion planning is an essential task within the field of control theory. Therein, the key task is to synthesize optimal system trajectories that pass through cluttered environments while respecting given homotopy class constraints, which is critical in many topology-restricted applications such as search and rescue. In this paper, we introduce a novel optimal motion planning technique with 2-dimensional homotopy class constraints for general dynamical systems. We first initialize an optimal system trajectory regardless of obstacles and homotopy class constraints, and design an auxiliary obstacle trajectory for each obstacle such that the system trajectory belongs to the desired homotopy class regarding these auxiliary obstacle trajectories. During the procedure of deforming the auxiliary obstacle trajectory to the original counterparts, we propose a homotopy method based on nonlinear programming (NLP) such that the synthesized optimal system trajectories fulfill the aforementioned homotopy class constraints. The proposed method is validated with numerical results on two classic nonlinear systems with planar static and moving obstacles.
|
|
WeB07 Regular Session, Aqua 303 |
Add to My Program |
Nonlinear Systems Identification |
|
|
Chair: Kamalapurkar, Rushikesh | Oklahoma State University |
Co-Chair: Naitali, Abdessamad | National Graduate School for Arts and Crafts of the University Mohammed 5 at Rabat (UM5R) |
|
14:00-14:15, Paper WeB07.1 | Add to My Program |
Black Box Identification of the Phase Bulk Inductance of Current and Flux Fed Short Pitched Winding Switched Reluctance Motors |
|
Fouadi, Mehdi | CEDOC Information Technology and Engineering Sciences (ST2I), EN |
Naitali, Abdessamad | ERRERA, Research Center of Engineering and Healt Sciences and Te |
Keywords: Nonlinear systems identification, Electrical machine control, Estimation
Abstract: A New solution to the identification problem of current and flux fed Short Pitched Winding Switched Reluctance Motors (SPW-SRM) in the presence of strong saturation is developed. This is done by expanding the rotational periodicity and magnetic saturation in a high dimensional feature space spanned by a user defined combined periodic and smooth functions. The importance of this identification scheme lies in the good generalization ability of the constructed models, their robustness with respect to output measurement noise, and the diversity of their use as an estimation, prediction or diagnosis tool. Furthermore, no particular attention is required while the identification experiment since that data can be collected in vehicle normal operating regimes. Identification results obtained for a 60 kW SRM in the presence of measurement noise within automotive applications involving Fourier series and rational polynomial functions are quite satisfactory.
|
|
14:15-14:30, Paper WeB07.2 | Add to My Program |
Carleman Lifting for Nonlinear System Identification with Guaranteed Error Bounds |
|
Abudia, Moad | Oklahoma State University |
Kamalapurkar, Rushikesh | Oklahoma State University |
Rosenfeld, Joel A. | University of South Florida |
Keywords: Nonlinear systems identification, Identification, Estimation
Abstract: This paper concerns identification of uncontrolled or closed loop nonlinear systems using a set of trajectories that are generated by the system in a domain of attraction. The objective is to ensure that the trajectories of the identified systems are close to the trajectories of the real system, as quantified by an error bound that is prescribed a priori. A majority of existing methods for nonlinear system identification rely on techniques such as neural networks, autoregressive moving averages, and spectral decomposition that do not provide systematic approaches to meet pre-defined error bounds. The developed method is based on Carleman linearization-based lifting of the nonlinear system to an infinite dimensional linear system. The linear system is then truncated to a suitable order, computed based on the prescribed error bound, and parameters of the truncated linear system are estimated from data. The effectiveness of the technique is demonstrated by identifying an approximation of the Van der Pol oscillator from data within a prescribed error bound.
|
|
14:30-14:45, Paper WeB07.3 | Add to My Program |
Inferring Dynamics of Discrete-Time, Fractional-Order Control-Affine Nonlinear Systems |
|
Yaghooti, Bahram | Washington University in St. Louis |
Sinopoli, Bruno | Washington University in St Louis |
Keywords: Nonlinear systems identification, Identification, Identification for control
Abstract: In this paper, a system identification algorithm is proposed for control-affine, discrete-time fractional-order nonlinear systems. The proposed algorithm is a data-integrated framework that provides a mechanism for data generation and then uses this data to obtain the drift-vector, control-vector fields, and the fractional order of the system. The proposed algorithm includes two steps. In the first step, experiments are designed to generate the required data for dynamic inference. The second step utilizes the generated data in the first step to obtain the system dynamics. The memory-dependent property of fractional-order Gr"{u}nwald-Letnikov difference operator is used to compute the fractional order of the system. Then, drift-vector and control-vector fields are reconstructed using orthonormal basis functions, and calculation of the coefficients is formulated as an optimization problem. Finally, simulation results are provided to illustrate the effectiveness of the proposed framework. Additionally, one of the methods for identifying integer-order systems is applied to the generated data by a fractional-order system, and the results are included to show the benefit of using a fractional-order model in long-range dependent processes.
|
|
14:45-15:00, Paper WeB07.4 | Add to My Program |
Control-Aware Learning of Koopman Embedding Models |
|
Uchida, Daisuke | University of Michigan |
Duraisamy, Karthik | University of Michigan |
Keywords: Nonlinear systems identification, Identification for control, Modeling
Abstract: A learning method is proposed for Koopman operator-based models with the goal of improving closed-loop control behavior. A neural network-based approach is used to discover a space of observables in which nonlinear dynamics is linearly embedded. While accurate state predictions can be expected with the use of such complex state-to-observable maps, undesirable side-effects may be introduced when the model is deployed in a closed-loop environment. This is because of modeling or residual error in the linear embedding process, which can manifest itself in a different manner compared to the state prediction. To this end, a technique is proposed to refine the originally trained model with the goal of improving the closed-loop behavior of the model while retaining the state-prediction accuracy obtained in the initial learning. Finally, a simple data sampling strategy is proposed to use inputs deterministically sampled from continuous functions, leading to additional improvements in the controller performance for nonlinear dynamical systems. Several numerical examples are provided to show the efficacy of the proposed method.
|
|
15:00-15:15, Paper WeB07.5 | Add to My Program |
A Fading Memory Discontinuous EKF for the Online Model Identification of Cable-Driven Robots with Backlash |
|
Poignonec, Thibault | University of Strasbourg |
Nageotte, Florent | University of Strasbourg |
Bayle, Bernard | University of Strasbourg |
Keywords: Nonlinear systems identification, Kalman filtering, Robotics
Abstract: This paper deals with the online model identification of robots suffering from backlash in their transmission, such as is the case for cable-driven endoscopic robots. Although online backlash identification is advantageous due to potential textit{in-situ} evolution of the backlash behavior, most existing methods focus on offline identification. Existing online approaches are either designed for DOF-by-DOF identification or limited by the simplicity of the underlying backlash models. We propose a new identification method based on Discontinuous EKF (DEKF) filtering to learn, online, the correct parameters of a multi-DOF backlash model. This allows to account for measurement noise and to include more complex backlash behaviors. A proof of concept on a simulated 3 DOF flexible endoscopic robot is presented to demonstrate the potential of such an approach.
|
|
15:15-15:30, Paper WeB07.6 | Add to My Program |
Model Reference Gaussian Process Regression: Data-Driven Output Feedback Controller |
|
Kim, Hyuntae | ASRI, Seoul National University |
Chang, Hamin | Seoul National University |
Shim, Hyungbo | Seoul National University |
Keywords: Nonlinear output feedback, Nonlinear systems identification, Identification for control
Abstract: Data-driven controls using Gaussian process regression have recently gained much attention. In such approaches, system identification by Gaussian process regression is mainly followed by model-based controller designs. However, the outcomes of Gaussian process regression are often too complicated to apply conventional control designs, which makes the numerical design such as model predictive control employed in many cases. To overcome the restriction, our idea is to perform Gaussian process regression to the inverse of the plant with the same input/output data for the conventional regression. With the inverse, one can design a model reference controller without resorting to numerical control methods. This paper considers single-input single-output (SISO) discrete-time nonlinear systems of minimum phase with relative degree one. It is highlighted that the model reference Gaussian process regression (MR-GPR) controller is designed directly from pre-collected input/output data without identification of the system itself.
|
|
WeB08 Invited Session, Aqua 305 |
Add to My Program |
Estimation and Control of Infinite Dimensional Systems II |
|
|
Chair: Demetriou, Michael A. | Worcester Polytechnic Institute |
Co-Chair: Burns, John A | Virginia Tech |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Burns, John A | Virginia Tech |
|
14:00-14:15, Paper WeB08.1 | Add to My Program |
Failure Rate Identification of a Reparable System Governed by Coupled ODE-PDEs and Deep Learning Based Implementation (I) |
|
Hu, Weiwei | University of Georgia |
Tepper, Alexander | University of Georgia |
Xie, Bin | University of Georgia |
Zhang, Qing | University of Georgia |
Keywords: Distributed parameter systems, Identification, Machine learning
Abstract: This paper is concerned with the problem of machine failure rate identification of a 3-state reparable system and its implementation via deep learning. The mathematical model is governed by a distributed parameter system involving coupled partial and integro-differential equations. The objective of this work is to identify the failure rates using the sampled system output measurements. Deep learning based failure rate identification methods are proposed. Numerical examples are provided to illustrate the designs and results.
|
|
14:15-14:30, Paper WeB08.2 | Add to My Program |
Task Space Tracking of Soft Manipulators: Inner-Outer Loop Control Based on Cosserat-Rod Models (I) |
|
Zheng, Tongjia | University of Notre Dame |
Han, Qing | University of Notre Dame |
Lin, Hai | University of Notre Dame |
Keywords: Distributed parameter systems, Robotics
Abstract: The control problem of soft robots has been considered a challenging subject because they are of infinite degrees of freedom and highly under-actuated. Existing studies have mainly relied on approximated finite-dimensional models. In this work, we exploit infinite-dimensional feedback control for soft robots. We adopt the Cosserat-rod theory and employ nonlinear partial differential equations (PDEs) to model the kinematics and dynamics of soft manipulators, including their translational motions (for shear and elongation) and rotational motions (for bending and torsion). The objective is to achieve position tracking of the entire manipulator in a planar task space by controlling the moments (generated by actuators). The design is inspired by the energy decay property of damped wave equations and has an inner-outer loop structure. In the outer loop, we design desired rotational motions that rotate the translational component into a direction that asymptotically dissipates the energy associated with position tracking errors. In the inner loop, we design inputs for the rotational components to track their desired motions, again by dissipating the rotational energy. We prove that the closed-loop system is exponentially stable and evaluate its performance through simulations.
|
|
14:30-14:45, Paper WeB08.3 | Add to My Program |
Adaptive Output Synchronization of Networked Positive Real Infinite Dimensional Systems Using Adaptive Functional Estimation (I) |
|
Demetriou, Michael A. | Worcester Polytechnic Institute |
Keywords: Distributed parameter systems
Abstract: This paper considers the synchronization of identical structurally perturbed infinite dimensional systems. Due to the structured perturbations, represented by nonlinear functions of output signals, a standard synchronization via standard consensus protocols cannot provide any form of agreement unless one removes the presence of structured perturbations. When the structured perturbations are estimated adaptively, via an adaptive functional estimation scheme, the inclusion of their adaptive estimates facilitates and enhances the synchronization of the networked systems. Adaptive consensus protocols are applied to both the synchronization controllers ensuring that all networked systems agree with each other, and to the functional estimates ensuring that all functional estimates are synchronized.
|
|
14:45-15:00, Paper WeB08.4 | Add to My Program |
Learning Theory Convergence Rates for Observers and Controllers in Native Space Embedding (I) |
|
Burns, John A | Virginia Tech |
Kurdila, Andrew J. | Virginia Tech |
Oesterheld, Derek | Virginia Tech |
Stilwell, Daniel J. | Virginia Tech |
Wang, Haoran | Virginia Tech |
Keywords: Adaptive control, Distributed parameter systems
Abstract: This paper derives rates of convergence of ap- proximations of observers and controllers arising in the native space embedding method for adaptive estimation and control of a class of nonlinear ordinary differential equations (ODEs) that feature functional uncertainty. The native space embedding method views the nonlinear ODE as a type of distributed parameter system (DPS), and ideal controllers are derived from the DPS representation. Implementable estimators or controllers for the ODE are obtained by approximation of the DPS using history-dependent, scattered bases in the native space. The basis functions are defined in terms of their centers of approximation. This paper shows that for a large collection of choices of the native space, it is possible to derive convergence rates for implementable schemes that are expressed in terms of the fill distance of the centers of approximation in a subset that supports the observation or measurement process. The error bounds are derived in terms of the power function of the reproducing kernel and resemble those derived recently in machine learning theory and Bayesian estimation as applied to discrete stochastic systems.
|
|
15:00-15:15, Paper WeB08.5 | Add to My Program |
Output Model Predictive Control for a Wave Equation (I) |
|
Humaloja, Jukka-Pekka | University of Alberta |
Dubljevic, Stevan | University of Alberta |
Keywords: Distributed parameter systems, Observers for Linear systems, Predictive control for linear systems
Abstract: We present an observer-based model predictive control scheme for a wave equation with non-collocated boundary control and observation. The design comprises a Luenberger-type state estimator combined with a model predictive controller which we show to stabilize the considered wave equation in the sense that the energy of the system decays asymptotically to zero. Moreover, if the initial state estimation error is sufficiently small, the controller is guaranteed to robustly satisfy given input constraints. The performance of the proposed design is demonstrated on a numerical simulation.
|
|
15:15-15:30, Paper WeB08.6 | Add to My Program |
Explicit Backstepping Kernel Solutions for Leak Detection in Branched Pipe Flows (I) |
|
Wilhelmsen, Nils Christian Aars | NTNU |
Aamo, Ole Morten | NTNU |
Keywords: Distributed parameter systems, Observers for Linear systems, Fluid flow systems
Abstract: Explicit solutions are given for a set of n + m linear hyperbolic observer backstepping kernel equations used for leak detection in branched pipe flows. It is identified that the kernel equations can be separated into N+1 distinct Goursat problems for 2(j+1) coupled PDEs each, j in {0,1,...,N} and N+1 being the number of pipes connected via the branching point. Expressing the solutions as infinite matrix power series, the solution to each set of equations is shown to depend on a simplified, scalar Goursat problem, the solution of which is given in terms of derivatives of a modified Bessel function of the first kind. Furthermore, it is shown that the infinite matrix power series expressing the solution writes in terms of modified Bessel functions of the first kind and Marcum Q-functions, as is the case for the previously solved 2×2 constant coefficient case. A numerical example showing adaptive observer gains for leak detection computed via the explicit solutions for multiple operating points of a branched pipe flow is given to illustrate the results.
|
|
WeB09 Regular Session, Aqua 307 |
Add to My Program |
Estimation II |
|
|
Chair: Namerikawa, Toru | Keio University |
Co-Chair: Arioui, Hichem | Evry Val d'Essonne University |
|
14:00-14:15, Paper WeB09.1 | Add to My Program |
Vision-Based Approach for Estimating Lateral Dynamics of Powered Two-Wheeled Vehicles |
|
Alrazouk, Obaida | IBISC LAB |
Chellali, Amine | IBISC LAB |
Nehaoua, Lamri | Evry Univeristy |
Arioui, Hichem | Evry Val d'Essonne University |
Keywords: Estimation, Automotive systems, Automotive control
Abstract: A novel approach based on a mounted monocular camera and an IMU to estimate the lateral dynamics (sideslip angle and lateral velocity) of Powered Two-Wheeled Vehicles (P2WV) is proposed in this paper. This approach is divided into two parts, the estimation of the derivative of the lateral offset and the relative heading. For the first, a method based on inverse perspective mapping to detect the lane markers and fit them into a second-degree polynomial that gives the lateral offset, then the derivative is estimated from the latter. For the relative heading, a method based on detecting the vanishing point is used to estimate it. Finally, these two main parameters are used to estimate the sideslip angle and the lateral velocity. The proposed method is tested on a popular motorcycle simulator software (BikeSim) throughout several scenarios on straight and curved roads with high and low speeds while executing a Double Lane Change (DLC) maneuver. The proposed method shows promising results in terms of error and real-time execution.
|
|
14:15-14:30, Paper WeB09.2 | Add to My Program |
Secure State Estimation for Multi-Agent Systems: On the Relationship between the Number of Agents and System Resilience |
|
Shinohara, Takumi | Keio University |
Namerikawa, Toru | Keio University |
Keywords: Estimation, Linear systems, Agents-based systems
Abstract: This paper addresses the problem of secure state estimation in multi-agent systems consisting of n agents. For the problem, the sparse observability index is a well-known measure of the system resilience. This index is defined as the largest integer δ for which the system observability is preserved even if any δ sensors are eliminated. The larger δ is, the more the system state can be reconstructed even if the system is subjected to more sensor attacks. In this paper, we analyze the relationship between the number of agents n and the sparse observability index δ especially when the network structure is path and cycle graphs. Intuitively, it is assumed that, as the number of agents n increases, the system resilience, i.e., δ, increases accordingly. However, as this study provides, δ does not monotonically increase with an increase in n. In path graphs, δ depends on whether n+1 is prime or composite, while in cycle graphs, δ depends on whether n is prime or composite. In path graphs, δ depends on whether n+1 is prime or composite, and the system resilience is enhanced when n+1 is prime. In cycle graphs, δ obeys whether n is prime or composite, and the system is more resilient when n is prime.
|
|
14:30-14:45, Paper WeB09.3 | Add to My Program |
Observability Gramian for Bayesian Inference in Nonlinear Systems with Its Industrial Application |
|
Lee, Kunwoo | Kyoto University |
Umezu, Yusuke | Kawasaki Heavy Industries, Ltd |
Konno, Kaiki | Kawasaki Heavy Industries, Ltd |
Kashima, Kenji | Kyoto University |
Keywords: Estimation, Machine learning, Observers for nonlinear systems
Abstract: In this paper, we present a novel (empirical) observability Gramian for nonlinear stochastic systems in the light of Bayesian inference. First, we define our observability Gramian, which we refer to as the estimability Gramian, based on the relation to the so-called Bayesian Fisher information matrix for initial state estimation. Then, we study the fundamental properties of an empirical version of the estimability Gramian. The practical usefulness of the proposed framework is examined through its application to a parameter and initial state estimation in a natural gas engine cylinder.
|
|
14:45-15:00, Paper WeB09.4 | Add to My Program |
Real-Time Clamping Force Estimation of Brake By-Wire System for Electric Autonomous Vehicles |
|
Wei, Wenpeng | Southeast University |
He, Tianyi | Utah State University |
Pal, Anuj | Michigan State University |
Keywords: Estimation, Observers for nonlinear systems, Automotive systems
Abstract: Brake-by-wire systems are prevailing more than ever for electric autonomous vehicles owing to several advantages, such as faster clamping force response, more accurate clamping force control, and more convenient to be integrated with other vehicle active safety functions. This paper mainly focuses on Electromechanical Brake (EMB) system. However, due to high sensor costs and limited sensor installation spaces, wide applications of EMB systems are prevented. One such example is the clamping force sensor. To replace the clamping force sensor, an innovative clamping force estimation method, i.e., the Braking Force Separation Strategy, is proposed in this paper. Different from existing estimation approaches that model the clamping force as a nonlinear polynomial function of motor position and brake gap distance, the proposed approach estimates the clamping force without brake gap distance. The decoupling is achieved by the application of an unknown input observer that enables simultaneous states and inputs estimation. To demonstrate the effectiveness of the proposed algorithm, simulations are performed under different input scenarios with or without fictitious measurement noise. The results show that the braking force can be accurately estimated by the proposed algorithm. It also demonstrates better performance by comparing with traditional method.
|
|
15:00-15:15, Paper WeB09.5 | Add to My Program |
On Challenges in Coordinate Transformation for Using a High-Gain Multi-Output Nonlinear Observer |
|
Alai, Hamidreza | University of Minnesota |
Zemouche, Ali | CRAN UMR CNRS 7039 & Inria: EPI-DISCO |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Estimation, Observers for nonlinear systems, Control applications
Abstract: This paper considers the design of a multi-output high gain observer for a vehicle trajectory tracking application. The high gain observer approach offers the advantages of guaranteed feasibility and global stability with just one constant observer gain for this application. The challenges of transforming the vehicle dynamic model into the required companion form for applying the high gain observer technique are addressed. Transforming a traditional kinematic model to companion form is found to result in an increased number of states. Instead, a coordinate transformation that allows for varying velocity and varying slip angle is shown to be appropriate. The high gain observer methodology for a dynamic system with multiple outputs is presented and the calculation of the Lipschitz constant for the vehicle tracking application is discussed.
|
|
15:15-15:30, Paper WeB09.6 | Add to My Program |
Comparative Analysis of a Nonlinear Observer and Nonlinear Kalman Filters for Magnetic Position Estimation |
|
Movahedi, Hamidreza | University of Minnesota |
Zemouche, Ali | CRAN UMR CNRS 7039 & Inria: EPI-DISCO |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Estimation, Observers for nonlinear systems, Mechatronics
Abstract: This paper analyzes state estimation for nonlinear systems in the presence of sensor noise and process disturbances. A class of systems involving nonlinear functions of vector arguments in the output equations is considered. A nonlinear observer that satisfies a H_∞ disturbance rejection constraint in addition to providing asymptotic stability in the absence of disturbances is developed using Lyapunov analysis. The observer is shown analytically to provide a guaranteed upper bound on the norm of the estimation error. The performance of the nonlinear observer is compared with the performance of two of the most popular and powerful methods for estimation in nonlinear systems - the extended Kalman filter (EKF) and the unscented Kalman filter (UKF). A magnetic position estimation problem is utilized as the real-world application for the evaluation. In the case of the disturbances being gaussian noise, the UKF and the nonlinear observer provide approximately the same level of performance and they both surpass the performance of the EKF. However, in the case of 2-norm-bounded non gaussian noise such as spikes/ pulses, the nonlinear observer is shown to significantly outperform both the UKF and the EKF.
|
|
WeB10 Regular Session, Aqua 309 |
Add to My Program |
Agents-Based Systems II |
|
|
Chair: Ramirez-Neria, Mario | Universidad Iberoamericana Ciudad De Mexico |
Co-Chair: Mandal, Nirabhra | University of California San Diego |
|
14:00-14:15, Paper WeB10.1 | Add to My Program |
Distance-Based Formation-Motion Control for Unicycle Agents |
|
Chen, Jin | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Garcia de Marina, Hector | Universidad De Granada |
Keywords: Agents-based systems, Cooperative control, Nonholonomic systems
Abstract: We present a distance-based distributed formation-motion control law for unicycle agents that are required to move together at a constant reference speed. The distributed control law consists of the standard distance-based formation gradient control law, which is projected to the longitudinal and angular velocity inputs of the unicycle, and of a linear term that depends on the reference speed. The main contribution of this paper is to realize the consistency of unicycle agents' orientations without orientation measurement, thereby reducing the possibility of significant orientation measurement errors in real-world applications. We prove and show numerically the local exponential convergence of the unicycle agents to the desired formation, where they eventually move in unison at a constant reference speed.
|
|
14:15-14:30, Paper WeB10.2 | Add to My Program |
Private and Accurate Decentralized Optimization Via Encrypted and Structured Functional Perturbation |
|
Zhou, Yijie | The Chinese University of Hong Kong, Shenzhen |
Pu, Shi | The Chinese University of Hong Kong, Shenzhen |
Keywords: Optimization algorithms, Decentralized control
Abstract: We propose a decentralized optimization algorithm that preserves the privacy of agents' cost functions without sacrificing accuracy, termed EFPSN. The algorithm adopts Paillier cryptosystem to construct zero-sum functional perturbations. Then, based on the perturbed cost functions, any existing decentralized optimization algorithm can be utilized to obtain the accurate solution. We theoretically prove that EFPSN is (epsilon, delta)-differentially private and can achieve nearly perfect privacy under deliberate parameter settings. Numerical experiments further confirm the effectiveness of the algorithm.
|
|
14:30-14:45, Paper WeB10.3 | Add to My Program |
Distributed Hybrid Attitude Estimation for Multi-Agent Systems on SO(3) |
|
Boughellaba, Mouaad | Lakehead University |
Tayebi, Abdelhamid | Lakehead University |
Keywords: Agents-based systems, Estimation, Observers for nonlinear systems
Abstract: We consider the problem of distributed attitude estimation of multi-agent systems, evolving on SO(3), relying on individual angular velocity and relative attitude measurements. We propose a nonlinear distributed hybrid attitude estimation scheme guaranteeing global asymptotic convergence of the attitude estimation errors to a common constant orientation, under an undirected, connected and acyclic graph topology. Moreover, in the presence of a leader in the group (knowing its absolute orientation), one can guarantee global asymptotic convergence of the attitude estimation errors to zero. Numerical simulation results are presented to illustrate the performance of our proposed scheme.
|
|
14:45-15:00, Paper WeB10.4 | Add to My Program |
Opinion Dynamics for Utility Maximizing Agents with Heterogeneous Resources |
|
Wankhede, Prashil | Indian Institute of Science |
Mandal, Nirabhra | University of California San Diego |
Tallapragada, Pavankumar | Indian Institute of Science |
Keywords: Agents-based systems, Game theory, Network analysis and control
Abstract: In this paper we propose a continuous-time non-linear model of opinion dynamics. One of the main novelties of our model is that it costs resources for an agent to express an opinion. Each agent receives a utility based on the complete opinion profile of all agents. Each agent seeks to maximize its own utility function by suitably revising its opinion and the proposed dynamics arises from all agents simultaneously doing this. For the proposed model, we show ultimate boundedness of opinions. We also show stability of equilibrium points and convergence to an equilibrium point when all agents are non-contrarian. We give conditions for the existence of a consensus equilibrium and analyze the role that resources play in determining the social power of the agents in terms of the deviation of the consensus value from the agents' internal preference. We also carry out a Nash equilibrium analysis of the underlying game and show that when all agents are non-contrarian, the set of equilibria of the opinion dynamics is the same as the set of Nash equilibria for the underlying game. We illustrate our results using simulations.
|
|
15:00-15:15, Paper WeB10.5 | Add to My Program |
A Distributed Observer for Consensus of Multi-Agent Systems under Cyber Attack |
|
Jinman, Yang | Harbin Institute of Technology, Shenzhen |
Li, Peng | Harbin Institute of Technology, Shenzhen |
Keywords: Agents-based systems, Identification for control, Adaptive control
Abstract: In this paper, the consensus problem of the leader following system is addressed by considering the false data injection (FDI) attacks. An integral-based observer induced by nonlinear activation functions is proposed. By suitably choosing the activation function, the proposed observer is able to attenuate the effects of the fault data injection attack and achieve a no-error consensus. The consensus property and the attack rejection feature are analyzed considering both constant and time-varying attacks. It has been proven that the proposed method is able to mitigate the effects of different types of attacks. The effectiveness of the proposed observer is been verified by extensive numerical examples with comparisons to recent methods.
|
|
15:15-15:30, Paper WeB10.6 | Add to My Program |
Consensus Robustness under PI Protocols with Undirected Graphs |
|
Mao, Qi | City University of Hong Kong |
Ma, Dan | Northeastern University |
Ding, Yanling | City University of Hong Kong |
Peng, Hui | Guangdong University of Technology, School of Automation |
Chen, Jie | City University of Hong Kong |
Keywords: Stability of linear systems, PID control, Agents-based systems
Abstract: This paper concerns robust consensus problems for continuous-time first-order multi-agent systems with respect to uncertain gain and phase of the agent's frequency response varying within certain ranges. We consider dynamic output feedback control protocol in the form of proportional-integral (PI) control. The agents can in general be nonminimum phase and unstable systems, which are interconnected by an undirected graph network. We seek to determine the largest ranges of gain and phase variations, referred to as the gain consensus margin (GCM) and the phase consensus margin (PCM), respectively, so that consensus can be achieved robustly within these ranges. Our main results consist of explicit analytical expressions of the GCM and PCM, which demonstrate how the agent's unstable pole and nonminimum phase zero, as well as the network connectivity may fundamentally confine the gain and phase variation ranges so that consensus can or cannot be maintained.
|
|
WeB11 Regular Session, Aqua Salon AB |
Add to My Program |
Game Theory II |
|
|
Chair: Poveda, Jorge I. | University of California, San Diego |
Co-Chair: Hu, Jianghai | Purdue University |
|
14:00-14:15, Paper WeB11.1 | Add to My Program |
Competitive Information Provision among Internet Routing Nodes |
|
Seaton, Joshua | University of Colorado at Colorado Springs |
Hounsinou, Sena | University of Colorado Colorado Springs |
Bloom, Gedare | University of Colorado Colorado Springs |
Brown, Philip N. | University of Colorado, Colorado Springs |
Keywords: Game theory, Communication networks
Abstract: In proposed path-aware designs for the Internet, end hosts can select which path their packets use. What criteria should the end hosts use to select paths? Recent work has proposed path-aware access control frameworks in which routing nodes publicly report their knowledge of the security postures of other nodes; end hosts can then base their routing choices on these reports. However, nothing is known regarding the nodes' incentives to report their knowledge truthfully. In this paper, we consider the case in which each network node is strategic, and seeks to craft its public reports to manipulate traffic patterns in its own favor. In the context of a simple selfish routing problem with two strategic nodes, we show that for a wide swath of the parameter space, each node has a dominant reporting strategy, meaning that its individually optimal strategy does not depend on the strategy of the other node. These dominant strategies are generally not truthful. At the resulting dominant-strategy Nash equilibrium, we show that the expected social cost is (often considerably) higher than that achieved when both nodes are completely truthful. Nonetheless, we prove that these equilibrium reporting strategies are never perverse, meaning that their resulting social cost is never worse than if traffic were uninformed as to network state.
|
|
14:15-14:30, Paper WeB11.2 | Add to My Program |
On the Convergence Rates of a Nash Equilibrium Seeking Algorithm in Potential Games with Information Delays |
|
Huang, Yuanhanqing | Purdue University |
Hu, Jianghai | Purdue University |
Keywords: Game theory, Delay systems, Large-scale systems
Abstract: This paper investigates the equilibrium convergence properties of a proposed algorithm for potential games with continuous strategy spaces in the presence of feedback delays, a main challenge in multi-agent systems that compromises the performance of various optimization schemes. The proposed algorithm is built upon an improved version of the accelerated gradient descent method. We extend it to a decentralized multi-agent scenario and equip it with a delayed feedback utilization scheme. By appropriately tuning the step sizes and studying the interplay between delay functions and step sizes, we derive the convergence rates of the proposed algorithm to the optimal value of the potential function when the growth of the feedback delays in time is subject to sublinear, linear, and superlinear upper bounds. Finally, simulations of a routing game are performed to empirically verify our findings.
|
|
14:30-14:45, Paper WeB11.3 | Add to My Program |
High-Order Decentralized Pricing Dynamics for Congestion Games: Harnessing Coordination to Achieve Acceleration |
|
Chen, Yilan | University of Colorado Boulder |
Ochoa, Daniel E. | University of California San Diego |
Poveda, Jorge I. | University of California, San Diego |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Hybrid systems, Networked control systems
Abstract: We introduce a class of decentralized high-order pricing dynamics (HOPD) for the solution of optimal incentive problems in affine congestion games with full resource utilization. The dynamics incorporate momentum and decentralized coordinated resets to achieve better transient performance compared to traditional first-order gradient-based pricing algorithms. The proposed dynamics are studied using tools from graph theory, game theory, and hybrid dynamical systems theory. Our main results establish suitable stability and convergence properties with respect to the set of incentives that generate Nash flows that also maximize the social welfare function of the game. The theoretical results are illustrated via numerical examples in two different types of communication graphs, highlighting the effect of the communication topology and the coordination between players on the transient performance of the HOPD.
|
|
14:45-15:00, Paper WeB11.4 | Add to My Program |
Excess Payoff Evolutionary Dynamics with Strategy-Dependent Revision Rates: Convergence to Nash Equilibria for Potential Games |
|
Kara, Semih | University of Maryland, College Park |
Martins, Nuno C. | University of Maryland |
Keywords: Game theory, Lyapunov methods, Optimization
Abstract: Evolutionary dynamics in the context of population games models the dynamic non-cooperative strategic interactions among many nondescript agents. Each agent follows one strategy at a time from a finite set. A game assigns a payoff to each strategy as a function of the so-called population state vector, whose entries are the proportions of the population adopting the available strategies. Each agent repeatedly revises its strategy according to a revision protocol. We focus on a well-known class of protocols that prioritizes strategies with higher excess payoffs relative to a population-weighted average. In contrast to existing work for these protocols, we allow each agent’s revision rate to depend explicitly on its current strategy. Motivated by applications and relevance to distributed optimization, we focus on potential games and investigate the population state’s convergence to the game’s Nash equilibria. Our contributions are twofold: (1) For the considered protocol class, prior work established conditions that ensure convergence under strategy-independent revision rates. We show that these conditions may be violated when the revision rates are strategy-dependent. (2) We prove that a minor, well-motivated modification of the considered protocol class satisfies these conditions for any strategy-dependent revision rates. We also illustrate our results using a distributed task allocation example.
|
|
15:00-15:15, Paper WeB11.5 | Add to My Program |
Sample Complexity of Decentralized Tabular Q-Learning for Stochastic Games |
|
Gao, Zuguang | The University of Chicago |
Ma, Qianqian | Boston University |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Birge, John | University of Chicago |
Keywords: Game theory, Markov processes, Learning
Abstract: In this paper, we carry out finite-sample analysis of decentralized Q-learning algorithms in the tabular setting for a significant subclass of general-sum stochastic games (SGs) – weakly acyclic SGs, which includes potential games and Markov team problems as special cases. In the practical while challenging decentralized setting, neither the rewards nor the actions of other agents can be observed by each agent. In fact, each agent can be completely oblivious to the presence of other decision makers. In this work, the sample complexity of the decentralized tabular Q-learning algorithm to converge to a Markov perfect equilibrium is developed.
|
|
15:15-15:30, Paper WeB11.6 | Add to My Program |
Nash Equilibria for Exchangeable Team against Team Games and Their Mean Field Limit |
|
Sanjari, Sina | University of Illinois at Urbana-Champaign |
Saldi, Naci | Bilkent University |
Yuksel, Serdar | Queen's University |
Keywords: Game theory, Mean field games, Decentralized control
Abstract: We study stochastic mean-field games among finite number of teams each with large finite as well as infinite numbers of decision makers (DMs). We establish the existence of a Nash equilibrium (NE) and show that a NE exhibits exchangeability in the finite DM regime and symmetry in the infinite one. We establish the existence of a randomized NE that is exchangeable (not necessarily symmetric) among DMs within each team for a general class of exchangeable stochastic games. As the number of DMs within each team drives to infinity (that is for the mean-field games among teams), using a de Finetti representation theorem, we establish the existence of a randomized NE that is symmetric (i.e., identical) among DMs within each team and also independently randomized. Finally, we establish that a NE for a class of mean-field games among teams (which is symmetric) constitutes an approximate NE for the corresponding pre-limit game among teams with mean-field interaction and large but finite number of DMs.
|
|
WeB12 Invited Session, Aqua Salon C |
Add to My Program |
Advanced Vehicle Safety Controls |
|
|
Chair: Zhao, Junfeng | Arizona State University |
Co-Chair: Chen, Pingen | Tennessee Technological University |
Organizer: Zhao, Junfeng | Arizona State University |
Organizer: Wang, Zejiang | Oak Ridge National Laboratory |
Organizer: HomChaudhuri, Baisravan | Illinois Institute of Technology |
Organizer: Chen, Pingen | Tennessee Technological University |
|
14:00-14:15, Paper WeB12.1 | Add to My Program |
Disturbance Observers for Robust Safety-Critical Control with Control Barrier Functions (I) |
|
Alan, Anil | University of Michigan |
Molnar, Tamas G. | California Institute of Technology |
Das, Ersin | Caltech |
Ames, Aaron D. | California Institute of Technology |
Orosz, Gabor | University of Michigan |
Keywords: Observers for nonlinear systems, Robust control, Autonomous vehicles
Abstract: This work provides formal safety guarantees for control systems with disturbance. A disturbance observer-based robust safety-critical controller is proposed, that estimates the effect of the disturbance on safety and utilizes this estimate with control barrier functions to attain provably safe dynamic behavior. The observer error bound – which consists of transient and steady-state parts – is quantified, and the system is endowed with robustness against this error via the proposed controller. An adaptive cruise control problem is used as illustrative example through simulations including real disturbance data.
|
|
14:15-14:30, Paper WeB12.2 | Add to My Program |
An Obstacle-Avoidance Receding Horizon Control Scheme for Constrained Differential-Drive Robot Via Dynamic Feedback Linearization (I) |
|
Tiriolo, Cristian | Concordia University |
Franze, Giuseppe | Universita' Della Calabria |
Lucia, Walter | Concordia University |
Keywords: Constrained control, Feedback linearization, Autonomous robots
Abstract: This paper proposes a collision avoidance control strategy for constrained differential-drive robots moving in static but unknown obstacle scenarios. We assume that the robot is equipped with an on-board path planner providing a sequence of obstacle-free waypoints, and we design an ad-hoc constrained control strategy for ensuring absence of collisions and velocity constraints fulfillment. To this end, the nonlinear robot kinematics is redefined via a dynamic feedback linearization procedure, while a receding horizon control strategy is tailored to deal with time-varying state and input constraints. First, by considering the worst-case constraints realization, a conservative solution is offline determined to guarantee stability, recursive feasibility, and absence of collisions. Then, online, the tracking performance is significantly improved leveraging a non-conservative representation of the input constraints and set-theoretical containment conditions. Simulation results involving a differential-drive robot operating in a maze-like obstacle scenario are presented to show the effectiveness of the proposed solution.
|
|
14:30-14:45, Paper WeB12.3 | Add to My Program |
Failure-Safe Control for Automated Driving by MPC with Integrated Evasive Maneuvers (I) |
|
Skibik, Terrence | University of Colorado Boulder |
P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Weiss, Avishai | Mitsubishi Electric Research Labs |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Automotive control, Constrained control, Predictive control for linear systems
Abstract: Automated vehicles may encounter non-nominal situations called failure scenarios, due for instance to errors in perception or environment prediction. In some failure scenarios, a risk area must suddenly be avoided, possibly at the price of no longer satisfying all the constraints enforced in nominal driving conditions. We propose a design for a failure-safe controller that operates the vehicle according to the specifications in nominal conditions, while ensuring that, should a known failure occur, an evasive maneuver can be performed that avoids the risk area and satisfies a, possibly relaxed, set of driving constraints. We design evasive maneuver controllers parametrized in their reference, and we leverage set based methods to determine the region where such controllers satisfy the constraints and avoid the risk area. Membership in such a region during nominal operation is achieved by imposing additional constraints on the controller for nominal driving. We demonstrate the approach in simulations in a few different scenarios.
|
|
14:45-15:00, Paper WeB12.4 | Add to My Program |
A Novel Trust-Based Shared Steering Control for Automated Vehicles with Tire Blowout (I) |
|
Li, Ao | Arizona State University |
Chen, Yan | Arizona State University |
Lin, Wen-Chiao | General Motors Global R&D |
Du, Xinyu | General Motors Global R&D |
Keywords: Automotive control, Automotive systems, Cooperative control
Abstract: Tire blowout strongly affects vehicle stability and road safety by introducing sudden and intensive tire force disturbances. In such an emergent event, vehicles equipped with an automatic controller for normal path following (i.e., SAE driving automation level 2/3) may have degraded performance, which requires cooperation with a human driver. Considering a panicked driver could perform improper or wrong operations, this paper proposes a novel trust-based shared steering control for vehicle stabilization after tire blowout. Based on specific and crucial factors in tire blowout events, the driver’s steering input and the resulting driver’s fault and performance are comprehensively evaluated in a designed controller-to-human (C2H) trust module. The real-time computational trust simultaneously adjusts the cooperative level in a dynamic control authority allocation function. Matlab/Simulink and CarSim® co-simulation results validate that the proposed shared control can effectively enhance vehicle stability and driving safety after tire blowout, even with the faulty and excessive steering inputs from a panicked driver.
|
|
15:00-15:15, Paper WeB12.5 | Add to My Program |
On Queue Length Estimation in Urban Traffic Intersections Via Inductive Loops (I) |
|
Sassella, Andrea | Politecnico Di Milano |
Abbracciavento, Francesco | Politecnico Di Milano |
Formentin, Simone | Politecnico Di Milano |
Savaresi, Sergio M. | Politecnico Di Milano |
Bianchessi, Andrea Giovanni | SCAE S.p.A |
Keywords: Traffic control
Abstract: Queue length estimation in urban intersections represents a crucial issue for real-time traffic flows optimization. In this paper, we discuss different approaches to estimate the queue length using only inductive loops over two possible sensor layouts. Firstly, a model describing the queue dynamics is derived and a methodology to estimate the queue length at the preceding traffic light cycle is formulated, under the assumption of a single inductive loop sensor. Then, two strategies relying on a double-sensor layout are investigated to improve the quality of the queue length estimate, showing the potential of the use of a second inductive loop. The effectiveness of the proposed techniques is validated both on real-world data and through a microscopic traffic simulator, where real-world traffic profiles are employed.
|
|
15:15-15:30, Paper WeB12.6 | Add to My Program |
Automated Vehicle Highway Merging: Motion Planning Via Adaptive Interactive Mixed-Integer MPC (I) |
|
Bhattacharyya, Viranjan | Clemson University |
Vahidi, Ardalan | Clemson University |
Keywords: Automotive control, Autonomous systems, Optimal control
Abstract: A new motion planning framework for automated highway merging is presented in this paper. To plan the merge and predict the motion of the neighboring vehicle, the ego automated vehicle solves a joint optimization of both vehicle costs over a receding horizon. The non-convex nature of feasible regions and lane discipline is handled by introducing integer decision variables resulting in a mixed integer quadratic programming (MIQP) formulation of the model predictive control (MPC) problem. Furthermore, the ego uses an inverse optimal control approach to impute the weights of neighboring vehicle cost by observing the neighbor’s recent motion and adapts its solution accordingly. We call this adaptive interactive mixed integer MPC (aiMPC). Simulation results show the effectiveness of the proposed framework.
|
|
WeB13 Regular Session, Aqua Salon D |
Add to My Program |
Distributed Control II |
|
|
Chair: Braitor, Andrei-Constantin | CentraleSupélec |
Co-Chair: Werner, Herbert | Hamburg University of Technology |
|
14:00-14:15, Paper WeB13.1 | Add to My Program |
Bounded Voltage Regulation in a Direct Current Microgrid Using Barrier Lyapunov Function with Uncertain Load Current |
|
Singh, Shubham | Indian Institute of Technology, Jodhpur |
Vaishnav, Vaibhav | Indian Institute of Technology, Jodhpur |
Jain, Anoop | Indian Institute of Technology, Jodhpur, India |
Sharma, Dushyant | IIT (ISM) Dhanbad |
Keywords: Distributed control, Smart grid, Networked control systems
Abstract: A distributed control methodology for steady-state voltage regulation, along with a bounded transient response in a Direct Current (DC) microgrid, is proposed in this letter. The idea of asymmetric barrier Lyapunov function (ABLF) is used in realizing the prescribed constraints on the voltage magnitudes while considering the uncertain nature of the load current. By proposing a suitable load current estimator, we show that the voltage regulation is achieved within prescribed limits, and the closed-loop system remains stable under the proposed controller. We also obtain explicit bounds on the voltage and current errors, virtual controller, and estimated load current in the post-design analysis. Simulations illustrate the theoretical findings and the robustness of the proposed scheme to load perturbations.
|
|
14:15-14:30, Paper WeB13.2 | Add to My Program |
Distributed Bounded Consensus-Based Control for Multi-Agent Systems with Undirected Graphs |
|
Braitor, Andrei-Constantin | CentraleSupélec |
Iovine, Alessio | CNRS |
Siguerdidjane, Houria | Supelec |
Keywords: Distributed control, Networked control systems, Smart grid
Abstract: In this paper, we propose a distributed consensus-based control method that guarantees agent states limitation for networks of multi-agent systems with undirected communication graphs. This approach takes into account the general linear dynamic models of n identical agents and, by employing Lyapunov methods and ultimate boundedness theory, it ensures that each agent state remains within given bounds. This latter feature is particularly useful in scenarios where one must mitigate the occurrence of abnormal values being transmitted within the network, thus, maintaining a relatively safe consensus policy between agents. The developed strategy requires, at each agent node, information from their respective neighbours only, and it can be applied independently of any global information of the communication graph, hence, it is fully distributed. To highlight the developed controller capability and effectiveness, a multi-converter microgrid system with a meshed communication network has been used as a practical example. Subsequent to the asymptotic stability proof for the overall closed-loop microgrid system, the control framework has been tested in a predetermined scenario.
|
|
14:30-14:45, Paper WeB13.3 | Add to My Program |
Distributed Continuous-Time Resource Allocation Algorithm for Networked Double-Integrator Systems with Time-Varying Non-Identical Hessians and Resources |
|
Ding, Yong | University of California, Riverside |
Ren, Wei | University of California, Riverside |
Meng, Ziyang | Tsinghua University |
Keywords: Distributed control, Optimization
Abstract: This paper investigates the optimal resource allocation problem for networked double-integrator systems with time-varying cost functions and resources. Due to the coexistence of challenges caused by non-identical Hessians and more complicated agents' dynamics, the extension from existing related results on single-integrator agents is nontrivial. A distributed algorithm is proposed to address the time-varying resource allocation problem and achieve the exact optimum tracking. Finally, an example is provided to illustrate the effectiveness of the proposed algorithm.
|
|
14:45-15:00, Paper WeB13.4 | Add to My Program |
A Distributed Time-Varying Optimization Algorithm for Networked Lagrangian Agents Generating Continuous Control Torques |
|
Ding, Yong | University of California, Riverside |
Wang, Hanlei | Beijing Institute of Control Engineering |
Mei, Jie | Harbin Institute of Technology, Shenzhen |
Ren, Wei | University of California, Riverside |
Keywords: Distributed control, Optimization
Abstract: In this paper, the distributed time-varying optimization problem is investigated for networked Lagrangian systems with parametric uncertainties. Due to the usage of the signum function in the control torque design, there might exist chattering while implementing the distributed time-varying optimization algorithms for networked Lagrangian agents in the existing works. To this end, we design a distributed optimization algorithm that is capable of generating continuous control torques and achieving exact optimum tracking. A simulation is presented to validate the effectiveness of the proposed algorithm.
|
|
15:00-15:15, Paper WeB13.5 | Add to My Program |
Distributed Connectivity Keeping Supervision Scheme with Collision and Obstacle Avoidance Requirements |
|
Casavola, Alessandro | Universita' Della Calabria |
D'Angelo, Vincenzo | Università Della Calabria |
El Qemmah, Ayman | Università Della Calabria |
Tedesco, Francesco | Università Della Calabria |
Torchiaro, Franco Angelo | University of Calabria |
Keywords: Distributed control, Predictive control for linear systems, Autonomous systems
Abstract: This paper investigates a connectivity preserving coordination problem for a group of dynamically decoupled fully actuated Unmanned Surface Vehicles (USVs) that are subject to both collision and obstacle avoidance constraints. An existing supervision scheme based on Distributed Command Governor ideas is here extended in order to obtain motion coordination of vehicles. To this end, the proposed method enforces the existence of a specific minimum spanning tree at each time instant while fulfilling obstacle avoidance and collision avoidance constraints, by using a separation hyperplane and solving on-line a linear constrained optimization problem involving a mixture of continuous and integer variables. Conditions that formally ensure the feasibility of the proposed strategy are included along with simulations that show the effectiveness of the proposed method.
|
|
15:15-15:30, Paper WeB13.6 | Add to My Program |
Distributed Model Predictive Flocking with Obstacle Avoidance and Asymmetric Interaction Forces |
|
Hastedt, Philipp | Hamburg University of Technology |
Werner, Herbert | Hamburg University of Technology |
Keywords: Distributed control, Predictive control for linear systems, Networked control systems
Abstract: In most of the existing literature on predictive flocking, the characteristic swarm behavior is formulated in terms of cost functions of quadratic optimization problems with attractive and repulsive interaction forces of equal strength. In this paper, we propose a distributed model predictive flocking framework in which attractive and repulsive interaction forces can be tuned independently by implementing the rules of flocking as softened inequality constraints. The presented framework is able to handle input constraints, obstacle avoidance, and the pursuit of group objectives. The performance of the proposed algorithms is validated in simulation.
|
|
WeB14 Regular Session, Aqua 311A |
Add to My Program |
Predictive Control I |
|
|
Chair: Malikopoulos, Andreas A. | University of Delaware |
Co-Chair: Koeln, Justin | University of Texas at Dallas |
|
14:00-14:15, Paper WeB14.1 | Add to My Program |
Optimal Weight Adaptation of Model Predictive Control for Connected and Automated Vehicles in Mixed Traffic with Bayesian Optimization |
|
Le, Viet-Anh | University of Delaware |
Malikopoulos, Andreas A. | University of Delaware |
Keywords: Human-in-the-loop control, Multivehicle systems
Abstract: In this paper, we develop an optimal weight adaptation strategy of model predictive control (MPC) for connected and automated vehicles (CAVs) in mixed traffic. We model the interaction between a CAV and a human-driven vehicle (HDV) as a simultaneous game and formulate a game-theoretic MPC problem to find a Nash equilibrium of the game. In the MPC problem, the weights in the HDV’s objective function can be learned online using moving horizon inverse reinforcement learning. Using Bayesian optimization, we propose a strategy to optimally adapt the weights in the CAV’s objective function so that the expected true cost when using MPC in simulations can be minimized. We validate the effectiveness of the optimal strategy by numerical simulations of a vehicle crossing example at an unsignalized intersection.
|
|
14:15-14:30, Paper WeB14.2 | Add to My Program |
Flux Exponent Control Predicts Metabolic Dynamics from Network Structure |
|
Xiao, Fangzhou | California Institute of Technology |
Li, Jing Shuang | California Institute of Technology |
Doyle, John C. | Caltech |
Keywords: Metabolic systems, Predictive control for nonlinear systems, Cellular dynamics
Abstract: Metabolic dynamics such as stability of steady states, oscillations, lags and growth arrests in stress responses are important for microbial communities in human health, ecology, and metabolic engineering. Yet it is hard to model due to sparse data available on trajectories of metabolic fluxes. For this reason, a constraint-based approach called flux control (e.g., flux balance analysis) was invented to split metabolic systems into known stoichiometry (plant) and unknown fluxes (controller), so that data can be incorporated as refined constraints, and optimization can be used to find behaviors in scenarios of interest. However, flux control can only capture steady state fluxes well, limiting its application to scenarios with days or slower timescales. To overcome this limitation and capture dynamic fluxes, this work proposes a novel constraint-based approach, flux exponent control (FEC). FEC uses a different plant-controller split between the activities of catalytic enzymes and their regulation through binding reactions. Since binding reactions effectively regulate fluxes' exponents (from previous works), this yields the rule of FEC, that cells regulate fluxes' exponents, not the fluxes themselves as in flux control. In FEC, dynamic regulations of metabolic systems are solutions to optimal control problems that are computationally solvable via model predictive control. Glycolysis, which is known to have minute-timescale oscillations, is used as an example to demonstrate FEC can capture metabolism dynamics from network structure. More generally, FEC brings metabolic dynamics to the realm of control system analysis and design.
|
|
14:30-14:45, Paper WeB14.3 | Add to My Program |
Long Duration Stochastic MPC with Mission-Wide Probabilistic Constraints Using Waysets |
|
Raghuraman, Vignesh | The University of Texas at Dallas |
Koeln, Justin | University of Texas at Dallas |
Keywords: Predictive control for linear systems, Stochastic optimal control, Constrained control
Abstract: A stochastic Model Predictive Control (MPC) formulation is presented for systems with finite operation subject to constraints on the Mission-Wide Probability of Safety (MWPS). For linear discrete-time systems subject to unknown disturbances, the goal is to formulate an MPC controller to achieve a desired probability of mission success, where the entire closed-loop state and input trajectories stay within the state and input constraint sets and the final state reaches the desired terminal set. To enable longer missions under greater uncertainty, a wayset-based approach is proposed that allows for the prediction horizon of the MPC to be significantly shorter than the length of the mission. Using a scenario-based approach to stochastic MPC, the use of constrained zonotopes makes the computation of these waysets efficient and practical. Numerical results demonstrate the utility of the waysets for increasing the feasibility of MWPS constraints to longer missions and that the percentage of successful missions asymptotically converges above the desired probability.
|
|
14:45-15:00, Paper WeB14.4 | Add to My Program |
Reference Governor for Input-Constrained MPC to Enforce State Constraints at Lower Computational Cost |
|
Castroviejo-Fernandez, Miguel | University of Michigan |
Leung, Jordan, M | University of Michigan |
Kolmanovsky, Ilya V. | The University of Michigan |
Keywords: Constrained control, Aerospace
Abstract: In this paper, a control scheme is developed based on an input constrained Model Predictive Control (MPC) law and the idea, usual of Reference Governors (RG), of modifying the reference command to enforce constraints. The proposed scheme, referred to as the RGMPC, can handle (possibly nonlinear) state and input constraints and only requires optimization for MPC with polytopic input constraints for which fast algorithms exist. Conditions are given that ensure recursive feasibility of the RGMPC scheme and finite-time convergence of the modified reference command to the desired reference command. Simulation results for a spacecraft rendezvous maneuver with linear and nonlinear constraints demonstrate that the RGMPC scheme has lower average computational time than state and input constrained MPC with similar performance.
|
|
15:00-15:15, Paper WeB14.5 | Add to My Program |
Event-Triggered Multi-Mode MPC with Application to Modular Aerial Vehicles |
|
Casau, Pedro | University of Aveiro |
Guerreiro, Bruno J. N. | Instituto Superior Tecnico, 501 507 930 |
Keywords: Sampled-data control, Optimal control, Hybrid systems
Abstract: Our paper proposes a controller for global asymptotic stabilization of a nonlinear plant by combining multiple model-predictive controllers with event-triggered control. Given a collection of operating modes, the controller drives the state of the closed-loop system to a target operating mode using a sequence of optimal input trajectories. The computation of optimal trajectories takes place at events which are triggered either when the difference between the state of the closed-loop system and the optimal state trajectory exceeds a given threshold or when an internal timer expires. We demonstrate the proposed controller through simulations by applying it to the control of modular aerial vehicles.
|
|
15:15-15:30, Paper WeB14.6 | Add to My Program |
Multi-Contact MPC for Dynamic Loco-Manipulation on Humanoid Robots |
|
Li, Junheng | University of Southern California |
Nguyen, Quan | University of Southern California |
Keywords: Optimal control, Predictive control for linear systems, Robotics
Abstract: This paper presents a novel method to control humanoid robot dynamic loco-manipulation with multiple contact modes via multi-contact Model Predictive Control (MPC) framework. The proposed framework includes a multi-contact dynamics model capable of capturing various contact modes in loco-manipulation, such as hand-object contact and foot-ground contacts. Our proposed dynamics model represents the object dynamics as an external force acting on the system, which simplifies the model and makes it feasible for solving the MPC problem. In numerical validations, our multi-contact MPC framework only needs contact timings of each task and desired states to give MPC the knowledge of changes in contact modes in the prediction horizons in loco-manipulation. The proposed framework can control the humanoid robot to complete multi-task dynamic loco-manipulation applications such as efficiently picking up and dropping off objects while turning and walking.
|
|
WeB15 Regular Session, Aqua 311B |
Add to My Program |
Energy Systems I |
|
|
Chair: Zlotnik, Anatoly | Los Alamos National Laboratory |
Co-Chair: Bitar, Eilyan | Cornell University |
|
14:00-14:15, Paper WeB15.1 | Add to My Program |
Optimal Control of Transient Flows in Pipeline Networks with Heterogeneous Mixtures of Hydrogen and Natural Gas |
|
Baker, Luke | Arizona State University |
Kazi, Saif R. | Los Alamos National Laboratory |
Platte, Rodrigo | Arizona State University |
Zlotnik, Anatoly | Los Alamos National Laboratory |
Keywords: Energy systems, Optimal control, Network analysis and control
Abstract: We formulate a control system model for the distributed flow of mixtures of highly heterogeneous gases through large-scale pipeline networks with time-varying injections of constituents, withdrawals, and control actions of compressors. This study is motivated by the proposed blending of clean hydrogen into natural gas pipelines as an interim means to reducing end use carbon emissions while utilizing existing infrastructure for its planned lifetime. We reformulate the partial differential equations for gas dynamics on pipelines and balance conditions at junctions using lumped elements to a sparse nonlinear differential algebraic equation system. Our key advance is modeling the mixing of constituents in time throughout the network, which requires doubling the state space needed for a single gas and increases numerical ill-conditioning. The reduced model is shown to be a consistent approximation of the original system, which we use as the dynamic constraints in a model-predictive optimal control problem for minimizing the energy expended by applying time-varying compressor operating profiles to guarantee time-varying delivery profiles subject to system pressure limits. The optimal control problem is implemented after time discretization using a nonlinear program, with validation of the results done using a transient simulation. We demonstrate the methodology for a small test network, and discuss scalability and potential applications.
|
|
14:15-14:30, Paper WeB15.2 | Add to My Program |
Optimization of Hydrogen Blending in Natural Gas Networks for Carbon Emissions Reduction |
|
Sodwatana, Mo | Stanford University |
Kazi, Saif R. | Los Alamos National Laboratory |
Sundar, Kaarthik | Los Alamos National Laboratory |
Zlotnik, Anatoly | Los Alamos National Laboratory |
Keywords: Energy systems, Optimization, Fluid flow systems
Abstract: We present an economic optimization problem for allocating the flow of natural gas and hydrogen blends through a large-scale transportation pipeline network. Physical flow of the gas mixture is modeled using a steady-state relation between pressure decrease and flow rate, which depends on mass concentration of the constituents as it varies by location in the network. The objective reflects the economic value provided by the system, accounting for delivered energy in withdrawn flows, the cost of natural gas and hydrogen injections, and avoided carbon emissions. The problem is solved subject to physical flow equations, nodal balance and mixing laws, and engineering inequality constraints. The desired energy delivery rate and minimum hydrogen concentration can be specified as upper and lower bound values, respectively, of inequality constraints, and we examine the sensitivity of the physical pressure and flow solution to these parameters for two test networks. The results confirm that increasing hydrogen concentration requires greater energy expended for compression to deliver the same energy content, and the formulation could be used for valuation of the resulting mitigation of carbon emissions.
|
|
14:30-14:45, Paper WeB15.3 | Add to My Program |
Valuing Uncertainties in Wind Generation: An Agent-Based Optimization Approach |
|
Shen, Daniel | Massachusetts Institute of Technology |
Ilic, Marija | Massachusetts Inst. of Tech |
Keywords: Energy systems, Optimization, Smart grid
Abstract: The increasing integration of variable renewable energy sources such as wind and solar will require new methods of managing generation uncertainty. Existing practices of uncertainty management for these resources largely focuses around modifying the energy offers of such resources in the quantity domain and from a centralized system operator consideration of these uncertainties. This paper proposes an approach to instead consider these uncertainties in the price domain, where more uncertain power is offered at a higher price instead of restricting the quantity offered. We demonstrate system-level impacts on a modified version of the RTS-GMLC system where wind generators create market offers valuing their uncertainties over scenario set of day-ahead production forecasts. The results are compared with a dispatch method in which wind energy is offered at zero marginal price and restricted based on the forecast percentile.
|
|
14:45-15:00, Paper WeB15.4 | Add to My Program |
A Multi-Battery Model for the Aggregate Flexibility of Heterogeneous Electric Vehicles |
|
Al Taha, Feras | Cornell University |
Vincent, Tyrone L. | Colorado School of Mines |
Bitar, Eilyan | Cornell University |
Keywords: Energy systems, Power systems, Optimization
Abstract: The increasing prevalence of electric vehicles (EVs) in the transportation sector will introduce a large number of highly flexible electric loads that EV aggregators can pool and control to provide energy and ancillary services to the wholesale electricity market. To integrate large populations of EVs into electricity market operations, aggregators must express the aggregate flexibility of the EVs under their control in the form of a small number of energy storage (battery) resources that accurately capture the supply/demand capabilities of the individual EVs as a collective. To this end, we propose a novel multi-battery flexibility model defined as a linear combination of a small number of base sets (termed batteries) that reflect the differing geometric shapes of the individual EV flexibility sets, and suggest a clustering approach to identify these base sets. We study the problem of computing a multi-battery flexibility set that has minimum Hausdorff distance to the aggregate flexibility set, subject to the constraint that the multi-battery flexibility set be a subset of the aggregate flexibility set. We show how to conservatively approximate this problem with a tractable convex program, and illustrate the performance achievable by our method with several numerical experiments.
|
|
15:00-15:15, Paper WeB15.5 | Add to My Program |
Reference Optimisation of Uncertain Offshore Hybrid Power Systems with Multi-Stage Nonlinear Model Predictive Control |
|
Hoang, Kiet Tuan | Norwegian University of Science and Technology |
Knudsen, Brage Rugstad | Norwegian University of Science and Technology |
Imsland, Lars | Norwegian University of Science and Technology |
Keywords: Energy systems, Power systems
Abstract: This paper presents a modified multi-stage economic nonlinear model predictive controller (M-ENMPC) for reference optimisation of isolated, uncertain offshore hybrid power systems (OHPSs). These systems require control strategies that can handle significant stochastic disturbances in exogenous power demand and wind, given uncertain forecasts of the disturbances. An M-ENMPC modified with a certainty horizon is formulated to hande uncertain forecasts of these disturbances for reference optimisation of OHPSs. The certainty horizon models the increase in uncertainty of forecasts with time to decrease the cost in the M-ENMPC. Monte Carlo simulations with different realisations of the considered disturbances show that explicitly considering scenarios of the disturbances with the M-ENMPC can decrease greenhouse gas (GHG) emissions by operating the gas turbines in the hybrid power system more efficiently while achieving an acceptable satisfaction of the exogenous power demand. Furthermore, the Monte Carlo simulations show that using the modified M-ENMPC decreases the average computational time by 17% compared with the conventional M-ENMPC from the literature.
|
|
15:15-15:30, Paper WeB15.6 | Add to My Program |
Ripple-Type Voltage Control for Extreme-Event Contingencies |
|
Satkauskas, Ignas | National Renewable Energy Laboratory |
Cavraro, Guido | National Renewable Energy Laboratory |
Bernstein, Andrey | National Renewable Energy Lab (NREL) |
Keywords: Energy systems, Smart grid, Control of networks
Abstract: Frequent and intense extreme events make grid operation unprecedentedly challenging. Disruptive events could lead to dangerous voltage drops and even voltage collapse if corrective actions are not quickly taken. In this paper, we present a real-time algorithm for voltage control suitable for mitigating electric grid damage scenarios. In our strategy, when agents (generators, substations) experience a dangerous undervoltage, they first respond locally. When the local control resources are depleted, agents seek assistance from peer nodes over a communication network. The algorithm is simulated on a realistic test transmission system. Using fragility curve methodology, we simulate hurricane damages to the components of the synthetic 2000-bus grid representing the ERCOT system. Although being tested over a damaged grid after a hurricane event, our algorithm can be equally successfully applied to any other emergency low-voltage situation.
|
|
WeB16 Regular Session, Aqua 313 |
Add to My Program |
Machine Learning I |
|
|
Chair: Sun, Wei | University of Oklahoma |
Co-Chair: Ferlez, James | University of California, Irvine |
|
14:00-14:15, Paper WeB16.1 | Add to My Program |
Safety-Aware Learning-Based Control of Systems with Uncertainty Dependent Constraints |
|
Abbaszadeh Chekan, Jafar | University of Illinois at Urbana Champaign |
Langbort, Cedric | Univ of Illinois, Urbana-Champaign |
Keywords: Learning, Constrained control, Lyapunov methods
Abstract: In this paper, we tackle the problem of safely stabilizing an originally (partially) unknown system while ensuring that it does not leave a prescribed 'safe set' whose structure itself depends on the unknown part of the system's dynamics. For this aim, we apply a popular approach based on control Lyapunov functions (CLF), control barrier functions (CBF), and Gaussian processes (to build confidence set around the unknown term), which has proved successful in the known-safe set setting. However, with the mentioned safety set structure, we witness the introduction of higher-order terms to be estimated and bounded with high probability using only system state measurements. In this paper, we build on the recent literature on Gaussian Processes (GPs) and reproducing kernels to address the challenge and show how to modify the CLF-CBF-based approach correspondingly to obtain safety guarantees. To overcome the intractability of verification of these conditions on the continuous domain, we apply discretization of the state space and use Lipschitz continuity properties of dynamics to derive equivalent CLF and CBF certificates in discrete state space. Finally, we discuss the strategy for the control design aim using the derived certificates.
|
|
14:15-14:30, Paper WeB16.2 | Add to My Program |
Mean Field Games on Weighted and Directed Graphs Via Colored Digraphons |
|
Fabian, Christian | Technische Universität Darmstadt |
Cui, Kai | Technische Universität Darnstadt |
Koeppl, Heinz | Technische Universitat Darmstadt |
Keywords: Mean field games, Agents-based systems, Machine learning
Abstract: Multi-agent systems are in general hard to model and control due to their complex nature involving many individuals. Numerous approaches focus on empirical and algorithmic aspects of approximating outcomes and behavior in multi-agent systems and lack a rigorous theoretical foundation. Graphon mean field games (GMFGs) on the other hand provide a mathematically well-founded and numerically scalable framework for a large number of connected agents. In standard GMFGs, the connections between agents are undirected, unweighted and invariant over time. Our paper introduces colored digraphon mean field games (CDMFGs) which allow for weighted and directed links between agents that are also adaptive over time. Thus, CDMFGs are able to model more complex connections than standard GMFGs. Besides a rigorous theoretical analysis including both existence and convergence guarantees, we employ the online mirror descent algorithm to learn equilibria. To conclude, we illustrate our findings with an epidemics model and a model of the systemic risk in financial markets.
|
|
14:30-14:45, Paper WeB16.3 | Add to My Program |
Polynomial-Time Reachability for LTI Systems with Two-Level Lattice Neural Network Controllers |
|
Ferlez, James | University of California, Irvine |
Shoukry, Yasser | University of California, Irvine |
Keywords: Computer-aided control design, Neural networks, Linear systems
Abstract: In this paper, we consider the computational complexity of bounding the reachable set of a Linear Time-Invariant (LTI) system controlled by a Rectified Linear Unit (ReLU) Two-Level Lattice (TLL) Neural Network (NN) controller. In particular, we show that for such a system and controller, it is possible to compute the exact one-step reachable set in polynomial time in the size of the TLL NN controller (number of neurons). Additionally, we show that a tight bounding box of the reachable set is computable via two polynomial-time methods: one with polynomial complexity in the size of the TLL and the other with polynomial complexity in the Lipschitz constant of the controller and other problem parameters. Finally, we propose a pragmatic algorithm that adaptively combines the benefits of (semi-)exact reachability and approximate reachability, which we call L-TLLBox. We evaluate L-TLLBox with an empirical comparison to a state-of-the-art NN controller reachability tool. In our experiments, L-TLLBox completed reachability analysis as much as 5000x faster than this tool on the same network/system, while producing reach boxes that were from 0.08 to 1.42 times the area.
|
|
14:45-15:00, Paper WeB16.4 | Add to My Program |
Learning-Based Nonlinear H∞ Control Via Game-Theoretic Differential Dynamic Programming |
|
Sun, Wei | University of Oklahoma |
Trafalis, Theodore | University of Oklahoma |
Keywords: H-infinity control, Machine learning, Game theory
Abstract: In this work, we present a learning-based nonlinear H∞ control algorithm that guarantees system performance under learned dynamics and disturbance estimate. The Gaussian Process (GP) regression is utilized to update the nominal dynamics of the system and provide disturbance estimate based on data gathered through interaction with the system. A soft-constrained differential game associated with the disturbance attenuation problem in nonlinear H∞ control is then formulated to obtain the nonlinear H∞ controller. The differential game is solved through the min-max Game-Theoretic Differential Dynamic Programming (GT-DDP) algorithm in continuous time. Simulation results on a quadcopter system demonstrate the efficiency of the learning-based control algorithm in handling external disturbances.
|
|
15:00-15:15, Paper WeB16.5 | Add to My Program |
Time Optimal Data Harvesting in Two Dimensions through Reinforcement Learning without Engineered Reward Functions |
|
Wu, Shili | Texas A&M University |
Zhu, Yancheng | Boston University |
Datta, Aniruddha | Texas A&M Univ |
Andersson, Sean B. | Boston University |
Keywords: Optimal control, Machine learning, Markov processes
Abstract: We consider the problem of harvesting data from a set of targets distributed throughout a two dimensional environment. The targets broadcast their data to an agent flying above them, and the goal is for the agent to extract all the data and move to a desired final position in minimum time. While previous work developed optimal controllers for the one-dimensional version of the problem, such methods do not extend to the 2-D setting. Therefore, we first convert the problem into a Markov Decision Process in discrete time and then apply reinforcement learning to find high performing solutions using double deep Q learning. We use a simple binary cost function that directly captures the desired goal, and we overcome the challenge of the sparse nature of these rewards by incorporating hindsight experience replay. To improve learning efficiency, we also utilize prioritized sampling of the replay buffer. We demonstrate our approach through several simulations, which show a similar performance as an existing optimal controller in the 1-D setting, and explore the effect of both the replay buffer and the prioritized sampling in the 2-D setting.
|
|
WeB17 Tutorial Session, Aqua 314 |
Add to My Program |
A Tutorial on Policy Learning Methods for Advanced Controller
Representations |
|
|
Chair: Paulson, Joel | The Ohio State University |
Co-Chair: Mesbah, Ali | University of California, Berkeley |
Organizer: Paulson, Joel | The Ohio State University |
Organizer: Mesbah, Ali | University of California, Berkeley |
|
14:00-14:40, Paper WeB17.1 | Add to My Program |
Policy Learning for Advanced Controller Representations Using Bayesian Optimization Principles (I) |
|
Mesbah, Ali | University of California, Berkeley |
Paulson, Joel | The Ohio State University |
|
14:40-15:10, Paper WeB17.2 | Add to My Program |
Reinforcement Learning with Guarantees (I) |
|
Zanon, Mario | IMT Institute for Advanced Studies Lucca |
|
15:10-15:30, Paper WeB17.3 | Add to My Program |
Safety-Critical Distributionally Robust Imitation Learning (I) |
|
Zhong, Zhengang | Imperial College London |
del Rio Chanona, Antonio | Imperial College London |
Petsagkourakis, Panagiotis | Imperial College London |
|
WeC01 RI Session, Sapphire MN |
Add to My Program |
Cooperative Control (RI) |
|
|
Chair: Rastgoftar, Hossein | University of Arizona |
Co-Chair: Wu, Wencen | San Jose State University |
|
16:00-16:04, Paper WeC01.1 | Add to My Program |
Resilient Distributed Optimization |
|
Zhu, Jingxuan | Stony Brook University |
Lin, Yixuan | Stony Brook University |
Velasquez, Alvaro | Air Force Research Laboratory, AFRL/RISC, Rome, NY |
Liu, Ji | Stony Brook University |
Keywords: Cooperative control, Agents-based systems, Fault tolerant systems
Abstract: This paper considers a distributed optimization problem in the presence of Byzantine agents capable of introducing untrustworthy information into the communication network. A resilient distributed subgradient algorithm is proposed based on graph redundancy and objective redundancy. It is shown that the algorithm causes all non-Byzantine agents' states to asymptotically converge to the same optimal point under appropriate assumptions. A partial convergence rate result is also provided.
|
|
16:04-16:08, Paper WeC01.2 | Add to My Program |
Potential Energy Saving by Different Cooperative Driving Automation Classes in Car-Following Scenarios |
|
Hyeon, Eunjeong | Argonne National Laboratory |
Karbowski, Dominik | Argonne National Laboratory |
Rousseau, Aymeric | Argonne National Laboratory |
Keywords: Cooperative control, Automotive control, Optimal control
Abstract: Cooperative driving automation (CDA) enables connected and automated vehicles to cooperate with surrounding vehicles and infrastructure for increased safety, mobility, and energy efficiency. CDA systems are categorized into four classes, depending on the cooperation level: status-sharing, intent-sharing, agreement-seeking, and prescriptive cooperation. In order to maximize the benefits of these systems, new communication frameworks and protocols need to be designed based on extensive studies on corresponding vehicle control performance. This work investigates the potential energy savings from using different CDA classes in car-following scenarios. The essential parameters of control and communication for reliable control performance and real-time implementation are identified, such as agreement-seeking frequency, prediction horizon length, and the number of CDA participants. In addition, important control design factors that need to be considered in CDA development are discussed, including the smooth transition between cooperative and individual driving plans and the proposals that maximize the probability of agreement from counterparts.
|
|
16:08-16:12, Paper WeC01.3 | Add to My Program |
Multiagent Networks with Misbehaving Nodes: Control with Driver and Observer Nodes |
|
Yildirim, Emre | University of South Florida |
Saltos, Alexander | University of South Florida |
Yucelen, Tansel | University of South Florida |
Keywords: Cooperative control, Control system architecture
Abstract: As opposed to applying control signals to each node in multiagent networks for suppressing the negative effects of misbehaving nodes, this paper focuses on applying control signals to a small subset of nodes due to physical (i.e., inaccessible nodes) and/or economical (i.e., large number of nodes) constraints. To achieve this goal, we have recently focused on how to control misbehaving multiagent networks via sending control signals to a small subset of nodes (i.e., driver nodes) in the network, where the control signals are generated by their state information. In this study, we now consider that the control signals applied to driver nodes are generated based on the state information of other subset of nodes (i.e., observer nodes) in the network. We characterize which nodes in the network behave as desired based on the selection of driver nodes and observer nodes in control of misbehaving multiagent networks. We also present several illustrative numerical examples to demonstrate the theoretical contributions of this paper.
|
|
16:12-16:16, Paper WeC01.4 | Add to My Program |
Distributed Formation Trajectory Planning for Multi-Vehicle Systems |
|
Nguyen, Binh | Texas A&M University-Corpus Christi |
Nghiem, Truong X. | Northern Arizona University |
Nguyen, Linh | Federation University Australia |
Nguyen, Anh Tung | Uppsala University |
La, Hung | University of Nevada |
Sookhak, Mehdi | Texas A&M University-Corpus Christi |
Nguyen, Thang | Texas A&M University-Corpus Christi |
Keywords: Cooperative control, Distributed control, Optimization
Abstract: This paper addresses the problem of distributed formation trajectory planning for multi-vehicle systems with collision avoidance among vehicles. Unlike some previous distributed formation trajectory planning methods, our proposed approach offers great flexibility in handling computational tasks for each vehicle when the global formation of all the vehicles changes. It affords the system the ability to adapt to the computational capabilities of the vehicles. Furthermore, global formation constraints can be handled at any selected vehicles. Thus, any formation change can be effectively updated without recomputing all local formations at all the vehicles. To guarantee the above features, we first formulate a dynamic consensus-based optimization problem to achieve desired formations while guaranteeing collision avoidance among vehicles. Then, the optimization problem is effectively solved by ADMM-based or alternating projection-based algorithms, which are also presented. Theoretical analysis is provided not only to ensure the convergence of our method but also to show that the proposed algorithm can surely be implemented in a fully distributed manner. The effectiveness of the proposed method is illustrated by a numerical example of a 9-vehicle system.
|
|
16:16-16:20, Paper WeC01.5 | Add to My Program |
Multi-Robot Localization and Target Tracking with Connectivity Maintenance and Collision Avoidance |
|
Zahroof, Rahul | University of Pennsylvania |
Liu, Jiazhen | University of Pennsylvania |
Zhou, Lifeng | Drexel University |
Kumar, Vijay | University of Pennsylvania, School of Engineering and Applied Sc |
Keywords: Cooperative control, Estimation, Optimization algorithms
Abstract: We study the problem that requires a team of robots to perform joint localization and target tracking task while ensuring team connectivity and collision avoidance. The problem can be formalized as a nonlinear, non-convex optimization program, which is typically hard to solve. To this end, we design a two-staged approach that utilizes a greedy algorithm to optimize the joint localization and target tracking performance and applies control barrier functions to ensure safety constraints, i.e., maintaining connectivity of the robot team and preventing inter-robot collisions. Simulated Gazebo experiments verify the effectiveness of the proposed approach. We further compare our greedy algorithm to a non-linear optimization solver and a random algorithm, in terms of the joint localization and tracking quality as well as the computation time. The results demonstrate that our greedy algorithm achieves high task quality and runs efficiently.
|
|
16:20-16:24, Paper WeC01.6 | Add to My Program |
Resilient Strong Structural Controllability in Networks Using Leaky Forcing in Graphs |
|
Abbas, Waseem | University of Texas at Dallas |
Keywords: Network analysis and control, Cooperative control, Control of networks
Abstract: This paper studies the problem of selecting input nodes (leaders) to make networks strong structurally controllable despite misbehaving nodes and edges. We utilize a graph-based characterization of network strong structural controllability (SSC) in terms of zero forcing in graphs, which is a dynamic coloring of nodes. We consider three types of misbehaving nodes and edges that disrupt the zero forcing process in graphs, thus, deteriorating the network SSC. Then, we examine a leader selection guaranteeing network SSC by ensuring the accuracy of the zero forcing process, despite k misbehaving nodes/edges. Our main result shows that a network is resilient to k misbehaving nodes/edges under one threat model if and only if it is resilient to the same number of failures under the other threat models. Thus, resilience against one threat model implies resilience against the other models. We then discuss the computational aspects of leader selection for resilient SSC and present a numerical evaluation.
|
|
16:24-16:28, Paper WeC01.7 | Add to My Program |
Decentralized Formation Control with Prescribed Distance Constraints and Shape Uniqueness |
|
Pan, Yuqi | ShanghaiTech University |
He, Binglin | ShanghaiTech University |
Geng, Junyi | Pennsylvania State University |
Wang, Yang | Shanghai Technology Unversity |
Keywords: Cooperative control, Networked control systems, Decentralized control
Abstract: For the Multi-Agent Systems (MAS) formation problem, this paper presents a novel decentralized control protocol that can guarantee the uniqueness of formation shape and restrain the distance between neighboring agents within a prescribed range. The proposed control law is decentralized, in the sense that each agent merely employs local relative information regarding its neighbors to obtain the control input. Using a delicately designed gain matrix, we avoid the problem of non-uniqueness of shape that is existed in the majority of the decentralized formation methods, especially the distance-based methods. Meanwhile, the distance constraints, like connectivity maintenance and collision avoidance, required in many practical scenarios are also addressed. Furthermore, the convergence of formation is rigorously proved under a less restrictive assumption on the communication graph. Finally, a comparative simulation verifies the effectiveness and superiority of proposed approach.
|
|
16:28-16:32, Paper WeC01.8 | Add to My Program |
Deep Continuum Deformation Coordination and Optimization with Safety Guarantees |
|
Uppaluru, Harshvardhan | University of Arizona |
Rastgoftar, Hossein | University of Arizona |
Keywords: Cooperative control, Neural networks, Flight control
Abstract: — In this paper, we present a novel method for safe coordination of a large-scale multi-agent team with ”local deformation” capability. Multi-agent coordination is defined by our proposed method as a multi-layer deformation problem specified as a deep neural network (NN) optimization problem. The proposed NN consists of p hidden layers, each of which contains neurons representing unique agents. In addition, the desired deformation of the agents of hidden layer k + 1 is planned based on the desired positions of the agents of hidden layer k (k = 1, · · · , p − 1). In contrast to the available neural network learning problems, our proposed neural network optimization receives time-invariant reference positions of the boundary agents as inputs, trains the weights based on the desired trajectory of the agent team configuration, where the weights are constrained by certain lower and upper bounds to ensure inter-agent collision avoidance.
|
|
16:32-16:36, Paper WeC01.9 | Add to My Program |
H2 Suboptimal Containment Control of Multi-Agent Systems |
|
Gao, Yuan | Technical University of Munich |
Jiao, Junjie | Technical University of Munich |
Hirche, Sandra | Technische Universität München |
Keywords: Cooperative control, Optimal control, Linear systems
Abstract: This paper deals with the distributed H2 suboptimal containment control problem by static state feedback for linear multi-agent systems. Given multiple autonomous leaders, a number of followers, and an H2 cost functional, we aim to design a distributed protocol that achieves containment control while the associated H2 cost is smaller than an a priori given upper bound. To that end, we first show that the H2 suboptimal containment control problem can be equivalently recast into the H2 suboptimal control problem of a set of independent systems. Based on this, a design method is provided to compute such a distributed protocol. The computation of the feedback gain involves a single Riccati inequality whose dimension is equal to the dimension of the states of the agents. The performance of the proposed protocol is illustrated by a simulation example.
|
|
16:36-16:40, Paper WeC01.10 | Add to My Program |
Distributed Cooperative Kalman Filter Constrained by Discretized Poisson Equation for Mobile Sensor Networks |
|
Zhang, Ziqiao | Georgia Institute of Technology |
Mayberry, Scott | Georgia Institute of Technology |
Wu, Wencen | San Jose State University |
Zhang, Fumin | Georgia Institute of Technology |
Keywords: Kalman filtering, Sensor networks, Distributed parameter systems
Abstract: This paper proposes cooperative Kalman filters for distributed mobile sensor networks where the mobile sensors are organized into cells that resemble a mesh grid to cover a spatial area. The mobile sensor networks are deployed to map an underlying spatial-temporal field modeled by the Poisson equation. After discretizing the Poisson equation with finite volume method, we found that the cooperative Kalman filters for the cells are subjected to a set of distributed constraints. The field value and gradient information at each cell center can be estimated by the constrained cooperative Kalman filter using measurements within each cell and information from neighboring cells. We also provide convergence analysis for the distributed constrained cooperative Kalman filter. Simulation results with a five cell network validates the proposed distributed filtering method.
|
|
WeC02 RI Session, Sapphire IJ |
Add to My Program |
Constrained Control (RI) |
|
|
Chair: Burlion, Laurent | Rutgers, the State University of New Jersey |
Co-Chair: Pangborn, Herschel | Pennsylvania State University |
|
16:00-16:04, Paper WeC02.1 | Add to My Program |
Finite-Time Stability and Stabilization of Polynomial Systems (I) |
|
Tartaglione, Gaetano | Università Di Napoli Parthenope |
Ariola, Marco | Univ. Degli Studi Di Napoli Parthenope |
Amato, Francesco | Università Degli Studi Di Napoli Federico II |
Keywords: Constrained control
Abstract: In this paper we consider the class of polynomial systems and we investigate on their finite-time stability properties. In this analysis, for the first time, finite-time stability is defined with respect to domains with polynomial bounds. A sufficient condition for finite-time stability is obtained, which can be solved recasting the feasibility problem in terms of SDP through SOS programming. Moreover, a nonlinear state-feedback control law is developed to stabilize the system in the finite-time notion. The effectiveness of the stabilizing control law is shown by a numerical example.
|
|
16:04-16:08, Paper WeC02.2 | Add to My Program |
Lyapunov-Based Current-Profile Feedback Control in Tokamaks with Nonsymmetric Individual Actuator Saturation |
|
Paruchuri, Sai Tej | Lehigh University |
Pajares, Andres | General Atomics |
Schuster, Eugenio | Lehigh University |
Keywords: Constrained control, Lyapunov methods, Energy systems
Abstract: Advanced tokamak scenarios can achieve optimal tokamak operation by shaping the plasma internal profiles through the use of noninductive heating and current sources. As a result of the dynamic complexities, active control of the power of each noninductive heating and current source, a nonnegative value, may be necessary to achieve the desired tokamak performance. However, due to the inherent physical limitations, arbitrary power prescription by the controller may saturate the heating and current drives. Therefore, it is highly desirable to develop a class of active control algorithms that account for the saturation limits of these actuators. A Lyapunov-based nonlinear feedback control algorithm that intrinsically accounts for saturation limits is proposed in this work to regulate the spatial distribution of the toroidal current density in the tokamak. The controller does not rely on constrained optimization techniques, which can be computationally expensive for real-time implementation. Furthermore, the controller can handle nonsymmetric saturation limits, i.e., the absolute values of the upper and lower saturation limits do not have to be equal. The effectiveness of the control algorithm is demonstrated for a DIII-D tokamak scenario in nonlinear simulations.
|
|
16:08-16:12, Paper WeC02.3 | Add to My Program |
Successor Sets of Discrete-Time Nonlinear Systems Using Hybrid Zonotopes |
|
Siefert, Jacob | Pennsylvania State University |
Bird, Trevor J. | Purdue University |
Koeln, Justin | University of Texas at Dallas |
Jain, Neera | Purdue University |
Pangborn, Herschel | Pennsylvania State University |
Keywords: Constrained control, Hybrid systems, Formal verification/synthesis
Abstract: This paper presents identities for calculating over-approximated successor sets of discrete-time nonlinear systems using hybrid zonotopes. The proposed technique extends the state-update set construct, previously developed for linear hybrid systems, to nonlinear systems. Forward reachability of nonlinear systems can then be performed using only projection, intersection, and Cartesian product set operations with the state-update set. It is shown that use of an over-approximation of the state-update set yields over-approximations of successor sets. A technique to over-approximate a nonlinear function using a special ordered set approximation, equivalently represented as a hybrid zonotope, is then presented. A numerical example of a nonlinear system controlled by a piecewise-affine control law demonstrates that the approach provides a computationally-efficient and tight over-approximation of the closed-loop reachable set.
|
|
16:12-16:16, Paper WeC02.4 | Add to My Program |
Robust Data-Driven Control Barrier Functions for Unknown Continuous Control Affine Systems |
|
Jin, Zeyuan | Arizona State University |
Khajenejad, Mohammad | University of California, San Diego |
Yong, Sze Zheng | Northeastern University |
Keywords: Constrained control, Identification for control, Optimization
Abstract: In this letter, we introduce robust data-driven control barrier functions (CBF-DDs) to guarantee robust safety of unknown continuous control affine systems despite worst-case realizations of generalization errors from prior data under various continuity assumptions. To achieve this, we leverage results from data-driven abstraction that provide guaranteed upper and lower bounds of an unknown function from the data set to formulate/obtain a safe input set for a given state. By incorporating the safe input set into an optimization-based controller, the safety of the system can be ensured. Moreover, we present several complexity reduction approaches including providing subproblems that can be solved in parallel, closed-form solutions for a special case and downsampling strategies to improve computational performance.
|
|
16:16-16:20, Paper WeC02.5 | Add to My Program |
Anti-Windup Compensation for Stable and Unstable Quantized Systems with Saturation |
|
Richards, Christopher | University of Louisville |
Turner, Matthew C. | University of Southampton |
Keywords: Constrained control, Quantized systems, Stability of nonlinear systems
Abstract: It is well known that actuator saturation can cause destabilization and degradation in performance; similar problems are faced when actuation is quantized. This paper proposes the design of an anti-windup compensator for systems with actuators that are limited to a textit{finite} number of quantization levels. This combination of discrete level actuation and saturation poses a unique anti-windup problem that has not yet been solved. To surmount this combined issue, an anti-windup compensator is proposed which provides ultimate-boundedness of the system state within a prescribed region, and also guarantees that the state does not stray outside a larger compact set. A numerical simulation example illustrates the effectiveness on a rigid-body system which inspired this work.
|
|
16:20-16:24, Paper WeC02.6 | Add to My Program |
Attacker-Resilient Adaptive Path Following of a Quadrotor with Dynamic Path-Dependent Constraints |
|
Jin, Xu | University of Kentucky |
Hu, Zhongjun | University of Kentucky |
Keywords: Robust adaptive control, Constrained control, Lyapunov methods
Abstract: For most works on constrained motion control in the literature, only constant or time-varying constraints are discussed, which are often conservative and cannot adapt to the dynamically changing operation environment. In this work, in the context of quadrotor operations, we propose a new adaptive path following architecture with dynamic path-dependent constraints, in which the desired path coordinate, desired path speed, and constraint requirements not only depend on a path parameter associated with the desired path, but also can adapt to the presence of an "attacker" nearby. A new concept of "composite barrier function" has been proposed to address both safety and performance constraints in a unified structure. Adaptive laws are introduced to estimate the upper bounds of system uncertainties and unknown "attacker" velocity. Exponential convergence into a small neighborhood around the equilibrium for position tracking error can be guaranteed. In the end, a simulation example further demonstrates the effectiveness of the proposed architecture.
|
|
16:24-16:28, Paper WeC02.7 | Add to My Program |
Control Constrained Game Theoretic Differential Dynamic Programming |
|
Sun, Wei | University of Oklahoma |
Kleiber, Justin | Virginia Tech |
Keywords: Constrained control, Optimal control, Game theory
Abstract: In this work, a control constrained version of the continuous time game theoretic differential dynamic programming (GT-DDP) algorithm is presented. The convergence of the GT-DDP algorithm is analyzed, and it is shown that control constraints can be successfully applied through solving quadratic programs during the control update phase of the GT-DDP. The control constrained GT-DDP is applied to the trajectory tracking problem of a quadrotor. Additionally, the proposed algorithm is demonstrated on a real world experiment of a pursuit-evasion game between two quadrotors to show its ability in planning trajectories for multiple quadrotors under different control constraints.
|
|
16:28-16:32, Paper WeC02.8 | Add to My Program |
Safety-Critical Control with Bounded Inputs Via Reduced Order Models |
|
Molnar, Tamas G. | California Institute of Technology |
Ames, Aaron D. | California Institute of Technology |
Keywords: Constrained control, Reduced order modeling, Mechanical systems/robotics
Abstract: Guaranteeing safe behavior on complex autonomous systems---from cars to walking robots---is challenging due to the inherently high dimensional nature of these systems and the corresponding complex models that may be difficult to determine in practice. With this as motivation, this paper presents a safety-critical control framework that leverages reduced order models to ensure safety on the full order dynamics---even when these models are subject to disturbances and bounded inputs (e.g., actuation limits). To handle input constraints, the backup set method is reformulated in the context of reduced order models, and conditions for the provably safe behavior of the full order system are derived. Then, the input-to-state safe backup set method is introduced to provide robustness against discrepancies between the reduced order model and the actual system. Finally, the proposed framework is demonstrated in high-fidelity simulation, where a quadrupedal robot is safely navigated around an obstacle with legged locomotion by the help of the unicycle model.
|
|
16:32-16:36, Paper WeC02.9 | Add to My Program |
Control Synthesis for Stability and Safety by Differential Complementarity Problem |
|
Yi, Yinzhuang | University of California, San Diego |
Koga, Shumon | University of California, San Diego |
Gavrea, Bogdan | Technical University of Cluj-Napoca |
Atanasov, Nikolay | University of California, San Diego |
Keywords: Constrained control, Stability of nonlinear systems, Differential-algebraic systems
Abstract: This paper develops a novel control synthesis method for safe stabilization of control-affine systems as a Differential Complementarity Problem (DCP). Our design uses a control Lyapunov function (CLF) and a control barrier function (CBF) to define complementarity constraints in the DCP formulation to certify stability and safety, respectively. The CLF-CBF-DCP controller imposes stability as a soft constraint, which is automatically relaxed when the safety constraint is active, without the need for parameter tuning or optimization. We study the closed-loop system behavior with the CLF-CBF-DCP controller and identify conditions on the existence of local equilibria. Although in certain cases the controller yields undesirable local equilibria, those can be confined to a small subset of the safe set boundary by proper choice of the control parameters. Then, our method can avoid undesirable equilibria that CLF-CBF quadratic programming techniques encounter.
|
|
16:36-16:40, Paper WeC02.10 | Add to My Program |
Reference Governor Design in the Presence of Uncertain Polynomial Constraints |
|
Schieni, Rick | Rutgers University |
Zhao, Chengwei | Rutgers University |
Malisoff, Michael | Louisiana State University |
Burlion, Laurent | Rutgers, the State University of New Jersey |
Keywords: Constrained control, Uncertain systems, Aerospace
Abstract: Reference governors are add-on schemes that are used to prevent controlled dynamical systems from violating input and state constraints, and so are playing an increasingly important role in aerospace, robotic, and other engineering applications. Here we present a novel reference governor design for systems whose polynomial constraints depend on unknown bounded parameters. This is a significant departure from earlier treatments of reference governors, where the constraints were linear or known, because here we transfer the uncertainties into the constraints instead of having them in the closed loop dynamics, which greatly simplifies the task of determining future evolution of the constraints. Unlike our earlier treatment of reference governors with polynomial constraints which transformed the constraints into linear ones that depend on the state of the system, here we transform the constraints into linear ones that depend on both the system's state and uncertain parameters. Convexity allows us to compute the maximal output admissible set for an uncertain pre-stabilized linear system. We show that it is sufficient to only consider the extreme values of the uncertain parameters when computing and propagating the polynomial constraints. We illustrate our method using an uncertain longitudinal dynamics for civilian aircraft which is controlled using a disturbance compensation method and needs to satisfy constraints, and where our reference governor method ensures that safety constraints are always satisfied.
|
|
WeC03 Regular Session, Sapphire EF |
Add to My Program |
Robotics II |
|
|
Chair: Tanner, Herbert G. | University of Delaware |
Co-Chair: Cai, Mingyu | Lehigh University |
|
16:00-16:15, Paper WeC03.1 | Add to My Program |
A Transient Response Adjustable MPC for Following a Dynamic Object |
|
Hong, Sanghyun | Ford Motor Company |
Miller, Justin | Ford Motor Company |
Lu, Jianbo | Ford Motor Company |
Keywords: Robotics, Predictive control for linear systems, Predictive control for nonlinear systems
Abstract: As mobile robots increasingly interact with humans, vehicles, and other robots, the ability to follow these dynamic objects becomes a crucial capability. In this paper, a mobile robot motion controller based on a down-scaled Model Predictive Control (MPC) is proposed for performing dynamic object following. The controller can reflect the dynamics of an object for an appropriate response speed while ensuring safety of the robot against predicted risks. In addition, an adaptive prediction time horizon is proposed to improve following robustness. The proposed motion controller is demonstrated through extensive simulations and real-time experiments on mobile robot hardware.
|
|
16:15-16:30, Paper WeC03.2 | Add to My Program |
Priority Patrol with a Single Agent - Bounds and Approximations |
|
Mallya, Deepak | Indian Institute of Technology, Bombay |
Sinha, Arpita | Indian Institute of Technology, Bombay |
Vachhani, Leena | Indian Institute of Technology Bombay |
Keywords: Robotics, Autonomous systems, Constrained control
Abstract: Priority patrolling is a particular case of the patrolling problem where a few locations have higher priority than others, and a patrolling agent must visit these locations more frequently. This work provides three results on the priority patrol problem. First, we study the minimum time interval between two visits to a priority node, which we term the time period. We show that there doesn't exist a feasible patrol strategy for a time period less than a particular threshold. Next, we prove that a patrol strategy with recurring circuits always exists for this problem. Lastly, we provide an algorithm to obtain a patrol strategy with a constant factor approximation to the time period. We validated the results on grid graphs of various sizes, connectivity, and the number and placement of priority nodes in the graph.
|
|
16:30-16:45, Paper WeC03.3 | Add to My Program |
Learning Minimally-Violating Continuous Control for Infeasible Linear Temporal Logic Specifications |
|
Cai, Mingyu | Lehigh University |
Mann, Makai | MIT Lincoln Laboratory |
Serlin, Zachary | MIT Lincoln Laboratory |
Leahy, Kevin | MIT Lincoln Laboratory |
Vasile, Cristian Ioan | Lehigh University |
Keywords: Robotics, Learning, Automata
Abstract: This paper explores continuous-time control synthesis for target-driven navigation to satisfy complex high-level tasks expressed as linear temporal logic (LTL). We propose a model-free framework using deep reinforcement learning (DRL) where the underlying dynamic system is unknown (an opaque box). Unlike prior work, this paper considers scenarios where the given LTL specification might be infeasible and therefore cannot be accomplished globally. Instead of modifying the given LTL formula, we provide a general DRL-based approach to satisfy it with minimal violation. %mminline{Need to decide if we're comfortable calling these "guarantees" due to the stochastic policy. I'm not repeating this comment everywhere that says "guarantees" but there are multiple places.} To do this, we transform a previously multi-objective DRL problem, which requires simultaneous automata satisfaction and minimum violation cost, into a single objective. By guiding the DRL agent with a sampling-based path planning algorithm for the potentially infeasible LTL task, the proposed approach mitigates the myopic tendencies of DRL, which are often an issue when learning general LTL tasks that can have long or infinite horizons. This is achieved by decomposing an infeasible LTL formula into several reach-avoid sub-tasks with shorter horizons, which can be trained in a modular DRL architecture. Furthermore, we overcome the challenge of the exploration process for DRL in complex and cluttered environments by using path planners to design rewards that are dense in the configuration space. The benefits of the presented approach are demonstrated through testing on various complex nonlinear systems and compared with state-of-the-art baselines. The Video demonstration can be found on YouTube Channel:url{https://youtu.be/jBhx6Nv224E}.
|
|
16:45-17:00, Paper WeC03.4 | Add to My Program |
A Koopman Operator Approach for the Pitch Stabilization of a Hydrofoil in an Unsteady Flow Field |
|
Rodwell, Colin | Clemson University |
Buzhardt, Jake | Clemson University |
Tallapragada, Phanindra | Clemson University |
Keywords: Robotics, Maritime control, Optimal control
Abstract: The control of swimming robots presents several challenges, in large part due to the complex fluid-structure interaction. Low fidelity simplified formulas for drag and lift force lead to control amenable models, but do not capture key physics that can play an especially important role in the swimming of small-scale robots with limited actuation. Higher fidelity models of the fluid-structure interaction lead to nonlinear high dimensional control systems for which solution methods are not obvious. We propose the use of the Koopman operator in developing a linear representation for both the complex fluid structure interaction as well as actuation effects. As a test case for this framework we address the problem of stabilizing the pitching oscillations of a hydrofoil that is hinged in a simulated unsteady free stream flow. The actuator for the hydrofoil is an internal reaction wheel which presents an integral saturation constraint. Using the Koopman operator, the lifted control system is used to formulate a constrained optimal control problem which we solve using model predictive control. The framework proposed in this paper can potentially be extended to design a combination of data-driven and physics based control algorithms for swimming robots.
|
|
17:00-17:15, Paper WeC03.5 | Add to My Program |
Geometry of Radial Basis Neural Networks for Safety Biased Approximation of Unsafe Regions |
|
Abuaish, Ahmad | Georgia Institute of Technology |
Srinivasan, Mohit | Ford Motor Company |
Vela, Patricio A. | Georgia Institute of Technology |
Keywords: Robotics, Neural networks, Constrained control
Abstract: Barrier function-based inequality constraints are a means to enforce safety specifications for control systems. When used in conjunction with a convex optimization program, they provide a computationally efficient method to enforce safety for the general class of control-affine systems. One of the main assumptions when taking this approach is the a priori knowledge of the barrier function itself, i.e., knowledge of the safe set. In the context of navigation through unknown environments where the locally safe set evolves with time, such knowledge does not exist. This manuscript focuses on the synthesis of a zeroing barrier function characterizing the safe set based on safe and unsafe sample measurements, e.g., from perception data in navigation applications. Prior work formulated a supervised machine learning algorithm whose solution guaranteed the construction of a zeroing barrier function with specific level-set properties. However, it did not explore the geometry of the neural network design used for the synthesis process. This manuscript describes the specific geometry of the neural network used for zeroing barrier function synthesis, and shows how the network provides the necessary representation for splitting the state space into safe and unsafe regions.
|
|
17:15-17:30, Paper WeC03.6 | Add to My Program |
Multi-Behavioral Multi-Robot Systems Driven by Motivation Dynamics |
|
Baxevani, Kleio | University of Delaware |
Tanner, Herbert G. | University of Delaware |
Keywords: Time-varying systems, Adaptive systems
Abstract: This paper outlines a methodology for constructing multiple dynamical behaviors for a multi-agent system within the motivation dynamics theoretical framework. Recent work introduced analytical conditions for a dynamical system to undergo a Hopf bifurcation and generate multiple dynamical behaviors from a single family of continuous dynamics. The paper contributes by leveraging these recent results to develop a multi-agent system capable of switching its dynamic behavior without changing its underlying continuous dynamics. Simulation and experimental results are provided, confirming the theoretical results which guarantee the existence of a Hopf bifurcation in the dynamics of the multi-robot system.
|
|
WeC04 Invited Session, Sapphire AB |
Add to My Program |
Advanced Control of Wind Farms and Wind Turbines: Session III: Wind Turbine
Modeling, Estimation and Control |
|
|
Chair: Mulders, Sebastiaan Paul | Delft University of Technology |
Co-Chair: Pusch, Manuel | Munich University of Applied Sciences |
Organizer: Mulders, Sebastiaan Paul | Delft University of Technology |
Organizer: Bay, Christopher | National Renewable Energy Laboratory |
Organizer: Fleming, Paul | National Renewable Energy Laboratory |
Organizer: van Wingerden, Jan-Willem | Delft University of Technology |
|
16:00-16:15, Paper WeC04.1 | Add to My Program |
Reducing Plant-Model Mismatch for Economic Model Predictive Control of Wind Turbine Fatigue by a Data-Driven Approach (I) |
|
Anand, Abhinav | Technical University of Munich, Wind Energy Institute |
Bottasso, Carlo Luigi | Technical University of Munich |
Keywords: Optimal control, Neural networks, Grey-box modeling
Abstract: This paper considers the inclusion of an adaptive element in the model-predictive control of a wind turbine. In fact, an adaptive internal model can reduce the plant-model mismatch, in turn potentially leading to an improved performance. A Reduced Order Model (ROM) is augmented by training a Neural Network (NN) offline. The improvement in state predictions due to model augmentation is assessed and compared with the non-augmented ROM. The augmented ROM is then used as the internal model in an Economic Nonlinear Model Predictive Controller (ENMPC), which maximizes profit by optimally balancing tower fatigue damage costs with revenue due to power generation. The tower cyclic fatigue costs are formulated directly within the controller using the Parametric Online Rainflow Counting (PORFC) approach. The designed ENMPC is implemented using the state-of-the-art ACADOS framework. The performance of the controller and the impact of a reduced plant model mismatch is assessed in closed loop with the NREL 5MW onshore wind turbine, simulated using OpenFAST. Results show that the ENMPC utilizing the augmented ROM yields higher economic profit, slightly higher torque travel, and significantly lower pitch travel, compared to the ENMPC utilizing only the baseline ROM.
|
|
16:15-16:30, Paper WeC04.2 | Add to My Program |
Modeling Blade-Pitch Actuation Use in Wind Turbines (I) |
|
Henry, Aoife | University of Colorado Boulder |
Pusch, Manuel | Munich University of Applied Sciences |
Pao, Lucy Y. | University of Colorado Boulder |
Keywords: Modeling, Closed-loop identification, Learning
Abstract: Estimating the levelized cost of energy (LCOE) of a wind turbine is useful for performing a cost-benefit analysis of potential designs. The power consumed by blade-pitch actuation is an often neglected, but nontrivial factor in LCOE estimation. The peak power consumption determines the required rating of the actuation motors and the mean power consumption impacts the net annual energy production (nAEP) of the turbine. The closed-loop blade-pitch actuation and the power consumed by its motors are complex functions of the wind field disturbance and internal turbine states. They can only be predicted well with reasonably high-fidelity and computationally expensive simulations or field tests. We present an alternative approach to modeling these signals using the Sparse Identification of Nonlinear Dynamics with Control (SINDyC) methodology. It is computationally tractable to generate these models for large datasets and to simulate power consumption for a given wind field. Furthermore, the models provide intuition as to how the turbine states and disturbances contribute to the signal dynamics. By generating a closed-form dynamic state equation for the blade-pitch actuation and an algebraic equation for the blade-pitch motor power, we can efficiently predict the mean and maximum power required for a given turbulent wind field and turbine design. The model is trained and validated using data generated from the open-source aero-servo-hydro-elastic wind turbine simulation tool OpenFAST.
|
|
16:30-16:45, Paper WeC04.3 | Add to My Program |
A Learning Algorithm for the Calibration of Internal Model Uncertainties in Advanced Wind Turbine Controllers: A Wind Speed Measurement-Free Approach (I) |
|
Mulders, Sebastiaan Paul | Delft University of Technology |
Brandetti, Livia | Delft University of Technology |
Spagnolo, Fabio | Vestas Wind Systems A/S |
Liu, Yichao | Delft University of Technology |
Christensen, Poul Brandt | Vestas Wind Systems A/S |
van Wingerden, Jan-Willem | Delft University of Technology |
Keywords: Learning, Uncertain systems, Energy systems
Abstract: Wind turbine partial-load controllers have evolved from simple static nonlinear function implementations to more advanced dynamic controller structures. Such dynamic control schemes have the potential to improve power production performance in realistic environmental conditions and allow for a more granular trade-off between loads and energy capture. The control structure generally consists of a wind speed estimator (WSE) combined with a controller aiming to track the commanded tip-speed ratio (TSR) reference. The performance and resulting closed-loop system stability are however highly dependent on the accuracy of the internal model in the WSE-TSR tracking scheme. Therefore, developing learning algorithms to calibrate the internal model is of particular interest. Previous works have proposed such algorithms; however, they all rely on the availability of (rotor-effective) wind speed measurements. For the first time, this paper proposes an excitation-based learning algorithm that exploits the closed-loop dynamic structure of the WSE-TSR tracking scheme. This algorithm calibrates the internal model without the need for wind speed measurements. Analysis and simulations show that the proposed algorithm corrects for model uncertainties in the form of magnitude scaling errors under ideal constant and realistic turbulent wind conditions.
|
|
16:45-17:00, Paper WeC04.4 | Add to My Program |
Robustness of an Economic Nonlinear Model Predictive Control for Wind Turbines under Changing Environmental and Wear Conditions (I) |
|
Pustina, Luca | Roma Tre University |
Serafini, Jacopo | Roma Tre University |
Biral, Francesco | University of Trento |
Keywords: Predictive control for nonlinear systems, Optimal control, Power generation
Abstract: In this letter, the authors have assessed the robustness of an Economic Nonlinear Model-Predictive Controller (ENMPC) aimed at maximizing the power production of wind turbines. The scope of this letter is to quantify the sensitivity of this type of controller concerning wind conditions, climate, wind speed prediction unavailability, and aerodynamic performance degradation. A power production controller’s robustness is crucial for the wind turbine industry due to the extreme variability of external conditions and the wear caused by long-term continuous operativity. Model-Predictive controllers are, in principle, more prone to robustness issues concerning standard controllers, a fact that limits their adoption on actual wind turbines. The analysis is performed with the fully-aeroelastic solver OpenFAST considering a wide set of realistic load cases. It is demonstrated that the ENMPC previously developed is robust to wind prediction unavailability and change in wind turbulence intensity. Conversely, it is not robust to the modelling error due to aerodynamic degradation. Indeed, a reduction in generated power concerning the reference controller is observed, especially for operating region two and end-life blades. Finally, a significant increase in power production is achieved considering the external temperature variation thanks to the ENMPC’s direct handling of the generator temperature constraint.
|
|
17:00-17:15, Paper WeC04.5 | Add to My Program |
Wake Steering for Wind Turbine Fatigue Load Optimisation (I) |
|
Navalkar, Sachin | Siemens Gamesa Renewable Energy |
Dell, Timothy | Siemens Gamesa Renewable Energy |
Burillo, Nieves | Siemens Gamesa Renewable Energy |
Keywords: Modeling, Optimization, Simulation
Abstract: Wake steering is currently being implemented on commercial wind turbines to increase the power output from densely-packed wind farm layouts. Apart from increased power capture, wake steering also has an impact on the loads of both the upstream turbine due to operation in yawed inflow conditions, and on the downstream turbine due to reduction in the effective turbulence caused by the deflection of the wake. In this paper, commonly used wake expansion and wake deflection models are extended to obtain an analytical expression for the modified wake overlap and hence modified effective turbulence experienced by the downwind turbine. The impact of the modified yaw inflow and turbulence on wind turbine fatigue loads are investigated in a commercial aeroelastic environment. It is concluded that for a full wake overlap situation, wake steering has minimal impact on the loads of the downwind turbine. However, with partial overlap, significant changes in the effective turbulence and the loading of the downwind turbine can be observed. Wake steering strategies are hence recommended to consider both power and loads consequences in order to achieve the correct balance between turbine lifetime extension and short-term energy gains.
|
|
17:15-17:30, Paper WeC04.6 | Add to My Program |
Load Reduction of Wind Turbines Using Integrated Torque, Collective Pitch, and Individual Pitch Control Actions |
|
Mohsin, Kazi | University of Central Florida |
Odeh, Mohammad | University of Central Florida |
Ngo, Tri | University of Central Florida |
Das, Tuhin | University of Central Florida |
Keywords: Control applications, Modeling, Simulation
Abstract: This article presents the design of an integrated control strategy to improve performance and reduce load variations in large wind turbines using torque control, collective pitch control (CPC), and individual pitch control (IPC). The nonlinear CPC and torque controller are designed to regulate the rotor speed and power generation of variable-speed wind turbines in multi-regime operations. The IPC controller is designed to provide blade pitch corrections for load reduction and improved performance of the wind turbines. A Control-oriented, Reconfigurable, and Acausal Floating Turbine Simulator (CRAFTS), developed in-house, is used for the control design, implementation, and evaluation. CRAFTS enables rapid simulation of wind turbines, integration of control modules, and testing of controllers in several load cases. It has been validated against experimental data and against the well-known OpenFAST platform. Extensive simulations show that the IPC controller makes a significant load reduction in blade root bending moment, tower side-by-side, and tower fore-aft bending moments at the frequencies of interest. The IPC controller has no detrimental effects on the rotor speed and power generation that are regulated by the CPC and torque controller. Comparisons with simulation data from OpenFAST and the standard ROSCO controller are also performed in this study.
|
|
WeC05 Regular Session, Sapphire 411A |
Add to My Program |
Optimization Algorithms III |
|
|
Chair: P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Co-Chair: Chen, Xiang | University of Windsor |
|
16:00-16:15, Paper WeC05.1 | Add to My Program |
Optimal Storage and Solar Capacity of a Residential Household under Net Metering and Time-Of-Use Pricing |
|
K, Victor Sam Moses Babu | Birla Institute of Technology & Science Pilani, Hyderabad Campus |
Chakraborty, Pratyush | Birla Institute of Technology and Scienc |
Baeyens, Enrique | University of Valladolid |
Khargonekar, Pramod | Univ. of California, Irvine |
Keywords: Smart grid, Optimization, Energy systems
Abstract: Incentive programs and ongoing reduction in costs are driving joint installation of solar PV panels and storage systems in residential households. There is a need for optimal investment decisions to reduce the electricity consumption costs of the households further. In this paper, we first develop analytical expression of storage investment decision and then of solar investment decision for a household which is under net metering billing mechanism with time of use pricing condition. Using real data of a residential household in Austin, TX, USA, we study how the investment decisions would provide benefit for a period of one year. Results show significant profit when using storage devices and solar panels optimally for the system. It is important to note that though our approach can help significantly to take investment decisions, the solution will still be sub-optimal for somebody who needs optimal investment jointly on both storage and solar systems.
|
|
16:15-16:30, Paper WeC05.2 | Add to My Program |
Sample Quantile-Based Programming for Non-Convex Separable Chance Constraints (I) |
|
P. Vinod, Abraham | Mitsubishi Electric Research Laboratories |
Di Cairano, Stefano | Mitsubishi Electric Research Labs |
Keywords: Optimization, Optimization algorithms, Stochastic systems
Abstract: We propose a sampling-based approximation to non-convex, separable, chance constrained optimization problems using sample quantiles. The proposed approach does not require any prior knowledge of the distribution or the moments of the uncertainty, and accommodates chance constraints that are non-convex in the decision variables. We prescribe the finite number of samples and the tightening necessary to produce a feasible solution to the original chance constrained optimization problem with bounded suboptimality. The proposed approximation has a low computational cost since the number of sample-based constraints does not grow with the number of samples, and the number of samples needed is independent of the number of decision variables. We show the effectiveness of the proposed solution in a stochastic motion planning problem with non-convex obstacle avoidance constraints.
|
|
16:30-16:45, Paper WeC05.3 | Add to My Program |
Performance Preserving Controller Reduction in Matlab Using Structured H-Infinity Synthesis |
|
Samson, Edward | University of Minnesota Twin Cities |
|
16:45-17:00, Paper WeC05.4 | Add to My Program |
Robust Tracking Control for Nonlinear Systems: Performance Optimization Via Extremum Seeking |
|
Xu, Jiapeng | University of Windsor |
Tan, Ying | The University of Melbourne |
Chen, Xiang | University of Windsor |
Keywords: Output regulation, Optimization algorithms, Robust control
Abstract: This paper presents a controller design and optimization framework for nonlinear dynamic systems to track a given reference signal in the presence of disturbances when the task is repeated over a finite-time interval. This novel framework mainly consists of two steps. The first step is to design a robust linear quadratic tracking controller based on the existing control structure with a Youla-type filter tilde Q. Secondly, an extra degree of freedom: a parameterization in terms of tilde Q, is added to this design framework. This extra design parameter is tuned iteratively from measured tracking cost function with the given disturbances and modeling uncertainties to achieve the best transient performance. The proposed method is validated with simulation placed on a Furuta inverted pendulum, showing significant tracking performance improvement.
|
|
17:00-17:15, Paper WeC05.5 | Add to My Program |
Novel Matrix Decomposition for Fast and Scalable Model Predicative Control |
|
Adil, Muhammad | Wartsila |
Goyal, Raman | Palo Alto Research Center |
Mostafavi, Saman | Palo Alto Research Center, Inc |
Keywords: Optimization algorithms, Predictive control for linear systems, Large-scale systems
Abstract: This paper presents an inverse-free algorithm that exploits the inherent structural sparsity of a model predictive control problem. The scalability associated with large problem sizes and time horizons is one of the major obstacles in model predictive control applications. We address scalability issues by proposing a novel matrix decomposition technique, coupled with a first-order method, to efficiently solve predictive control problems. This approach solves the system of linear equations in an inverse-free manner. The iterative steps of the proposed approach are computationally efficient, require less memory, and can be easily warm-started based on the solution of the previous horizon. We evaluate the performance of the proposed approach on benchmark model predictive control problems, demonstrating its computational advantages over other state-of-the-art algorithms.
|
|
17:15-17:30, Paper WeC05.6 | Add to My Program |
Traffic Congestion Control Using Distributed Extremum Seeking and Filtered Feedback Linearization Control Approaches |
|
Karimi Shahri, Pouria | UNC Charlotte |
HomChaudhuri, Baisravan | Illinois Institute of Technology |
Pulugurtha, Srinivas | University of North Carolina at Charlotte |
Mesbah, Ali | University of California, Berkeley |
Ghasemi, Amirhossein | University of North Carolina Charlotte |
Keywords: Distributed control, Feedback linearization, Traffic control
Abstract: This paper presents a hierarchical infrastructure-based control algorithm to manage mainstream traffic flow on freeways. At the upper level, a distributed Extremum-Seeking control approach is employed to determine the optimal density of vehicles in a congested cell. The local objective function is defined such that the average flow within the target cell is maximized to resolve the congestion, and the flow difference with its upstream cell is minimized to prevent back-propagating the congestion. At the lower level, a distributed Filtered Feedback Linearization controller is used to update the suggested velocity communicated to the vehicles so that the desired density determined by the upper level can be achieved in each cell. We adopted the METANET model to describe the aggregated dynamics of the traffic network. We tested the performance of these controllers via a MATLAB-VISSIM COM interface. The results demonstrate that the designed distributed controllers can achieve the desired closed-loop performance despite unknown disturbances in an uncertain large-scale traffic network.
|
|
WeC06 Special Session, Sapphire 411B |
Add to My Program |
Student Best Paper Finalists |
|
|
Chair: Hall, Carrie | Illinois Institute of Technology |
Co-Chair: Andersson, Sean B. | Boston University |
|
16:00-16:15, Paper WeC06.1 | Add to My Program |
Student Paper Award Nomination - Nominates Submission 231 for Student Best Paper Award (Mohammad Alali*, Mahdi Imani, Reinforcement Learning Data-Acquiring for Causal Inference of Regulatory Networks) , Nominee Mohammad Alali |
|
Imani, Mahdi | Northeastern University |
|
16:15-16:30, Paper WeC06.2 | Add to My Program |
Student Paper Award Nomination - Nominates Submission 591 for Student Best Paper Award (Yongkai Xie, Zhaojian Wang*, John Pang, Bo Yang, Xin-Ping Guan, Distributed Online Generalized Nash Equilibrium Tracking for Prosumer Energy Trading Games) , Nominee Yongkai Xie |
|
Wang, Zhaojian | Shanghai Jiao Tong University |
|
16:30-16:45, Paper WeC06.3 | Add to My Program |
Student Paper Award Nomination - Nominates Submission 761 for Student Best Paper Award (Eugen Ernst*, Florian Pfaff, Uwe D. Hanebeck, Marcus Baum, the Kernel-SME Filter with Adaptive Kernel Widths for Association-Free Multi-Target Tracking) , Nominee Eugen Ernst |
|
Hanebeck, Uwe D. | Karlsruhe Institute of Technology (KIT) |
|
16:45-17:00, Paper WeC06.4 | Add to My Program |
Student Paper Award Nomination - Nominates Submission 33 for Student Best Paper Award (Mahmoud Abdelgalil*, Asmaa Eldesoukey, Haithem Taha, Singularly Perturbed Averaging with Application to Bio-Inspired 3D Source Seeking) , Nominee Mahmoud Abdelgalil |
|
Taha, Haithem | UNIVERSITY OF CALIFORNIA, IRVINE |
|
17:00-17:15, Paper WeC06.5 | Add to My Program |
Student Paper Award Nomination - Nominates Submission 1196 for Student Best Paper Award (Bryan Lee*, Tetsuya Iwasaki, Analysis of Central Pattern Generators with Weak Nondiffusive Coupling) , Nominee Bryan Lee |
|
Iwasaki, Tetsuya | UCLA |
|
WeC07 Invited Session, Aqua 303 |
Add to My Program |
Mechatronics |
|
|
Chair: Al Janaideh, Mohammad | Memorial University |
Co-Chair: Nagel, William S. | Widener University |
Organizer: Al Janaideh, Mohammad | Memorial University of Newfoundland |
Organizer: Flores, Gerardo | Center for Research in Optics |
Organizer: Heertjes, Marcel | Eindhoven University of Technology |
Organizer: Nagel, William S. | Widener University |
Organizer: Khadraoui, Sofiane | University of Sharjah |
Organizer: Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
|
16:00-16:15, Paper WeC07.1 | Add to My Program |
Reference Modulation for High-Quality Scan Images of Two-Dimensional Laser Scanner Systems (I) |
|
Lee, Youngwoo | Chonnam National University |
Chung, Chung Choo | Hanyang University |
Keywords: Mechatronics, Linear systems
Abstract: For laser scanner systems, the most critical issue for high-quality images is the linearity of the slow axis. The laser scanner systems have various disturbances and uncertainties, such as manufacturing tolerances, structure vibrations, and parameter uncertainties. Further, there are vertical and horizontal resonance modes that hinder controller design. In this paper, we propose a reference modulation technique to enhance the vertical angle linearity of the slow axis for two-dimensional laser scanner systems. First, the laser scanner system is mathematically modeled in the state-space form. Then, the reference modulation is designed to boost the loop gain over the desired frequency range. The reference modulation can guarantee high-quality scan images by redesigning the desired reference based on the output feedback. The current tracking controller was then implemented with virtual references for the states. We investigated its tracking performance via sensitivity function analysis against the disturbances. Experiments were performed to evaluate the effectiveness of the proposed method using four different laser scanners. We confirmed that the proposed method achieved up to 48% improvement in the angle tracking error from the experimental results.
|
|
16:15-16:30, Paper WeC07.2 | Add to My Program |
Consensus of Euler-Lagrange Agents with Internal Model Disturbance Rejection and Interconnection Delays (I) |
|
Nuño, Emmanuel | University of Guadalajara |
Sarras, Ioannis | ONERA |
Yin, Hao | University of Groningen |
Jayawardhana, Bayu | University of Groningen |
Keywords: Control of networks, Robotics, Robust control
Abstract: In this work, a distributed control method to achieve the leaderless consensus of heterogeneous Euler-Lagrange (EL) systems with bounded time-varying communication delays while simultaneously rejecting periodic external disturbances is reported. The robust controller has a simple-to-implement structure of proportional-integral-derivative scheme that employs the internal model approach to reject the disturbance. We consider that the network of EL-systems is interconnected through an undirected weighted graph that is static and we assume that the information exchange between any connected nodes is subjected to bounded variable time-delays. The efficacy of the proposed method is shown in a numerical simulation using a network of ten robotic manipulators.
|
|
16:30-16:45, Paper WeC07.3 | Add to My Program |
Tracking Control of Unmanned Aerial Vehicles Using Only Feedback of Inertial Coordinates (I) |
|
Al Saaideh, Mohammad | Memorial University of Newfoundland |
Boker, Almuatazbellah | Virginia Tech |
Al Janaideh, Mohammad | University of Guelph |
Keywords: Mechatronics, Control applications
Abstract: The aim of this study is to investigate the output feedback tracking control of an Unmanned Aerial Vehicle (UAV) using only measured inertial coordinate information. The proposed approach involves two cascade high-gain observers combined with a full state feedback control based on the backstepping approach. The backstepping controller is designed to solve the tracking control problem of the underactuated system. The observer consists of two cascaded high-gain observers with different speeds, where the faster observer estimates the output position and velocity of the system in three dimensions and feeds a virtual nonlinear output to estimate the Euler angles (pitch, roll, and yaw) and angular velocity. The study shows that the equilibrium point of the full state feedback control system is exponentially stable when the system information is fully known. Simulation results indicate that the output feedback control achieves the tracking control objective and recovers the performance of the state feedback control. Additionally, the study demonstrates the convergence and boundedness of the estimation errors.
|
|
16:45-17:00, Paper WeC07.4 | Add to My Program |
Dahl Hysteresis Modeling and Position Control of Piezoelectric Digital Manipulator (I) |
|
Flores, Gerardo | Center for Research in Optics |
Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
Keywords: Mechatronics, Robotics, Nonlinear output feedback
Abstract: The synthesis of a feedback controller based on a hysteresis model is challenging, mainly when the hysteresis is non-symmetric. In this paper, we propose to model a non-symmetric hysteresis of a piezoelectric actuator by using the Dahl model and design a nonlinear feedback controller for the system. For that aim, we exploit the boundedness of the system states to design an extended observer responsible for estimating the part of the system that is observable, including non-modeled terms and exogenous signals. Then, we design active disturbance rejection control that globally asymptotically stabilizes the tracking error. Finally, simulations were carried out to demonstrate the effectiveness of our approach.
|
|
17:00-17:15, Paper WeC07.5 | Add to My Program |
Output Estimation and Failure Detection in Piezoelectric Actuators Using Transmissibility Operators (I) |
|
Khalil, Abdelrahman | Memorial University of Newfoundland |
Aljanaideh, Khaled | Jordan University of Science and Technology |
Al Janaideh, Mohammad | Memorial University of Newfoundland |
Rakotondrabe, Micky | ENIT Tarbes, INPT, University of Toulouse |
Keywords: Fault detection, Estimation, Flexible structures
Abstract: In this article, we investigate how to identify faulty sensors in piezoelectric actuators used for precise positioning. Four sensors are distributed along the actuator's cantilever structure to measure the deflection (displacement) at various points. We suggest identifying the sensor and detecting the fault in one of the sensors, which is thought to be faulty or producing a degraded signal. To address this, we use transmissibility operators, which are mathematical operators that can be used to estimate sensor measurements based on another set of sensor measurements within the same system. This estimation is highly robust against any external excitations/disturbances, as well as unknown nonlinearities or unmodeled dynamics. The estimation robustness allows for failure detection to be carried out even in the presence of significant actuator hysteresis nonlinearity and outside disturbance. Simulation results with various sensor fault conditions are used to verify the suggested strategy.
|
|
17:15-17:30, Paper WeC07.6 | Add to My Program |
Least Squares Solution for System Identification with Non-Uniform Data under a Coprime Collaborative Sensing Scheme (I) |
|
Ouyang, Jinhua | University of Washington |
Chen, Xu | University of Washington |
Keywords: Identification, Identification for control, Mechatronics
Abstract: This paper presents a least squares formulation and a closed-form solution for identifying dynamical systems using non-uniform data obtained under a coprime collaborative sensing scheme. Specifically, the method combines measurements from two slow output sensors with different sampling rates to estimate the system's dynamics. We provide the theoretical foundation for developing advanced least-squares-based system identification algorithms for cases where the input-output data are sampled at different rates. Demonstrative examples are provided to validate the proposed method, and to show the identification beyond the Nyquist frequency.
|
|
WeC08 Invited Session, Aqua 305 |
Add to My Program |
Estimation and Control of Infinite Dimensional Systems III |
|
|
Chair: Seiler, Peter | University of Michigan, Ann Arbor |
Co-Chair: Peet, Matthew M. | Arizona State University |
Organizer: Demetriou, Michael A. | Worcester Polytechnic Institute |
Organizer: Burns, John A | Virginia Tech |
|
16:00-16:15, Paper WeC08.1 | Add to My Program |
Integral Quadratic Constraints with Infinite-Dimensional Channels (I) |
|
Talitckii, Aleksandr | Arizona State University |
Peet, Matthew M. | Arizona State University |
Seiler, Peter | University of Michigan, Ann Arbor |
Keywords: Robust control, Stability of nonlinear systems, Uncertain systems
Abstract: Modern control theory provides us with a spectrum of methods for studying the interconnection of dynamic systems using input-output properties of the interconnected subsystems. Perhaps the most advanced framework for such input-output analysis is the use of Integral Quadratic Constraints (IQCs), which considers the interconnection of a nominal linear system with an unmodelled nonlinear or uncertain subsystem with known input-output properties. Although these methods are widely used for Ordinary Differential Equations (ODEs), there have been fewer attempts to extend IQCs to infinite-dimensional systems. In this paper, we present an IQC-based framework for Partial Differential Equations (PDEs) and Delay Differential Equations (DDEs). First, we introduce infinite-dimensional signal spaces, operators, and feedback interconnections. Next, in the main result, we propose a formulation of hard IQC-based input-output stability conditions, allowing for infinite-dimensional multipliers. We then show how to test hard IQC conditions with infinite-dimensional multipliers on a nominal linear PDE or DDE system via the Partial Integral Equation (PIE) state-space representation using a sufficient version of the Kalman-Yakubovich-Popov lemma (KYP). The results are then illustrated using four example problems with uncertainty and nonlinearity.
|
|
16:15-16:30, Paper WeC08.2 | Add to My Program |
A Minimax Sliding Mode Control Design on Finite Horizon for Linear Evolution Equations with Switching Modes (I) |
|
Iftime, Orest V. | University of Groningen |
Demetriou, Michael A. | Worcester Polytechnic Institute |
Zhuk, Sergiy | IBM |
Keywords: Distributed parameter systems, Switched systems, Robust control
Abstract: In this paper we consider a family of linear evolution equations in infinite dimensions (Hilbert spaces) with initial state, input and output bounded uncertainty, and allow the possibility of switching between the given systems. The achievable measurement/actuator location will be fixed over certain time intervals. Based on uncertain output measurements, we use a minimax sliding mode control approach to design a switching controller which steers a state to a finite-dimensional hyperplane in finite time. We show that the switching controller provides an optimal solution to a particular optimal mini- max sliding mode control problem with switching modes and state/measurement/input disturbances. The proposed approach is summarized in an algorithm and it is illustrated through a numerical study on a family of delay evolution equations with switching modes and bounded disturbances.
|
|
16:30-16:45, Paper WeC08.3 | Add to My Program |
Nonlinear Local Control of the Safety-Factor-Profile Gradient at Moving Spatial Locations in Tokamak Plasmas (I) |
|
Paruchuri, Sai Tej | Lehigh University |
Pajares, Andres | General Atomics |
Schuster, Eugenio | Lehigh University |
Keywords: Distributed parameter systems, Emerging control applications, Feedback linearization
Abstract: Tokamaks are toroidal devices that confine a very hot plasma (hydrogenic ionized gas) by using strong magnetic fields. When the kinetic energy is high, positively charged nuclei in the plasma can overcome the Coulombic forces of repulsion and fuse to form a heavier nucleus. A tremendous amount of energy is released during this reaction. The pitch of the magnetic field in a tokamak, measured by the safety factor profile, plays a crucial role in ensuring the magnetohydrodynamic (MHD) stability of the tokamak plasma. MHD instabilities like the Neoclassical Tearing Mode (NTM), which can deteriorate or even terminate plasma confinement, can appear at regions in the tokamak where the safety factor profile assumes a rational value. Since the safety factor profile is a continuous function of location in the tokamak, rational values at specific locations are inevitable. Controlling the gradient of the safety factor profile at these locations can prevent or mitigate the effect of MHD instabilities. In this work, a one-dimensional model that approximates the safety factor gradient dynamics at one of the locations where the safety factor achieves a rational value is developed. A controller based on feedback linearization of this model is designed to track a target gradient value in the steady-state scenario. The effectiveness of this controller is demonstrated in nonlinear numerical simulations powered by the Control Oriented Transport SIMulator (COTSIM) for a DIII-D tokamak scenario.
|
|
16:45-17:00, Paper WeC08.4 | Add to My Program |
Energy Amplification of Stochastically-Forced Hypersonic Blunt Body Flows (I) |
|
Dwivedi, Anubhav | University of Southern California |
Jovanovic, Mihailo R. | University of Southern California |
Keywords: Distributed parameter systems, Fluid flow systems, Linear systems
Abstract: We utilize stochastically-forced compressible linearized Navier-Stokes equations to study the dynamics of hypersonic flows over blunt bodies. Our analysis of the energy of the flow fluctuations around the laminar stagnated flow reveals strong amplification of specific streamwise and spanwise length scales. We also provide insights into how changes in different physical parameters, such as the temperature of the blunt body and its curvature, influence the amplification of flow fluctuations. We show that increasing the bluntness and decreasing the wall temperature can significantly enhance the amplification of flow fluctuations. Our approach offers a systematic control-theoretic framework for quantifying the influence of stochastic excitation sources (e.g., free-stream turbulence and surface roughness) that are unavoidable in experiments and paves the way for the development of control-oriented models of hypersonic flows over blunt bodies.
|
|
17:00-17:15, Paper WeC08.5 | Add to My Program |
On Resource-Constrained Stabilization of Nonlinear Parabolic PDE Systems with Parametric Drift (I) |
|
Zedan, Amr | University of California Davis |
El-Farra, Nael H. | University of California, Davis |
Keywords: Distributed parameter systems, Networked control systems, Process Control
Abstract: In this work, we consider the problem of stabilization of spatially-distributed dynamical systems described by nonlinear parabolic PDEs controlled over resource-constrained sensor-controller communication channels subject to process parametric variations. A framework for augmenting model-based feedback control with error-triggered parameter re-identification is developed to address this problem. The goal is to maintain closed-loop stability in the presence of varying levels of plant-model mismatch during periods of parametric drift, while simultaneously keeping the sensor-controller communication rate to a minimum and maintaining acceptable levels of closed-loop performance. Initially, a stabilizing nonlinear state feedback controller based on an approximate finite-dimensional model of the infinite-dimensional system is designed. The controller utilizes model-generated state estimates which are periodically updated using the available state measurements. An estimate of the maximum allowable update period is obtained and characterized in terms of the parametric uncertainty and the controller design parameters. An error monitoring scheme with a time-varying instability alarm threshold is then devised to determine on-line if, and when, the model parameters need to be updated. A breach of the instability threshold at some time triggers a safe-parking phase in which the sensor-controller communication rate is adjusted to mitigate the destabilizing impact of increased plant-model mismatch. The measurements collected during the safe-parking mode are used to obtain new estimates of the process parameters using nonlinear grey-box parameter estimation techniques. An explicit characterization of the closed-loop stability region associated with the new model parameters is obtained to determine the appropriate post-drift sensor-controller communication rate that should be used when the model parameters are updated. The implementation of the proposed methodology is illustrated using a representative diffusion-reaction process example with a time varying parametric drift.
|
|
17:15-17:30, Paper WeC08.6 | Add to My Program |
Adaptive Stabilization of the Kuramoto-Sivashinsky Equation Subject to Intermittent Sensing |
|
Belhadjoudja, Mohamed Camil | Gipsa Lab / Cnrs |
Maghenem, Mohamed Adlene | Gipsa Lab, CNRS, France |
Witrant, Emmanuel | Université Grenoble Alpes |
Prieur, Christophe | CNRS |
Keywords: Distributed parameter systems, Lyapunov methods, Adaptive control
Abstract: We study in this paper the one-dimensional Kuramoto-Sivashinsky equation (KS), subject to intermittent sensing. Namely, we measure the state on a sub-interval of the spatial domain during certain intervals of time, and we measure the state on the remaining sub-interval of space during the remaining intervals of time. As a result, we assign an active control at the boundaries of the spatial domain, and we set a zero boundary condition at the junction of the two spatial sub-intervals. Under the assumption that the destabilizing coefficient is unknown, we design adaptive boundary controllers that guarantee global exponential stability (GES) of the trivial solution in the L2 norm. Numerical simulations are performed to illustrate our results
|
|
WeC09 Regular Session, Aqua 307 |
Add to My Program |
Estimation III |
|
|
Chair: Namerikawa, Toru | Keio University |
Co-Chair: Li, Shengbo Eben | Tsinghua University |
|
16:00-16:15, Paper WeC09.1 | Add to My Program |
Generalized Moving Horizon Estimation for Nonlinear Systems with Robustness to Measurement Outliers |
|
Cao, Wenhan | Tsinghua University |
Liu, Chang | Cornell University |
Lan, Zhiqian | Tsinghua University |
Piao, Yingxi | Tsinghua University |
Li, Shengbo Eben | Tsinghua University |
Keywords: Estimation, Filtering
Abstract: Moving horizon estimation (MHE) is an effective filtering technique for nonlinear systems subjected to arbitrary, non-Gaussian noise distributions. While there has been noticeable progress in the stability analysis of MHE, there is lack of research on robustifying MHE against measurement outliers. To bridge this gap, we propose a generalized MHE approach by utilizing the loss-theoretic perspective of Generalized Bayesian Inference. In particular, we design a robust loss function by leveraging the β-divergence and propose the β moving horizon estimator to handle the outliers. Analytical influence functions are derived to analyze the robustness of the MHE methods. Based on this, we prove that for the case of linear Gaussian systems, the gross error sensitivity of the proposed estimator remains bounded, while for the standard MHE, it is unbounded. The effectiveness of the proposed approach is demonstrated in simulations on both linear and nonlinear systems, where the estimator outperforms commonly used linear and nonlinear filters while keeping the computational overhead comparable to that of the standard MHE.
|
|
16:15-16:30, Paper WeC09.2 | Add to My Program |
Analysis of the Potential of Onboard Vehicle Sensors for Model-Based Maximum Friction Coefficient Estimation |
|
Lampe, Nicolas | Osnabrück University of Applied Sciences |
Ziaukas, Zygimantas | Institute of Mechatronic Systems, Leibniz Universität Hannover |
Westerkamp, Clemens | Osnabrück University of Applied Sciences |
Jacob, Hans-Georg | Leibniz University Hannover |
Keywords: Estimation, Kalman filtering, Grey-box modeling
Abstract: Advanced driver assistance systems (ADAS) have led to a steady improvement in driving comfort and safety. Knowledge of vehicle dynamics and perception of the vehicle's environment are necessary for optimized ADAS and autonomous driving. A crucial parameter influencing vehicle dynamics is the maximum friction coefficient between the tire and the road. As this coefficient cannot be measured economically in serial production cars via existing vehicle sensors, model-based estimation algorithms are a field of interest. In this paper, maximum friction coefficient estimation is presented using an unscented Kalman filter (UKF) based on onboard vehicle sensors such as a six degrees of freedom inertial measurement unit, height level sensors, and tie rod force sensors. The goal is to analyze the potential of these sensors for maximum friction coefficient estimation. First, a variance-based sensitivity analysis is used to analyze the physical vehicle model of a Dacia Duster. Second, model-based maximum friction coefficient estimation is implemented for the test vehicle and the results using different sensor settings are compared for driving maneuvers carried out on a test track with different road surfaces. Finally, model-based maximum friction coefficient estimation using these onboard vehicle sensors shows improved results compared to the UKF from previous works.
|
|
16:30-16:45, Paper WeC09.3 | Add to My Program |
Quantifying the Estimation Error for a Constant-Gain Tracker |
|
Wei, Shihong | Johns Hopkins University |
Spall, James C. | Johns Hopkins Univ |
Keywords: Estimation, Kalman filtering, Nonlinear systems identification
Abstract: We present a new method of calculating approximate probabilistic bounds for the state estimation error of a constant-gain Kalman filter in nonlinear multivariate systems. A critical part of the method is an analysis of the convergence in distribution and the mean-square error of the algorithm. Based on such analysis, a surrogate error process is constructed to mimic the dynamics of the unknown tracking error. We analyze the distance between the true filtering error and the surrogate process and show that it can be bounded under some circumstances. The surrogate process provides a means of computing probabilistic bounds and confidence regions for the tracking error.
|
|
16:45-17:00, Paper WeC09.4 | Add to My Program |
SLAM Backends with Objects in Motion: A Unifying Framework and Tutorial |
|
Chiu, Chih-Yuan | University of California, Berkeley |
Keywords: Estimation, Kalman filtering, Numerical algorithms
Abstract: Simultaneous Localization and Mapping (SLAM) algorithms are frequently deployed to support a wide range of robotics applications, such as autonomous navigation in unknown environments, and scene mapping in virtual reality. Many of these applications require autonomous agents to perform SLAM in highly dynamic scenes. To this end, this tutorial extends a recently introduced, unifying optimization-based SLAM backend framework to environments with moving objects and features. Using this framework, we consider a rapprochement of recent advances in dynamic SLAM. Moreover, we present dynamic EKF SLAM: a novel, filtering-based dynamic SLAM algorithm generated from our framework, and prove that it is mathematically equivalent to a direct extension of the classical EKF SLAM algorithm to the dynamic environment setting. Empirical results with simulated data indicate that dynamic EKF SLAM can achieve high localization and mobile object pose estimation accuracy, as well as high map precision, with high efficiency.
|
|
17:00-17:15, Paper WeC09.5 | Add to My Program |
On the Lack of Robustness of Observers for Systems with Uncertain, Unstable Dynamics |
|
Kamaldar, Mohammadreza | University of Michigan |
Goel, Ankit | University of Maryland Baltimore County |
Islam, Syed Aseem Ul | University of Michigan |
Bernstein, Dennis S. | Univ. of Michigan |
Keywords: Estimation, Kalman filtering, Observers for Linear systems
Abstract: We consider the robustness of state estimation for linear, time-invariant systems. Since state estimation is dual to full-state feedback, it may be expected that stability of the error dynamics depends continuously on perturbations of the dynamics matrix. This paper shows, however, that, if the system dynamics are unstable, then, regardless of how the filter gain is chosen, there always exist arbitrarily small perturbations of the system dynamics that give rise to unbounded state-estimation error. Since this phenomenon cannot occur in full-state feedback control, this result reveals a surprising breakdown in the duality between estimation and control.
|
|
17:15-17:30, Paper WeC09.6 | Add to My Program |
Distributed Point-Mass Filter with Reduced Data Transfer Using Copula Theory |
|
Matoušek, Jakub | University of West Bohemia |
Dunik, Jindrich | University of West Bohemia |
Forsling, Robin | Linköping University |
Keywords: Estimation, Kalman filtering, Sensor fusion
Abstract: This paper deals with distributed Bayesian state estimation of generally nonlinear stochastic dynamic systems. In particular, distributed point-mass filter algorithm is developed. It is comprised of a basic part that is accurate but data intense and optional step employing advanced copula theory. The optional step significantly reduces data transfer for the price of a small accuracy decrease. In the end, the developed algorithm is numerically compared to the usually employed distributed extended Kalman filter.
|
|
WeC10 Regular Session, Aqua 309 |
Add to My Program |
Agents-Based Systems III |
|
|
Chair: Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Co-Chair: Chen, Yijun | University of Sydney |
|
16:00-16:15, Paper WeC10.1 | Add to My Program |
Decentralized Multi-Agent Motion Planning in Dynamic Environments |
|
Netter, Josh | Georgia Institute of Technology |
Vamvoudakis, Kyriakos G. | Georgia Inst. of Tech |
Keywords: Autonomous systems, Autonomous robots, Agents-based systems
Abstract: In this paper we present a decentralized multi-agent motion planning algorithm for navigation in dynamic environments. Each agent constructs a graph of boundary value problems in the environment considering their own kinodynamic constraints using a learning-based motion planning framework. A game-theoretic approach is then used by each agent to select their individual path through the environment while considering the planned motion of other agents. This path is updated online to ensure collisions are avoided, and to provide a method of counteracting the freezing robot problem. The effectiveness of the algorithm is illustrated in simulations.
|
|
16:15-16:30, Paper WeC10.2 | Add to My Program |
Optimal Persistent Monitoring of Mobile Targets in One Dimension |
|
Hall, Jonas | Boston University |
Andersson, Sean B. | Boston University |
Cassandras, Christos G. | Boston University |
Keywords: Agents-based systems, Optimal control, Hybrid systems
Abstract: This work shows the existence of optimal control laws for persistent monitoring of mobile targets in a one-dimensional mission space and derives explicit solutions. The underlying performance metric consists of minimizing the total uncertainty accumulated over a finite mission time. We first demonstrate that the corresponding optimal control problem can be reduced to a finite-dimensional optimization problem, and then establish existence of an optimal solution. Motivated by this result, we construct a parametric reformulation for which an event based gradient descent method is utilized with the goal of deriving (locally optimal) solutions. We additionally provide a more practical parameterization that has attractive properties such as simplicity, flexibility, and robustness. Both parameterizations are validated through simulation.
|
|
16:30-16:45, Paper WeC10.3 | Add to My Program |
Formation Control for a Class of Nonlinear Multi-Agent Systems Using Three-Layer Neural Networks |
|
Aryankia, Kiarash | Concordia University |
Selmic, Rastko | Concordia University |
Keywords: Agents-based systems, Stability of nonlinear systems, Lyapunov methods
Abstract: This paper considers a leader-following formation control problem for a class of second-order, uncertain, input-affine, nonlinear multi-agent systems modeled by a directed graph. A three-layer neural network (NN) is proposed with an input layer, two hidden layers, and an output layer to approximate an unknown nonlinearity. The NN weights tuning laws were derived using the Lyapunov theory. The leader-following and formation control problem was addressed using a robust integral of the sign of the error (RISE) feedback and a NN-based control. The RISE feedback term compensates for an unknown leader dynamics and the bounded disturbance in the agent error dynamics. The NN-based term compensates for the unknown nonlinearity in the dynamics of agents. Semi-global asymptotic tracking results were rigorously proven using the Lyapunov stability theory. The numerical simulation results show the effectiveness of the proposed method.
|
|
16:45-17:00, Paper WeC10.4 | Add to My Program |
Target Defense against Periodically Arriving Intruders |
|
Pourghorban, Arman | University of North Carolina at Charlotte |
Maity, Dipankar | University of North Carolina at Charlotte |
Keywords: Agents-based systems, Robotics, Game theory
Abstract: We consider a variant of pursuit-evasion games where a single defender is tasked to defend a static target from a sequence of periodically arriving intruders. The intruders’ objective is to breach the boundary of a circular target without being captured and the defender’s objective is to capture as many intruders as possible. At the beginning of each period, a new intruder appears at a random location on the perimeter of a fixed circle surrounding the target and moves radially towards the target center to breach the target. The intruders are slower in speed compared to the defender and they have their own sensing footprint through which they can perfectly detect the defender if it is within their sensing range. Considering the speed and sensing limitations of the agents, we analyze the entire game by dividing it into partial information and full information phases. We address the defender’s capturability using the notions of engagement surface and capture circle. We develop and analyze three efficient strategies for the defender and derive a lower bound on the capture fraction. Finally, we conduct a series of simulations and numerical experiments to compare and contrast the three proposed approaches.
|
|
17:00-17:15, Paper WeC10.5 | Add to My Program |
Network Learning from Best-Response Dynamics in LQ Games |
|
Chen, Yijun | University of Sydney |
Ding, Kemi | Hong Kong University of Science and Technology |
Shi, Guodong | The University of Sydney |
Keywords: Agents-based systems, Identification, Learning
Abstract: In this paper, we focus on network structure inference problem for linear-quadratic (LQ) games from best-response dynamics. An adversary is considered to have no knowledge of the game network structure but have the ability to observe all players' best-response actions and manipulate some players' actions. This work presents a comprehensive framework for network learning from best-response dynamics in LQ games. First of all, we establish theoretic results that characterize network structure identifiability and provide numerical examples to demonstrate the usefulness of our theoretic results. Next, in the face of the inherent stability and sparsity constraints for the game network structure, we propose an information-theoretic stable and sparse system identification algorithm for learning the network structure. Finally, the effectiveness of the proposed learning algorithm is tested. The connection between network structure inference problem and classical system identification theory is covered by our work, which advances the literature.
|
|
17:15-17:30, Paper WeC10.6 | Add to My Program |
Non-Sequential Decentralized Stochastic Control and Static Reduction |
|
Simpson, Ryan | Queen's University |
Yuksel, Serdar | Queen's University |
Keywords: Stochastic optimal control, Uncertain systems, Agents-based systems
Abstract: In decentralized stochastic control (or stochastic team theory) and game theory, if there is a pre-defined order in a system in which agents act, the system is called sequential, otherwise it is non-sequential. Much of the literature on stochastic control theory, such as studies on the existence analysis, approximation methods, and on dynamic programming or other analytical or learning theoretic methods, have focused on sequential systems. The static reduction method for sequential control problems (via change of measures or other techniques), in particular, has been shown to be very effective in arriving at existence, structural, approximation and learning theoretic results. Many practical systems, however, are non-sequential where the order of agents acting is random, and dependent on the realization of solution paths and prior actions taken. The study of such systems is particularly challenging as tools applicable for sequential models are not directly applicable. In this paper, we will study static reducibility of non-sequential stochastic control systems, including by change of measure methods. We revisit the notion of Causality (a definition due to Witsenhausen and which has been refined by Andersland and Tekenetzis), and provide an alternative representation using imaginary agents. Via this representation, we show that Causality, under an absolute continuity condition, allows for an equivalent static model whose reduction is policy-independent. This facilitates much of the stochastic analysis available for sequential systems to also be applicable for non-sequential systems. We further show that under more relaxed conditions on the model, such as solvability, such a reduction, when possible at all, is policy-dependent or includes policies as parameters in the cost of the reduced model, and thus has limited utility. We will also present a further reduction method for partially nested causal non-sequential systems.
|
|
WeC11 Regular Session, Aqua Salon AB |
Add to My Program |
Game Theory III |
|
|
Chair: Marden, Jason R. | University of California, Santa Barbara |
Co-Chair: Malikopoulos, Andreas A. | University of Delaware |
|
16:00-16:15, Paper WeC11.1 | Add to My Program |
When Would Online Platforms Pay Data Dividends? |
|
Kudva, Sukanya | UC Berkeley |
Aswani, Anil | UC Berkeley |
Keywords: Game theory, Modeling, Optimization
Abstract: Online platforms, including social media and search platforms, have routinely used their users' data for targeted ads, to improve their services, and to sell to third-party buyers. But an increasing awareness of the importance of users' data privacy has led to new laws regulating platform data-sharing. Further, there have been political discussions on introducing data dividends, that pay users for their data. Three interesting questions are then: When would these online platforms be incentivized to pay data dividends? How does their decision depend on whether users value their privacy more than the platform's free services? And should platforms invest in protecting users' data? This paper considers various factors affecting the users' and platform's decisions through utility functions. We construct a principal-agent model using a Stackelberg game to calculate their optimal decisions and qualitatively discuss the implications. Our results could inform a policymaker trying to understand the consequences of mandating data dividends.
|
|
16:15-16:30, Paper WeC11.2 | Add to My Program |
Mobility Equity and Economic Sustainability Using Game Theory |
|
Chremos, Ioannis Vasileios | University of Delaware |
Malikopoulos, Andreas A. | University of Delaware |
Keywords: Game theory
Abstract: In this paper, we consider a multi-modal mobility system of travelers each with an individual travel budget, and propose a game-theoretic framework to assign each traveler to a ``mobility service" (each one representing a different mode of transportation). We are interested in equity and sustainability, thus we maximize the worst-case revenue of the mobility system while ensuring ``mobility equity," which we define it in terms of accessibility. In the proposed framework, we ensure that all travelers are truthful and voluntarily participate under informational asymmetry, and the solution respects the individual budget of each traveler. Each traveler may seek to travel using multiple services (e.g., car, bus, train, bike). The services are capacitated and can serve up to a fixed number of travelers at any instant of time. Thus, our problem falls under the category of many-to-one assignment problems, where the goal is to find the conditions that guarantee the stability of assignments. We formulate a linear program of maximizing worst-case revenue under the constraints of mobility equity, and we fully characterize the optimal solution.
|
|
16:30-16:45, Paper WeC11.3 | Add to My Program |
Cost Design in Atomic Routing Games |
|
Yu, Yue | The University of Texas at Austin |
Chen, Shenghui | University of Texas at Austin |
Fridovich-Keil, David | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Game theory, Network analysis and control, Optimization
Abstract: An atomic routing game is a multiplayer game on a directed graph. Each player in the game chooses a path---a sequence of links that connect its origin node to its destination node---with the lowest cost, where the cost of each link is a function of all players' choices. We develop a novel numerical method to design the link cost function in atomic routing games such that the players' choices at the Nash equilibrium minimize a given smooth performance function. This method first approximates the nonsmooth Nash equilibrium conditions with smooth ones, then iteratively improves the link cost function via implicit differentiation. We demonstrate the application of this method to atomic routing games that model noncooperative agents navigating in grid
|
|
16:45-17:00, Paper WeC11.4 | Add to My Program |
A Weakest-Link Extension Theorem for General Lotto Games |
|
Aghajan, Adel | University of California Santa Barbara |
Paarporn, Keith | University of Colorado, Colorado Springs |
Marden, Jason R. | University of California, Santa Barbara |
Keywords: Game theory, Network analysis and control
Abstract: The General Lotto game is a well-studied model where two opposing players strategically allocate a limited amount resources to multiple contests. In the classic setup, each contest represents an individual battlefield with an associated value, and the objective is to maximize the accumulated value by winning individual battlefields. In this paper, we consider scenarios beyond the classic setup, where (i) success on a contest can depend on securing subsets of battlefields, and (ii) the winner of a battlefield can be based on alternate winning rules other than the classic winner-take-all rule. Our main results demonstrate that having an equilibrium solution to a single contest scenario can provide immediate equilibrium characterizations for the weakest-link extension (best-shot), where one player must win all (at least one) of the constituent contests in order to earn any value. We highlight the applicability of the derived theory on network defense problems.
|
|
17:00-17:15, Paper WeC11.5 | Add to My Program |
Strategic Information Design in Quadratic Multidimensional Persuasion Games with Two Senders |
|
Velicheti, Raj Kiriti | University of Illinois at Urbana Champaign |
Bastopcu, Melih | University of Illinois Urbana Champaign |
Basar, Tamer | Univ of Illinois, Urbana-Champaign |
Keywords: Game theory, Optimization, Intelligent systems
Abstract: In the classical communication setting, multiple senders having access to the same source of information and transmitting it over channel(s) to a receiver, in general, leads to a decrease in estimation error at the receiver. However, if the objectives of the information providers are different from that of the estimator, this might result in interesting strategic interactions. In this work, we consider a hierarchical signaling game between two senders (information designers) and a single receiver (decision maker) each having their own, possibly misaligned, objectives. The senders lead the game by committing to individual information disclosure policies simultaneously, within the framework of a Nash game among themselves. This is followed by the receiver's action decision. With Gaussian information structure and quadratic objectives (which depend on underlying state and receiver's action) for all the players, we show that in general the equilibrium is not unique. While we show that full revelation of the state is always an equilibrium, we propose an algorithm to achieve non trivial equilibria. Through simulations we show that misalignment between senders' objectives is beneficial for the receiver.
|
|
17:15-17:30, Paper WeC11.6 | Add to My Program |
Inverse Matrix Games with Unique Quantal Response Equilibrium |
|
Yu, Yue | The University of Texas at Austin |
Salfity, Jonathan | University of Texas at Austin |
Fridovich-Keil, David | The University of Texas at Austin |
Topcu, Ufuk | The University of Texas at Austin |
Keywords: Game theory, Optimization
Abstract: In an inverse game problem, one needs to infer the cost function of the players in a game such that a desired joint strategy is a Nash equilibrium. We study the inverse game problem for a class of multiplayer matrix games, where the cost perceived by each player is corrupted by random noise. We provide sufficient conditions for the players' quantal response equilibrium---a generalization of the Nash equilibrium to games with perception noise---to be unique. We develop efficient optimization algorithms for inferring the cost matrix based on semidefinite programs and bilevel optimization. We demonstrate the application of these methods in encouraging collision avoidance and fair resource allocation.
|
|
WeC12 Invited Session, Aqua Salon C |
Add to My Program |
Advanced Controls in Vehicle Electrification |
|
|
Chair: Borhan, Hoseinali | Cummins Inc |
Co-Chair: Chen, Pingen | Tennessee Technological University |
Organizer: Chen, Pingen | Tennessee Technological University |
Organizer: Drallmeier, Joseph | University of Michigan |
Organizer: Ghasemi, Amirhossein | University of North Carolina Charlotte |
Organizer: Borhan, Hoseinali | Cummins Inc |
|
16:00-16:15, Paper WeC12.1 | Add to My Program |
Optimal Dispatch & Routing of Electrified Heavy-Duty Truck Fleets: A Sensitivity Analysis with Fleet Data (I) |
|
Wang, Ruiting | University of California, Berkeley |
Zeng, Teng | University of California, Berkeley |
Keyantuo, Patrick | University of California, Berkeley |
Sandoval, Jairo | Cummins |
Vishwanath, Aashrith | Cummins Inc |
Borhan, Hoseinali | Cummins Inc |
Moura, Scott | University of California, Berkeley |
Keywords: Multivehicle systems, Optimization, Transportation networks
Abstract: Electrifying the trucking fleet has the potential to substantially reduce the carbon footprint of logistics. However, fleet electrification also poses significant operational challenges. This study provides an up-to-date, realistic case study on opti- mal dispatch and routing of a heterogeneous fleet of heavy-duty trucks with the goal to improve the economic and environmental benefits of electrification. A fleet management optimization model incorporating detailed energy consumption modeling was proposed, and applied to real-world fleet demand data for practical insights. The results from numerical experiments show the decision complexities of fleet management with the proposed optimization. We demonstrate that it is more cost- effective to electrify half of the fleet when the diesel (USD/gal) versus electricity (USD/kWh) price ratio is from 20 to 30. These findings provide a critical reference point for challenging the misconception that electrified fleets are always more expensive. Additionally, this analysis reveals how policymakers can use other pathways and tax policies to close the total cost of ownership gap between diesel and electric trucks to encourage the electrification of heavy-duty trucks.
|
|
16:15-16:30, Paper WeC12.2 | Add to My Program |
Rear Vehicle Tracking on a Smart E-Scooter (I) |
|
Alai, Hamidreza | University of Minnesota |
Jeon, Woongsun | University of Minnesota |
Alexander, Lee | University of Minnesota |
Rajamani, Rajesh | Univ. of Minnesota |
Keywords: Estimation, IVHS, Mechatronics
Abstract: This paper develops an active sensing system for protection of an e-scooter from car-scooter collisions. The objective is to track the trajectories of cars behind the e-scooter and predict any real-time danger to the e-scooter. If the danger of being hit by a car is predicted, then a loud horn-like audio warning is sounded to alert the car driver to the presence of the scooter. A low-cost single-beam laser sensor is chosen for measuring the positions of cars behind the scooter. The sensor is mounted on a stepper motor and the region behind the scooter is scanned to detect vehicles. Once a vehicle is detected, its trajectory is tracked in real-time by using feedback control to focus the orientation of the laser sensor in real-time so as to make measurements of the right front corner of the vehicle. A nonlinear vehicle model and a nonlinear observer are used to estimate the trajectory variables of the tracked car. The estimated states are used in a receding horizon controller that controls the real-time position of the laser sensor to focus on the vehicle. The developed system is implemented on a Ninebot e-scooter platform. Simulation results with multiple vehicle maneuvers show that the closed-loop system is able to accurately track trajectories of rear vehicles that can pose a danger to the e-scooter.
|
|
16:30-16:45, Paper WeC12.3 | Add to My Program |
Real Time Passenger Mass Estimation for E-Scooters (I) |
|
Leoni, Jessica | Politecnico Di Milano |
Strada, Silvia | Politecnico Di Milano |
Tanelli, Mara | Politecnico Di Milano |
Savaresi, Sergio M. | Politecnico Di Milano |
Keywords: Machine learning, Grey-box modeling, Pattern recognition and classification
Abstract: Dockless electric scooters have proven, nowadays, to be a competitive solution in the urban micro-mobility environment. However, recent studies underline how electric scooters’ sustainability is closely related to their lifetime and recharging frequency. Since these depend significantly on the onboard mass, this paper presents two approaches, i.e., gray-box and black-box, to estimate this parameter relying only on the measurements from GPS and IMU sensors installed on the e-scooter. The robustness and the accuracy of the mass estimation is tested on extensive datasets collected in ad-hoc experimental campaigns.
|
|
16:45-17:00, Paper WeC12.4 | Add to My Program |
Energy Management of Electric Truck Fleet Considering Cargo Load, Platooning, and Charging (I) |
|
Wu, Yun | Tennessee Technological University |
Chen, Pingen | Tennessee Technological University |
Keywords: Automotive control, Automotive systems
Abstract: Heavy-duty trucks are currently contributing to a significant portion of greenhouse gas emissions from the transportation sector. Electrification of heavy-duty vehicle fleet can potentially reduce the energy consumption for freight delivery due to high powertrain efficiency. However, the high cost of vehicles due to large battery size and range anxiety due to limited charging infrastructures are considered as the main barriers for adoption. The main objective of this paper is to design a comprehensive platoon management framework for an electric truck fleet which incorporates cargo allocation, platooning sequence change, and charging time coordination. Three planning problems are formulated in this paper for different phases of freight delivery by electric truck fleets and analytical solutions are offered for each scenario. Linear Programming was used as a backup method when analytical solutions are infeasible. Numerical results demonstrated that the proposed solutions can potentially reduce the energy consumption or charging time for an electric truck fleet.
|
|
17:00-17:15, Paper WeC12.5 | Add to My Program |
HEV Energy Management Strategy Based on TD3 with Prioritized Exploration and Experience Replay (I) |
|
He, Yu | University of Michigan Dearborn |
Kim, Youngki | University of Michigan - Dearborn |
Keywords: Optimal control, Automotive control, Automotive systems
Abstract: This paper presents a novel energy management strategy for hybrid electric vehicles (HEVs) that is based on an expert twin-delayed deep deterministic policy gradient with prioritized exploration and experience replay (TD3-PEER). State-of-the-art TD3 requires critic networks to generate predicted Q value for state-action pairs to update a policy network. However, the critic networks may struggle with predicting Q values for certain states when the Q values of these states are sensitive to action selection. To address this issue, this paper proposes a prioritized exploration technique that encourages the agent to visit action-sensitive states more frequently in the application of HEV energy management. The proposed algorithm is tested and validated on a P0+P4 HEV model. To simplify the control design, a motor activation threshold is introduced into the final layer of the agent's actor. In addition, dynamic programming results are incorporated into the training of the TD3, helping the agent avoid inefficient operations. Simulation results demonstrate that with expert knowledge considered for all learning-based methods, the proposed TD3-PEER outperforms other RL-based energy management strategies, including DDPG-PER and deep Q-network, by an average of 2.3% and 3.74% over the training and validation cycles, respectively.
|
|
17:15-17:30, Paper WeC12.6 | Add to My Program |
Study on the Benefits of Integrated Battery and Cabin Thermal Management in Cold Weather Conditions (I) |
|
Hajidavalloo, Mohammad | Michigan State University |
Chen, Jun | Oakland University |
Hu, Qiuhao | University of Michigan |
Li, Zhaojian | Michigan State University |
Keywords: Automotive systems, Energy systems, Predictive control for nonlinear systems
Abstract: To expand the global adoption of electric vehicles (EVs), improving their driving range is of utmost importance. One of the major obstacles along the way is the degraded EV performance in extremely cold or hot environments, where significant amount of energy is used for cabin and battery temperature regulation while the battery’s power and energy capacity are also impeded. To mitigate this issue, we present an integrated cabin and battery thermal management system to simultaneously optimize battery and cabin temperatures in real time. A new nonlinear model predictive control (NMPC)- based thermal management strategy is developed to achieve cabin temperature regulation and driving range maximization. The benefits of the proposed integrated thermal management (ITM) of battery and cabin are investigated for cold-temperature driving in various scenarios. Simulation results identify several important factors that affect the EV driving range, and we show that up to 7-13% range improvement, relative to the case where only cabin heating is considered, can be achieved using the proposed NMPC-based ITM strategy.
|
|
WeC13 Regular Session, Aqua Salon D |
Add to My Program |
Decentralized Control and Distributed Parameter Systems |
|
|
Chair: Diagne, Mamadou | University of California San Diego |
Co-Chair: Li, Na | Harvard University |
|
16:00-16:15, Paper WeC13.1 | Add to My Program |
Control Barrier Function Based Decentralized UAV Swarm Navigation While Preserving Connectivity without Explicit Communication |
|
Palani, Thiviyathinesvaran | Kyoto University of Advanced Science |
Fukushima, Hiroaki | Kyoto University of Advanced Science |
Izuhara, Shunsuke | Kyoto University of Advanced Science |
Keywords: Decentralized control, Cooperative control, Robotics
Abstract: This paper proposes a decentralized leader-follower navigation method for a group of unmanned aerial vehicles (UAVs) in environments with obstacles. The proposed method is based on control barrier functions (CBFs) and is designed to overcome the limitations of current approaches that use artificial potential functions (APFs), which may in some cases cause undesired vibratory movements. The algorithm presented in this study computes control inputs using only local sensor information and without explicit communication between the robots, with the aim of ensuring collision avoidance, obstacle avoidance, and connectivity preservation in the group. The control inputs are derived without relying on numerical solution techniques, and the proposed approach aims to prevent constraint violations from occurring before they happen. The performance of the proposed approach is compared to the APF-based approach in the context of vibratory movements, and simulation results demonstrate its ability to achieve smoother robot movements. Moreover, the effectiveness of the proposed approach is confirmed through experimental validation with quadrotors.
|
|
16:15-16:30, Paper WeC13.2 | Add to My Program |
On the Optimal Control of Network LQR with Spatially-Exponential Decaying Structure |
|
Zhang, Runyu | Harvard University |
Li, Weiyu | Harvard University |
Li, Na | Harvard University |
Keywords: Decentralized control, Networked control systems, Linear systems
Abstract: This paper studies network LQR problems with system matrices being spatially-exponential decaying (SED) between nodes in the network. The major objective is to study whether the optimal controller also enjoys a SED structure, which is an appealing property for ensuring the optimality of decentralized control over the network. We start with studying the open-loop asymptotically stable system and show that the optimal LQR state feedback gain K is `quasi'-SED in this setting, i.e. |[K]_{ij}|sim Oleft(e^{-frac{beta}{textup{poly}ln(N)}dist(i,j)}rig ht). The decaying rate beta depends on the decaying rate and norms of system matrices and the open-loop exponential stability constants. Then the result is further generalized to unstable systems under a stabilizability assumption. Building upon the `quasi'-SED result on K, we give an upper-bound on the performance of kappa-truncated local controllers, suggesting that distributed controllers can achieve near-optimal performance for SED systems. We develop these results via studying the structure of another type of controller, disturbance response control, which has been studied and used in recent online control literature; thus as a side result, we also prove the `quasi'-SED property of the optimal disturbance response control, which serves as a contribution on its own merit.
|
|
16:30-16:45, Paper WeC13.3 | Add to My Program |
Utilizing Feedback Channel Mechanisms for Reaching Average Consensus Over Directed Network Topologies |
|
Makridis, Evagoras | University of Cyprus |
Charalambous, Themistoklis | University of Cyprus |
Hadjicostis, Christoforos N. | University of Cyprus |
Keywords: Decentralized control, Agents-based systems, Delay systems
Abstract: In this paper, we address the problem of discrete-time average consensus, where agents (nodes) exchange information over unreliable communication links. We enhance the Robustified Ratio Consensus algorithm by embedding the Automatic Repeat ReQuest (ARQ) protocol used for error control of data transmissions, in order to allow the agents to reach asymptotic average consensus while handling time-varying delays induced by retransmissions of erroneous packets, and possible packet drops that occur due to excess of a predefined packet retransmission limit imposed by the ARQ protocol. Invoking the ARQ protocol allows nodes to: (a) exploit the incoming error-free acknowledgement feedback signals to initially acquire or later update their out-degree, (b) know whether a packet has arrived or not, and (c) determine a local upper-bound on the delays which is imposed by the retransmission limit. The analysis of our proposed algorithm, herein called the ARQ-based Ratio Consensus algorithm, relies on augmenting the network's corresponding weighted adjacency matrix, to handle time-varying (yet bounded) delays and possible packet drops. To the best of the authors' knowledge, this is the first consensus algorithm that incorporates a communication protocol for error control used in real communication systems with feedback.
|
|
16:45-17:00, Paper WeC13.4 | Add to My Program |
Representation of Linear PDEs with Spatial Integral Terms As Partial Integral Equations |
|
Shivakumar, Sachin | Arizona State University |
Das, Amritam | University of Cambridge |
Peet, Matthew M. | Arizona State University |
Keywords: Distributed parameter systems, Linear systems, Computational methods
Abstract: In this paper, we present the Partial Integral Equation (PIE) representation of linear Partial Differential Equations (PDEs) in one spatial dimension, where the PDE has spatial integral terms appearing in the dynamics and the boundary conditions. The PIE representation is obtained by performing a change of variable where every PDE state is replaced by its highest, well-defined derivative using the Fundamental Theorem of Calculus to obtain a new equation (a PIE). We show that this conversion from PDE representation to PIE representation can be written in terms of explicit maps from the PDE parameters to PIE parameters. Lastly, we present numerical examples to demonstrate the application of the PIE representation by performing stability analysis of PDEs via convex optimization methods.
|
|
17:00-17:15, Paper WeC13.5 | Add to My Program |
Event-Triggered Safe Stabilizing Boundary Control for the Stefan PDE System with Actuator Dynamics |
|
Koga, Shumon | University of California, San Diego |
Demir, Cenk | University of California, San Diego |
Krstic, Miroslav | University of California, San Diego |
Keywords: Distributed parameter systems, Constrained control, Sampled-data control
Abstract: This paper proposes an event-triggered boundary control for the safe stabilization of the Stefan PDE system with actuator dynamics. The control law is designed by applying Zero-Order Hold (ZOH) to the continuous-time safe stabilizing controller developed in our previous work. The event-triggering mechanism is then derived so that the imposed safety conditions associated with high order Control Barrier Function (CBF) are maintained and the stability of the closed-loop system is ensured. We prove that under the proposed event-triggering mechanism, the so-called ``Zeno" behavior is always avoided, by showing the existence of the minimum dwell-time between two triggering times. The stability of the closed-loop system is proven by employing PDE backstepping method and Lyapunov analysis. The efficacy of the proposed method is demonstrated in numerical simulation.
|
|
17:15-17:30, Paper WeC13.6 | Add to My Program |
Periodic Event-Triggered Boundary Control of a Class of Reaction-Diffusion PDEs |
|
Rathnayake, Bhathiya | Student (University of California San Diego) |
Diagne, Mamadou | University of California San Diego |
Keywords: Distributed parameter systems, Stability of hybrid systems, Sampled-data control
Abstract: This paper provides a novel periodic event-triggered boundary control (PETBC) strategy for a class of reaction-diffusion PDEs with Robin actuation using infinite-dimensional backstepping boundary control. We propose a method for converting a certain class of continuous-time dynamic event-triggers that require continuous monitoring to periodic event-triggers that require only periodic evaluation. We achieve this by finding an upper bound on the underlying continuous-time event-trigger between two successive periodic evaluations. We provide an explicit criterion for choosing a sampling period for periodically evaluating the event-trigger. The control input is updated only at events indicated by the periodic event-trigger and is applied in a Zero-Order-Hold fashion between two events. We prove that the closed-loop system well-posedness and global L^2-exponential convergence to zero under continuous-time event-triggered boundary control (CETBC) are preserved under the proposed PETBC. We provide simulation results to validate the theoretical claims.
|
|
WeC14 Regular Session, Aqua 311A |
Add to My Program |
Predictive Control II |
|
|
Chair: Coogan, Samuel | Georgia Institute of Technology |
Co-Chair: Kwon, Joseph | Texas A&M University |
|
16:00-16:15, Paper WeC14.1 | Add to My Program |
Nonlinear MPC for Thermal Balancing of the TCP-100 Parabolic Trough Collectors Solar Plant |
|
Gallego, Antonio J. | University of Seville |
Yebra, Luis José | CIEMAT-Plataforma Solar De Almería |
Sánchez del Pozo, Adolfo J. | University of Seville |
Escano, Juan Manuel | University of Seville |
Camacho, Eduardo F. | Univ. of Sevilla |
Keywords: Modeling, Predictive control for nonlinear systems, Energy systems
Abstract: The efficiency of the solar plants is conditioned by the control strategies applied in their operation. In this paper, an application of a Model Predictive Controller based on nonlinear models of the TCP-100 parabolic trough collector solar plant is presented as one example of the advanced control techniques that can contribute to enhance the efficiency of this type of plants. Both types of nonlinear models of the TCP-100 facility are applied for this application: lumped and distributed parameters ones. The objective of the proposed control strategy is to face a problem that arises in current commercial solar trough plants, with hundreds of loops, where in practice each of those loops get a different outlet temperature of the heat transfer fluid. These temperature differences might cause inefficiency in the operation and/or irreversible damages by overheating, if not properly controlled. The presented control strategy computes the set-points of the control valves of each of the loops to achieve a good thermal balance of the solar plant. The proposed strategy implements also a heuristic based algorithm when strong transients are affecting the field. The simulation results show that the application of the proposed control technique balances the outlet temperatures of the loops, protecting the TCP-100 facility from damages and increasing its efficiency in the operation.
|
|
16:15-16:30, Paper WeC14.2 | Add to My Program |
Model Predictive Control of the High-Pressure Side of Simple Supercritical CO2 Cycle |
|
Bone, Viv | The University of Melbourne |
Kearney, Michael | University of Queensland |
Jahn, Ingo | The University of Queensland |
Keywords: Fluid power control, Predictive control for linear systems
Abstract: Control of non-condensing non-ideal-gas power cycles is challenging because (1) their output power dynamics depend on complex system interactions, (2) turbomachinery cannot be modelled by simple analytical relations, and (3) state constraints must be respected. This article presents a control methodology for these systems, comprising a control modelling approach and model predictive control (MPC) strategy. We demonstrate this methodology on the high-pressure side of a simple supercritical CO_2 cycle power block. We develop a novel control model by using timescale-separation arguments, then implement MPC by linearizing this control model online at each sampling instant. Closed-loop simulations with a full-order gas-dynamics truth model demonstrate the good dynamic performance and constraint management of this approach. This article demonstrates the suitability of MPC for the supercritical CO2 cycle, and provides a pathway to implementing MPC for more complex cycle variants such as the recuperated and recompression cycle.
|
|
16:30-16:45, Paper WeC14.3 | Add to My Program |
Control Lyapunov-Barrier Function-Based Predictive Control Using a Deep Hybrid Model with Guarantees on Domain of Applicability |
|
Bangi, Mohammed Saad Faizan | Texas A&M University |
Kwon, Joseph | Texas A&M University |
Keywords: Grey-box modeling, Machine learning, Process Control
Abstract: The domain of applicability (DA) of a data-driven model is limited by its training data. Consequently, the DA of a hybrid model which combines a first-principles model with a data-driven model is also limited by its training data even though it has better extrapolation capabilities compared to a data-based model. Nonetheless, the domain of applicability (DA) of a hybrid model is finite and should be taken into account when developing a hybrid model-based predictive controller in order to maximize its performance. To this end, a Control Lyapunov-Barrier Function-based model predictive controller (CLBF-based MPC) is developed which utilizes a deep hybrid model (DHM), i.e., a deep neural network (DNN) combined with a first-principles model. Additionally, theoretical guarantees are provided on stability as well as on system states to stay within the DA of the DHM. The efficacy of the proposed framework is demonstrated on a continuous stirred tank reactor.
|
|
16:45-17:00, Paper WeC14.4 | Add to My Program |
A Vessel Propulsion Controller Based on Economic Model Predictive Control |
|
Papadimitrakis, Myron | University of West Attica |
Alexandridis, Alex | University of West Attica |
Keywords: Maritime control, Optimal control, Predictive control for nonlinear systems
Abstract: The importance of reducing energy expenditure in vessel propulsion is underlined by recent environmental mandates in the maritime sector. Vessel propulsion is a multi-objective problem, since the overall energy expenditure of the powertrain must be minimized, while the vessel speed must be maximized. This paper proposes an economic model predictive control (EMPC) approach, which can accommodate powertrain efficiency maps and thus evaluate candidate input trajectories in terms of energy efficiency. The proposed EMPC controller utilizes recent theoretical developments in order to guarantee stability. Simulation results are presented in comparison to a standard MPC scheme, for two different vessel sizes under environmental disturbances, and are evaluated in terms of the overall energy expenditure and the settling time to the desired vessel speed. It is demonstrated that the proposed approach achieves a reduction in energy consumption of up to 1.9% in a rough sea scenario.
|
|
17:00-17:15, Paper WeC14.5 | Add to My Program |
Safe Learning-Based Predictive Control from Efficient Reachability |
|
Cao, Michael Enqi | Georgia Institute of Technology |
Coogan, Samuel | Georgia Institute of Technology |
Keywords: Predictive control for nonlinear systems, Uncertain systems, Nonlinear systems identification
Abstract: We consider a dynamical system subject to a disturbance input that is an unknown function of the state. Given a target goal region, we propose a control scheme that encourages exploration of the state space in order to sample the dynamics and obtain an estimate of the unknown component while avoiding unsafe regions of the state space until the goal is able to be reached with high probability. By estimating the unknown component as a Gaussian process, we efficiently obtain hyperrectangular overapproximations of the reachable set for the system using the theory of mixed monotone systems, and these sets are improved over time as measurements of the dynamics are collected. Using these reachability estimates, we propose a model predictive scheme that avoids the unsafe region and ensures the system is always within reach of a conservative, guaranteed safe region that is given a priori, thus always ensuring feasibility until the goal is reachable. We demonstrate the approach on a model of an autonomous vehicle operating on an icy road and on a planar multirotor moving in an unknown wind field.
|
|
17:15-17:30, Paper WeC14.6 | Add to My Program |
Stochastic MPC with Realization-Adaptive Constraint Tightening |
|
Lee, Hotae | UC Berkeley |
Bujarbaruah, Monimoy | UC Berkeley |
Borrelli, Francesco | Unversity of California at Berkeley |
Keywords: Predictive control for linear systems, Stochastic systems, Uncertain systems
Abstract: This paper presents a stochastic model predictive controller (SMPC) for linear time-invariant systems in the presence of additive disturbances. The distribution of the disturbance is unknown and is assumed to have a bounded support. A sample-based strategy is used to compute sets of disturbance sequences necessary for robustifying the state chance constraints. These sets are constructed emph{offline} using samples of the disturbance extracted from its support. For emph{online} MPC implementation, we propose a novel reformulation strategy of the chance constraints, where the constraint tightening is computed by adjusting the offline computed sets based on the previously realized disturbances along the trajectory. The proposed MPC is recursive feasible and can lower conservatism over existing SMPC approaches at the cost of higher offline computational time. Numerical simulations demonstrate the effectiveness of the proposed approach.
|
|
WeC15 Regular Session, Aqua 311B |
Add to My Program |
Energy Systems II |
|
|
Chair: Fathy, Hosam K. | University of Maryland |
Co-Chair: Abdelghany, Muhammad Bakr | University of Sannio |
|
16:00-16:15, Paper WeC15.1 | Add to My Program |
On the Feasibility of Electrode Concentration Distribution Estimation in Single-Particle Lithium-Ion Battery Models |
|
Nozarijouybari, Zahra | University of Maryland |
Allam, Anirudh | Stanford University |
Onori, Simona | Stanford Univeristy |
Fathy, Hosam K. | University of Maryland |
Keywords: Energy systems, Estimation, Observers for nonlinear systems
Abstract: This paper analyzes the observability of a nonlinear single particle model (SPM) of a lithium-ion battery. SPMs offer an attractive middle ground between the simplicity of equivalent circuit models (ECMs) and the fidelity of higher-order electrochemical models. This can enable individual electrode concentration estimation via algorithms such as interconnected observers. Limitations exist on the feasibility and accuracy of such estimation and can be examined using metrics such as nonlinear observability. The paper presents a set of conditions under which one can estimate the spatial distribution of concentrations in a nonlinear SPM. Specifically, we examine a nonlinear SPM where terminal voltage and its resistive component are measured independently. Butler-Volmer polarization potentials are assumed to be concentration-dependent. Under these assumptions, we show that electrode spatial concentration distributions can be estimated if the Jacobian of the voltage measurements with respect to surface concentrations is full rank. This condition applies for an arbitrary finite difference discretization of solid-phase diffusion dynamics. The paper demonstrates this insight numerically, for a model of a nickel-manganese-cobalt (NMC) battery.
|
|
16:15-16:30, Paper WeC15.2 | Add to My Program |
A Novel Scheme for Randomized Customer-Sensitivity-Based Control of Thermostatic Loads for Better Integration of Intermittent Renewables |
|
Julien, Tanguy | Polytechnique Montréal |
Malhame, Roland P. | Ecole Poly. De Montreal |
Keywords: Energy systems, Markov processes, Decentralized control
Abstract: Increased electric grid penetration of intermittent renewable energy sources has reduced the controllability of the generation side, and created a need for more coordination between generation and load to maintain grid stability. Thermostatically controlled loads (TCLs) have long been seen as capable of providing a source of load flexibility. However, controlling thousands of small loads to create a better match between generation and consumption is a challenging task. Direct load control methods tend to be imprecise, invasive, and somewhat coercive, while pricing-based methods can result in social push-back and produce unreliable results. We propose and analyze a probabilistic control scheme based on a novel type of aggregator-customer contracts. The latter are tailored a priori so as to account for a customer's particular tolerance to loss of comfort versus interest in cost reduction. While through these contracts, aggregators have to obey pre-agreed constraints on their controls, the up-side for them is that they can reliably anticipate the aggregate behaviors that their pool of loads can achieve. The control is decentralized via a single so-called pressure signal which is broadcasted and acts locally, in a probabilistic manner, on thermostat set-points. We demonstrate how the probabilistic nature of the control allows achieving a continuum of smooth potentially desirable aggregate load behaviors.
|
|
16:30-16:45, Paper WeC15.3 | Add to My Program |
Reduced-Order Model of Lithium-Iron Phosphate Battery Dynamics: A POD-Galerkin Approach |
|
Fasolato, Simone | Pavia |
Allam, Anirudh | Stanford University |
Li, Xueyan | LG Energy Solutions |
Lee, Donghoon | LG Energy Solutions |
Ko, Johan | LG Energy Solutions |
Onori, Simona | Stanford Univeristy |
Keywords: Energy systems, Model/Controller reduction, Modeling
Abstract: A lithium iron phosphate battery is characterized by a plateau in its open circuit voltage, hysteresis, and path dependence due to phase transition during intercalation/deintercalation. The core-shell electrochemical modeling technique is an accurate tool to capture this phase transition behavior. However, the model requires fine-grained spatial grids to transform the governing Partial Differential Algebraic Equations into Ordinary Differential Algebraic Equations to accurately capture the battery dynamics, which results in a computationally expensive system intractable for design of realtime battery management system algorithms. To that end, this paper presents a reduced-order modeling paradigm to transform the high-dimensional model into a low-dimensional yet accurate core-shell electrochemical model. The Proper Orthogonal Decomposition-Galerkin method is used to reduce the state variables from 169 to a meagre 9 with negligible loss in fidelity. The reduced-order model’s accuracy is validated against both experimental data and high-dimensional model for discharging and charging load profiles of different C-rates. Promising results with one-third the computational burden and a voltage RMS error of less than 0.6% are achieved.
|
|
16:45-17:00, Paper WeC15.4 | Add to My Program |
A Coordinated Model Predictive Control of Grid-Connected Energy Storage Systems |
|
Abdelghany, Muhammad Bakr | University of Sannio |
Al-Durra, Ahmed | Khalifa University |
Keywords: Predictive control for nonlinear systems, Energy systems, Optimization
Abstract: The environmental goals set out in the 2015 Paris Agreement on climate change lead to the design and the definition of energy management strategies based on renewable energy sources (RESs). In this regard, the integration of energy storage systems (ESSs) into the microgrid requires the development of sophisticated control systems for their management and the reduction of their degradation. Moreover, external agents, e.g., battery/fuel cell electric vehicles, exchange energy with microgrids using electric and hydrogen markets. This paper presents a novel energy management strategy to control a microgrid which includes RESs paired with a battery-ESS and a hydrogen-ESS, and consumer loads. The strategy, based on the model predictive control (MPC) framework, takes into account ESSs' economical and operating costs, degradation issues, and physical and dynamical system constraints. Numerical simulations show the effectiveness of the strategy, which successfully manages the plant by fulfilling constraints and energy requests while reducing device costs and increasing battery life.
|
|
17:00-17:15, Paper WeC15.5 | Add to My Program |
An Online Feedback Optimization Approach to Voltage Regulation in Inverter-Based Power Distribution Networks |
|
Dominguez-Garcia, Alejandro D. | University of Illinois at Urbana-Champaign |
Zholbaryssov, Madi | Typhoon HIL |
Amuda, Temitope | University of Illinois at Urbana Champaign |
Ajala, Olaoluwapo | University of Illinois at Urbana-Champaign |
Keywords: Power systems, Smart grid, Energy systems
Abstract: We address the problem of controlling the reactive power setpoints of a set of distributed energy resources (DERs) in a power distribution network so as to mitigate the impact of variability in uncontrolled power injections associated with, e.g., renewable-based generation. We formulate the control design problem as a stochastic optimization problem, which we solve online using a modified version of a projected stochastic gradient descent (PSGD) algorithm. The proposed PSGD-based algorithm utilizes sensitivities of changes in bus voltage magnitudes to changes in DER reactive power setpoints; such sensitivities are learned online via a recursive least squares estimator (rLSE). To ensure proper operation of the rLSE, the sequence of incremental changes in DER reactive power setpoints needs to be persistently exciting, which is guaranteed by a mechanism built into the controller. We analyze the stability of the closed-loop system and showcase controller performance via numerical simulations on the IEEE 123-bus distribution test feeder.
|
|
17:15-17:30, Paper WeC15.6 | Add to My Program |
Robust Decentralized Secondary Control Scheme for Inverter-Based Power Networks |
|
Bhela, Siddharth | Siemens Technology |
Banerjee, Abhishek | Siemens Technology |
Münz, Ulrich | Siemens AG |
Bamberger, Joachim | Siemens AG |
Keywords: Power systems, Power electronics, Smart grid
Abstract: Inverter-dominated microgrids are quickly becoming a key building block of future power systems. They rely on centralized controllers that can provide reliability and resiliency in extreme events. Nonetheless, communication failures due to cyber-physical attacks or natural disasters can make autonomous operation of islanded microgrids challenging. This paper examines a unified decentralized secondary control scheme that is robust to inverter clock synchronization errors and can be seamlessly applied to grid-following or grid-forming control architectures. The proposed scheme overcomes the well-known stability problem that arises from parallel operation of local integral controllers. Theoretical guarantees for stability are provided along with criteria to appropriately tune the secondary control gains to achieve good frequency regulation performance while ensuring fair power sharing. The efficacy of our approach is demonstrated through simulations on a 5-bus microgrid with four grid-forming inverters.
|
|
WeC16 Regular Session, Aqua 313 |
Add to My Program |
Machine Learning II |
|
|
Chair: Xu, Xiangru | University of Wisconsin-Madison |
Co-Chair: Tron, Roberto | Boston University |
|
16:00-16:15, Paper WeC16.1 | Add to My Program |
Tailored Output Layers of Neural Networks for Satisfaction of State Constraints in Nonlinear Control Systems |
|
Markolf, Lukas | University of Kassel |
Stursberg, Olaf | University of Kassel |
Keywords: Neural networks, Constrained control, Learning
Abstract: This work considers the synthesis of state feedback controllers established as deep artificial feed-forward neural networks for the control of discrete-time nonlinear but input-affine systems. The idea is to design output layers of particular structure to guarantee the satisfaction of state constraints in form of control-invariant ellipsoids. Since an analytical expression can be derived for the resulting neural network controller, the latter can be stored and evaluated efficiently. Moreover, the proposed output layer guarantees the satisfaction of the considered state constraints for each specification of the parameter vector. Numerical examples are provided for illustration and evaluation of the approach, in which the approximation of a nonlinear model predictive control law is considered as application.
|
|
16:15-16:30, Paper WeC16.2 | Add to My Program |
Application of eXplainable AI and Causal Inference Methods to Estimation Algorithms in Networks of Dynamic Systems |
|
Biparva, Darya | University of Minnesota |
Materassi, Donatello | University of Minnesota |
Keywords: Neural networks, Identification, Estimation
Abstract: While continuous progress in the area of machine learning is producing algorithms capable of achieving better and better decision and predictive performance, the way such algorithms operate is also becoming more and more inscrutable. When an increasing amount of decisions is being ceded to often inexplicable algorithms which are not susceptible to any form of human supervision or scrutiny, it is just natural to start raising doubts about their fairness, soundness, and reliability. This has motivated a growing need for tools capable of disentangling and explaining the mechanisms behind AI based decisions, creating a new field of research referred to as eXplainable AI (XAI). Given the significant impact that machine learning is having also on the area of estimation and control, this article advances the idea of borrowing and adapting methodologies from the area of XAI and apply them to estimation and control algorithms involving networks of dynamic processes. Specifically, we translate the methodology known as Local Interpretable Model-Agnostic Explanations (LIME) in order to explain the mechanisms behind a black-box estimation algorithm processing time-series. Furthermore, we find that LIME can be extended using notions of causal inference to detect cause-effect relations among the input features that the estimation algorithm takes as inputs. This causal inference procedure provides LIME with additional explanatory power.
|
|
16:30-16:45, Paper WeC16.3 | Add to My Program |
Safety Certification for Stochastic Systems Via Neural Barrier Functions |
|
Mathiesen, Frederik Baymler | Delft University of Technology |
Calvert, Simeon Craig | Delft University of Technology |
Laurenti, Luca | TU Delft |
|