Solving Optimal Control Problems of Rigid-Body Dynamics with Collisions Using the Hybrid Minimum Principle
Wei Hu, Jihao Long, Yaohua Zang, Weinan E, Jiequn Han
TL;DR
This work tackles open-loop optimal control for rigid-body systems with autonomous state jumps caused by collisions, modeling these jumps through a collision detector $\psi(m{x})$ and a jump map $m{g}$. It extends Pontryagin’s maximum principle to the hybrid setting (HMP) and couples it with a relaxed method of successive approximations (MSA) to compute controls, using forward-Euler integration and careful handling of discontinuities such as collision-time estimation and jump propagation. The authors demonstrate linear convergence with respect to iteration steps and first-order accuracy in time discretization on disc-collision problems, and show that their HMP-based method outperforms direct gradient methods and deep reinforcement learning in accuracy and efficiency. The approach is robust to multiple collisions, supports nonquadratic running costs, and provides analytic benchmarks for benchmarking and future extensions in more complex hybrid robotic systems.
Abstract
Collisions are common in many dynamical systems with real applications. They can be formulated as hybrid dynamical systems with discontinuities automatically triggered when states transverse certain manifolds. We present an algorithm for the optimal control problem of such hybrid dynamical systems based on solving the equations derived from the hybrid minimum principle (HMP). The algorithm is an iterative scheme following the spirit of the method of successive approximations (MSA), and it is robust to undesired collisions observed in the initial guesses. We propose several techniques to address the additional numerical challenges introduced by the presence of discontinuities. The algorithm is tested on disc collision problems whose optimal solutions exhibit one or multiple collisions. Linear convergence in terms of iteration steps and asymptotic first-order accuracy in terms of time discretization are observed when the algorithm is implemented with the forward-Euler scheme. The numerical results demonstrate that the proposed algorithm has better accuracy and convergence than direct methods based on gradient descent. Furthermore, the algorithm is also simpler, more accurate, and more stable than a deep reinforcement learning method.
