Improved Exploration for Safety-Embedded Differential Dynamic Programming Using Tolerant Barrier States
Joshua E. Kuperman, Hassan Almubarak, Augustinos D. Saravanos, Evangelos A. Theodorou
TL;DR
The paper tackles safe trajectory optimization under state constraints by marrying barrier-based safety embedding with tolerant exploration. It introduces Tolerant DBaS (T-DBaS), a barrier construction $ ilde{B}$ that combines sigmoid and softplus terms to allow temporary constraint violations, while preserving informative gradients inside unsafe regions. Embedded into Differential Dynamic Programming (DDP) as T-DBaS-DDP, the approach retains convergence properties and shows improved exploration in non-convex environments, validated on a differential-drive robot, a quadrotor, and multi-robot hardware experiments, with competitive performance versus Augmented-Lagrangian DDP. The results indicate that T-DBaS achieves faster convergence and safer goal-reaching in challenging obstacle configurations, suggesting strong potential for online MPC and learning-based tuning of barrier parameters in uncertainty-rich settings.
Abstract
In this paper, we introduce Tolerant Discrete Barrier States (T-DBaS), a novel safety-embedding technique for trajectory optimization with enhanced exploratory capabilities. The proposed approach generalizes the standard discrete barrier state (DBaS) method by accommodating temporary constraint violation during the optimization process while still approximating its safety guarantees. Consequently, the proposed approach eliminates the DBaS's safe nominal trajectories assumption, while enhancing its exploration effectiveness for escaping local minima. Towards applying T-DBaS to safety-critical autonomous robotics, we combine it with Differential Dynamic Programming (DDP), leading to the proposed safe trajectory optimization method T-DBaS-DDP, which inherits the convergence and scalability properties of the solver. The effectiveness of the T-DBaS algorithm is verified on differential drive robot and quadrotor simulations. In addition, we compare against the classical DBaS-DDP as well as Augmented-Lagrangian DDP (AL-DDP) in extensive numerical comparisons that demonstrate the proposed method's competitive advantages. Finally, the applicability of the proposed approach is verified through hardware experiments on the Georgia Tech Robotarium platform.
