MAGICS: Adversarial RL with Minimax Actors Guided by Implicit Critic Stackelberg for Convergent Neural Synthesis of Robot Safety
Justin Wang, Haimin Hu, Duy Phuong Nguyen, Jaime Fernández Fisac
TL;DR
The paper addresses the lack of convergence guarantees in neural safety synthesis for high‑dimensional robots by introducing MAGICS, a three‑player Stackelberg–minimax RL algorithm with an implicit critic. By casting training as a Stackelberg game and applying a discounted Isaacs formulation for reach–avoid safety, the authors prove local convergence of the learning dynamics to a differential Stackelberg equilibrium and extend these results to -Safety for high‑dimensional systems. Empirically, MAGICS outperforms state‑of‑the‑art neural safety methods in OpenAI Gym tasks and a 36‑D quadruped hardware experiment, demonstrating robust safety under adversarial disturbances. This work provides a scalable, provably convergent framework for neural safety synthesis with practical impact on safe, robust robotic control.
Abstract
While robust optimal control theory provides a rigorous framework to compute robot control policies that are provably safe, it struggles to scale to high-dimensional problems, leading to increased use of deep learning for tractable synthesis of robot safety. Unfortunately, existing neural safety synthesis methods often lack convergence guarantees and solution interpretability. In this paper, we present Minimax Actors Guided by Implicit Critic Stackelberg (MAGICS), a novel adversarial reinforcement learning (RL) algorithm that guarantees local convergence to a minimax equilibrium solution. We then build on this approach to provide local convergence guarantees for a general deep RL-based robot safety synthesis algorithm. Through both simulation studies on OpenAI Gym environments and hardware experiments with a 36-dimensional quadruped robot, we show that MAGICS can yield robust control policies outperforming the state-of-the-art neural safety synthesis methods.
