Linear Supervision for Nonlinear, High-Dimensional Neural Control and Differential Games
William Sharpless, Zeyuan Feng, Somil Bansal, Sylvia Herbert
TL;DR
This work tackles the curse of dimensionality in differential games by marrying linear $HJ$-$PDE$ solutions with deep learning to learn high-dimensional value functions $V$. It introduces two semi-supervision programs: decayed linear semi-supervision (LSS-D) and an augmented nonlinear spectrum with a parameterized $V_\lambda$, both designed to leverage a linear supervisor $V_\ell$ during training. The Hopf-based linear solution provides fast, global supervision, while the augmented approach enables a smooth interpolation to the nonlinear true solution, yielding improvements in speed and accuracy on a 50-D differential game and a 10-D quadrotor collision-avoidance task. The results show substantial gains in IOU and MSE for the high-dimensional benchmark and improved safety-volume metrics for the quadrotor scenario, highlighting a practical pathway to scalable, safer autonomous decision-making in complex multi-agent settings.
Abstract
As the dimension of a system increases, traditional methods for control and differential games rapidly become intractable, making the design of safe autonomous agents challenging in complex or team settings. Deep-learning approaches avoid discretization and yield numerous successes in robotics and autonomy, but at a higher dimensional limit, accuracy falls as sampling becomes less efficient. We propose using rapidly generated linear solutions to the partial differential equation (PDE) arising in the problem to accelerate and improve learned value functions for guidance in high-dimensional, nonlinear problems. We define two programs that combine supervision of the linear solution with a standard PDE loss. We demonstrate that these programs offer improvements in speed and accuracy in both a 50-D differential game problem and a 10-D quadrotor control problem.
