Exact Imposition of Safety Boundary Conditions in Neural Reachable Tubes
Aditya Singh, Zeyuan Feng, Somil Bansal
TL;DR
This work tackles safety verification for high-dimensional autonomous systems by improving Hamilton-Jacobi reachability with ExactBC, a method that exactly enforces boundary conditions in neural approximations of the safety value function. By reparameterizing $V_\theta(x,t)$ as $l(x) + (T-t) O_\theta(x,t)$, ExactBC guarantees $V_\theta(x,T)=l(x)$ and trains using a single PDE loss, removing the need for boundary loss weighting. Across four challenging tasks with varying dynamics and boundary complexity, ExactBC consistently yields more accurate value functions and larger verified safe volumes than prior DeepReach or residual-learning variants, including difficult high-dimensional scenarios like autonomous rocket landing and multi-aircraft avoidance. While offering significant accuracy gains, the method remains limited by non-differentiable boundaries and does not reduce computational time, pointing to directions for handling non-differentiability and improving efficiency in future work.
Abstract
Hamilton-Jacobi (HJ) reachability analysis is a widely adopted verification tool to provide safety and performance guarantees for autonomous systems. However, it involves solving a partial differential equation (PDE) to compute a safety value function, whose computational and memory complexity scales exponentially with the state dimension, making its direct application to large-scale systems intractable. To overcome these challenges, DeepReach, a recently proposed learning-based approach, approximates high-dimensional reachable tubes using neural networks (NNs). While shown to be effective, the accuracy of the learned solution decreases with system complexity. One of the reasons for this degradation is a soft imposition of safety constraints during the learning process, which corresponds to the boundary conditions of the PDE, resulting in inaccurate value functions. In this work, we propose ExactBC, a variant of DeepReach that imposes safety constraints exactly during the learning process by restructuring the overall value function as a weighted sum of the boundary condition and the NN output. Moreover, the proposed variant no longer needs a boundary loss term during the training process, thus eliminating the need to balance different loss terms. We demonstrate the efficacy of the proposed approach in significantly improving the accuracy of the learned value function for four challenging reachability tasks: a rimless wheel system with state resets, collision avoidance in a cluttered environment, autonomous rocket landing, and multi-aircraft collision avoidance.
