Guaranteed Reach-Avoid for Black-Box Systems through Narrow Gaps via Neural Network Reachability
Long Kiu Chung, Wonsuhk Jung, Srivatsank Pullabhotla, Parth Shinde, Yadu Sunil, Saihari Kota, Luis Felipe Wolf Batista, Cédric Pradalier, Shreyas Kousik
TL;DR
NeuralPARC addresses the problem of providing reach-avoid guarantees for black-box mobile robots in narrow-gap scenarios. It learns a trajectory model with ReLU neural networks from data and uses Reachable Polyhedral Marching ($\text{RPM}$) to certify all affine dynamics implied by the network, while accounting for modeling error via explicit bounds. Online, NeuralPARC computes aBackward Reachable Set ($\text{BRAS}$) and aBackward Avoid Set ($\text{BAS}$) to guarantee that the system reaches a goal while avoiding obstacles, even under disturbances and with deep RL controllers. Hardware and simulation experiments on extreme drift parking and a disturbed ASV demonstrate improved safety guarantees and practical viability, validating the approach against the prior PARC method.
Abstract
In the classical reach-avoid problem, autonomous mobile robots are tasked to reach a goal while avoiding obstacles. However, it is difficult to provide guarantees on the robot's performance when the obstacles form a narrow gap and the robot is a black-box (i.e. the dynamics are not known analytically, but interacting with the system is cheap). To address this challenge, this paper presents NeuralPARC. The method extends the authors' prior Piecewise Affine Reach-avoid Computation (PARC) method to systems modeled by rectified linear unit (ReLU) neural networks, which are trained to represent parameterized trajectory data demonstrated by the robot. NeuralPARC computes the reachable set of the network while accounting for modeling error, and returns a set of states and parameters with which the black-box system is guaranteed to reach the goal and avoid obstacles. NeuralPARC is shown to outperform PARC, generating provably-safe extreme vehicle drift parking maneuvers in simulations and in real life on a model car, as well as enabling safety on an autonomous surface vehicle (ASV) subjected to large disturbances and controlled by a deep reinforcement learning (RL) policy.
