Discovering Closed-Loop Failures of Vision-Based Controllers via Reachability Analysis
Kaustav Chakraborty, Somil Bansal
TL;DR
The paper tackles safety verification for vision-based controllers in robotics by blending simulation-based sampling with Hamilton-Jacobi reachability to compute an approximate Backward Reachable Tube (BRT). This hybrid approach enables tractable identification of unsafe visual inputs without requiring an explicit analytic sensor model, addressing high-dimensional perception challenges. It contributes a general framework and demonstrates its effectiveness on two case studies—TaxiNet (3D) and indoor navigation (5D)—revealing concrete failure modes such as runway-marking confusion and obstacle occlusions. The work advances systematic closed-loop safety analysis for perception-driven controllers and suggests practical pathways to improve robustness via targeted data collection and training.
Abstract
Machine learning driven image-based controllers allow robotic systems to take intelligent actions based on the visual feedback from their environment. Understanding when these controllers might lead to system safety violations is important for their integration in safety-critical applications and engineering corrective safety measures for the system. Existing methods leverage simulation-based testing (or falsification) to find the failures of vision-based controllers, i.e., the visual inputs that lead to closed-loop safety violations. However, these techniques do not scale well to the scenarios involving high-dimensional and complex visual inputs, such as RGB images. In this work, we cast the problem of finding closed-loop vision failures as a Hamilton-Jacobi (HJ) reachability problem. Our approach blends simulation-based analysis with HJ reachability methods to compute an approximation of the backward reachable tube (BRT) of the system, i.e., the set of unsafe states for the system under vision-based controllers. Utilizing the BRT, we can tractably and systematically find the system states and corresponding visual inputs that lead to closed-loop failures. These visual inputs can be subsequently analyzed to find the input characteristics that might have caused the failure. Besides its scalability to high-dimensional visual inputs, an explicit computation of BRT allows the proposed approach to capture non-trivial system failures that are difficult to expose via random simulations. We demonstrate our framework on two case studies involving an RGB image-based neural network controller for (a) autonomous indoor navigation, and (b) autonomous aircraft taxiing.
