Generalizing Safety Beyond Collision-Avoidance via Latent-Space Reachability Analysis

Kensuke Nakamura; Lasse Peters; Andrea Bajcsy

TL;DR

This work generalizes robotic safety beyond collision avoidance by embedding Hamilton-Jacobi reachability in the latent space of a generative world model. By learning a latent failure classifier and performing reachability analysis in imagination, the approach yields a policy-agnostic safety filter that can override unsafe actions using high-dimensional observations such as RGB images. The method is validated in simulation and hardware experiments, which demonstrate near-privileged safety performance in vision-based tasks and the ability to prevent hard-to-model failures such as spilling the contents of a bag. Limitations include dependence on world-model quality and the lack of formal guarantees, which motivate future work on uncertainty quantification and broader constraint handling.

Abstract

Hamilton-Jacobi (HJ) reachability is a rigorous mathematical framework that enables robots to simultaneously detect unsafe states and generate actions that prevent future failures. In theory, HJ reachability can synthesize safe controllers for nonlinear systems and nonconvex constraints; in practice, it has been limited to hand-engineered collision-avoidance constraints modeled via low-dimensional state-space representations and first-principles dynamics. In this work, our goal is to generalize safe robot controllers to prevent failures that are hard, if not impossible, to write down by hand but can be intuitively identified from high-dimensional observations: for example, spilling the contents of a bag. We propose Latent Safety Filters, a latent-space generalization of HJ reachability that tractably operates directly on raw observation data (e.g., RGB images). By performing safety analysis in the latent embedding space of a generative world model, the method automatically computes safety-preserving actions without explicit recovery demonstrations. Our method leverages diverse robot observation-action data of varying quality (including successes, random exploration, and unsafe demonstrations) to learn a world model. Constraint specification is then transformed into a classification problem in the latent space of the learned world model. In simulation and hardware experiments, we compute an approximation of Latent Safety Filters to safeguard arbitrary policies (from imitation-learned policies to direct teleoperation) against complex safety hazards, such as a Franka Research 3 manipulator spilling the contents of a bag or toppling cluttered objects.
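
To make the filtering idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hand-written two-dimensional "latent" state (position, velocity) in place of a learned world model, a hand-written margin function in place of the learned latent failure classifier, and a brute-force finite-horizon search in place of the approximate reachability computation. All names (`world_model_step`, `safety_margin`, `safety_value`, `safety_filter`, `rollout`) and all constants are hypothetical.

```python
import numpy as np

# All names and constants below are hypothetical stand-ins, not the paper's code.
ACTIONS = (-1.0, 0.0, 1.0)   # candidate actions (toy accelerations)
DT = 0.25                    # toy time step
HORIZON = 5                  # number of imagined steps used for the safety analysis
EPSILON = 0.05               # override threshold on the safety value


def world_model_step(z, a):
    """Stand-in for learned latent dynamics; z = (position, velocity)."""
    pos, vel = z
    return np.array([pos + DT * vel, vel + DT * a])


def safety_margin(z):
    """Stand-in for the latent failure classifier.

    Positive means safe; non-positive means failure (here: position >= 1).
    """
    return 1.0 - z[0]


def safety_value(z, horizon):
    """Best achievable worst-case margin over `horizon` imagined steps."""
    margin = safety_margin(z)
    if horizon == 0:
        return margin
    best_next = max(
        safety_value(world_model_step(z, a), horizon - 1) for a in ACTIONS
    )
    return min(margin, best_next)


def safety_filter(z, nominal_action):
    """Keep the nominal action unless its imagined outcome is unsafe."""
    def q(a):
        return safety_value(world_model_step(z, a), HORIZON)

    if q(nominal_action) > EPSILON:
        return nominal_action, False
    # Otherwise fall back to the candidate with the highest safety value.
    return max(ACTIONS, key=q), True


def rollout(use_filter, steps=40):
    """Run a nominal policy that always accelerates toward the failure set."""
    z, positions, overrides = np.array([0.0, 0.0]), [], 0
    for _ in range(steps):
        if use_filter:
            action, overridden = safety_filter(z, 1.0)
        else:
            action, overridden = 1.0, False
        overrides += overridden
        z = world_model_step(z, action)
        positions.append(z[0])
    return max(positions), overrides
```

Comparing `rollout(use_filter=True)` with `rollout(use_filter=False)` shows the intended behavior in this toy setting: the unfiltered policy drives the position into the failure set, while the filtered one is overridden early enough to brake. The filter never inspects how the nominal action was produced, which is what makes it policy-agnostic. The short horizon only works here because the toy system can stop within a few steps.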
