Diffusion-Based Failure Sampling for Evaluating Safety-Critical Autonomous Systems
Harrison Delecki, Marc R. Schlichting, Mansur Arief, Anthony Corso, Marcell Vazquez-Chanlatte, Mykel J. Kochenderfer
TL;DR
The paper tackles the challenge of validating safety-critical autonomous systems in high-dimensional settings where failures are rare and multimodal. It introduces Diffusion-based Failure Sampling (DiFS), a conditional denoising diffusion model that learns a disturbance distribution conditioned on robustness and uses an adaptive training loop to increasingly bias samples toward failure. DiFS demonstrates superior fidelity, diversity, and sample efficiency compared with CEM and AST across five validation problems up to 1200 dimensions, including a ground collision avoidance scenario for an F-16. This approach enables more reliable exploration of failure modes, potentially improving safety assurance and robust planning in complex autonomous systems.
Abstract
Validating safety-critical autonomous systems in high-dimensional domains such as robotics presents a significant challenge. Existing black-box approaches based on Markov chain Monte Carlo may require an enormous number of samples, while methods based on importance sampling often rely on simple parametric families that may struggle to represent the distribution over failures. We propose to sample the distribution over failures using a conditional denoising diffusion model, which has shown success in complex high-dimensional problems such as robotic task planning. We iteratively train a diffusion model to produce state trajectories closer to failure. We demonstrate the effectiveness of our approach on high-dimensional robotic validation tasks, improving sample efficiency and mode coverage compared to existing black-box techniques.
