CaDRE: Controllable and Diverse Generation of Safety-Critical Driving Scenarios using Real-World Trajectories
Peide Huang, Wenhao Ding, Benjamin Stoler, Jonathan Francis, Bingqing Chen, Ding Zhao
TL;DR
This paper tackles the problem of generating safety-critical driving scenarios for autonomous vehicles by casting the task as a Quality-Diversity optimization problem. It introduces CaDRE, a framework that perturbs real-world trajectories within bounded limits, preserving realism while jointly optimizing for high safety-criticality and diverse scenario behavior through a grid-archived, black-box optimization process. The authors further enhance exploration with an Occupancy-Aware Restart mechanism and demonstrate superior sample efficiency and diversity across three real-world scenarios from nuPlan compared to RL- and sampling-based baselines. The work enables controllable retrieval of scenarios by desired measure values, offering a practical tool for robust AV evaluation and training, with potential extensions to multi-vehicle perturbations and road-context integration.
Abstract
Simulation is an indispensable tool in the development and testing of autonomous vehicles (AVs), offering an efficient and safe alternative to road testing. An outstanding challenge with simulation-based testing is the generation of safety-critical scenarios, which are essential to ensure that AVs can handle rare but potentially fatal situations. This paper addresses this challenge by introducing a novel framework, CaDRE, to generate realistic, diverse, and controllable safety-critical scenarios. Our approach optimizes for both the quality and diversity of scenarios by employing a unique formulation and algorithm that integrates real-world scenarios, domain knowledge, and black-box optimization. We validate the effectiveness of our framework through extensive testing in three representative types of traffic scenarios. The results demonstrate superior performance in generating diverse and high-quality scenarios with greater sample efficiency than existing reinforcement learning (RL) and sampling-based methods.
