Enhancing Autonomous Driving Safety with Collision Scenario Integration
Zi Wang, Shiyi Lan, Xinglong Sun, Nadine Chang, Zhenxin Li, Zhiding Yu, Jose M. Alvarez
TL;DR
This work tackles autonomous driving safety under hazardous, collision-prone scenarios by addressing collision-data scarcity and the shortcomings of imitation-learning-based training. It introduces CollisionGen, a scalable pipeline that generates realistic collision scenarios from natural language prompts and a map-conditional generative model, producing Collision2k, a diverse dataset of approximately 2,000 collision cases. Complementing this, SafeFusion trains neural planners with both regular and collision data using a fixed trajectory vocabulary and Multi-target Hydra-Distillation, removing imitation-learning reliance and improving safety metrics by substantial margins (e.g., up to 63.5% in safety and 56% overall). The approach yields strong improvements in collision avoidance while preserving performance on regular driving, providing a scalable, data-efficient path toward safer autonomous driving systems.
Abstract
Autonomous vehicle safety is crucial for the successful deployment of self-driving cars. However, most existing planning methods rely heavily on imitation learning, which limits their ability to leverage collision data effectively. Moreover, collecting collision or near-collision data is inherently challenging, as it involves risks and raises ethical and practical concerns. In this paper, we propose SafeFusion, a training framework to learn from collision data. Instead of over-relying on imitation learning, SafeFusion integrates safety-oriented metrics during training to enable collision avoidance learning. In addition, to address the scarcity of collision data, we propose CollisionGen, a scalable data generation pipeline to generate diverse, high-quality scenarios using natural language prompts, generative models, and rule-based filtering. Experimental results show that our approach improves planning performance in collision-prone scenarios by 56\% over previous state-of-the-art planners while maintaining effectiveness in regular driving situations. Our work provides a scalable and effective solution for advancing the safety of autonomous driving systems.
