DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving
Tianqi Wang, Sukmin Kim, Wenxuan Ji, Enze Xie, Chongjian Ge, Junsong Chen, Zhenguo Li, Ping Luo
TL;DR
This paper introduces DeepAccident, the first large-scale V2X autonomous driving dataset that embeds safety-critical accident scenarios generated in a realistic simulator, enabling end-to-end motion and accident prediction alongside perception tasks.It provides four vehicles plus infrastructure with multi-view cameras and LiDAR, totaling 285k annotated samples and 57k V2X frames, and defines a new Accident Prediction Accuracy (APA) metric to quantify predictive safety performance.A novel V2X model, V2XFormer, leveraging Swin Transformer-based BEV features and advanced fusion (CoBEVT), demonstrates superior motion, detection, and accident-prediction performance compared with single-vehicle baselines.The dataset supports robust evaluation under varying accident visibility, prediction horizons, and communication delays, and demonstrates beneficial sim-to-real transfer when finetuned on nuScenes.
Abstract
Safety is the primary priority of autonomous driving. Nevertheless, no published dataset currently supports the direct and explainable safety evaluation for autonomous driving. In this work, we propose DeepAccident, a large-scale dataset generated via a realistic simulator containing diverse accident scenarios that frequently occur in real-world driving. The proposed DeepAccident dataset includes 57K annotated frames and 285K annotated samples, approximately 7 times more than the large-scale nuScenes dataset with 40k annotated samples. In addition, we propose a new task, end-to-end motion and accident prediction, which can be used to directly evaluate the accident prediction ability for different autonomous driving algorithms. Furthermore, for each scenario, we set four vehicles along with one infrastructure to record data, thus providing diverse viewpoints for accident scenarios and enabling V2X (vehicle-to-everything) research on perception and prediction tasks. Finally, we present a baseline V2X model named V2XFormer that demonstrates superior performance for motion and accident prediction and 3D object detection compared to the single-vehicle model.
