Real-time Accident Anticipation for Autonomous Driving Through Monocular Depth-Enhanced 3D Modeling
Haicheng Liao, Yongkang Li, Chengyue Wang, Songning Lai, Zhenning Li, Zilin Bian, Jaeyoung Lee, Zhiyong Cui, Guohui Zhang, Chengzhong Xu
TL;DR
AccNet addresses real-time accident anticipation in autonomous driving by exploiting monocular depth cues to build 3D representations from dashcam videos. It introduces a 3D Collision Module with a depth-informed GNN topology and a Binary Adaptive Loss for Early Anticipation (BA-LEA), coupled with a Smooth Module and multitask learning to improve early, accurate predictions. Across DAD, CCD, A3D, and DADA-2000, AccNet achieves superior AP and mean Time-To-Accident (mTTA) compared to SOTA baselines, indicating strong practical benefits for ADAS and autonomous driving safety. The work also demonstrates robust performance through extensive ablations and provides insights into depth-enabled 3D scene understanding for crash anticipation.
Abstract
The primary goal of traffic accident anticipation is to foresee potential accidents in real time using dashcam videos, a task that is pivotal for enhancing the safety and reliability of autonomous driving technologies. In this study, we introduce an innovative framework, AccNet, which significantly advances the prediction capabilities beyond the current state-of-the-art (SOTA) 2D-based methods by incorporating monocular depth cues for sophisticated 3D scene modeling. Addressing the prevalent challenge of skewed data distribution in traffic accident datasets, we propose the Binary Adaptive Loss for Early Anticipation (BA-LEA). This novel loss function, together with a multi-task learning strategy, shifts the focus of the predictive model towards the critical moments preceding an accident. {We rigorously evaluate the performance of our framework on three benchmark datasets--Dashcam Accident Dataset (DAD), Car Crash Dataset (CCD), and AnAn Accident Detection (A3D), and DADA-2000 Dataset--demonstrating its superior predictive accuracy through key metrics such as Average Precision (AP) and mean Time-To-Accident (mTTA).
