CADD: A Chinese Traffic Accident Dataset for Statute-Based Liability Attribution
Yunfei Shen, Zhongcheng Wu
TL;DR
This work tackles the need for legally grounded liability attribution in autonomous driving by introducing the Chinese Accident Duty-determination Dataset (CADD), the first benchmark that links observable driving behaviors to liability judgments and specific Chinese traffic-law statutes via a Behavior–Liability–Statute pipeline. CADD comprises 792 real-world dashcam videos with hierarchical, symmetric annotations, definitive liability judgments, and explicit statute mappings, enabling evaluation of automated liability attribution, causal reconstruction, and explainable legal reasoning. The authors provide a comprehensive data collection and curation workflow, an annotation framework, quality-control procedures, and rich data analyses to demonstrate the dataset’s potential for perception-to-legal reasoning. The work paves the way for legally compliant and accountable autonomous systems by enabling temporally grounded, explainable, and statute-aware decision-making in traffic incidents.
Abstract
As autonomous driving technology advances, the critical challenge evolves beyond collision avoidance to the \textbf{adjudication of liability} when accidents occur. Existing datasets, focused on detection and localization, lack the annotations required for this legal reasoning. To bridge this gap, we introduce the \textbf{C}hinese \textbf{A}ccident \textbf{D}uty-determination \textbf{D}ataset (\textbf{CADD}), the first benchmark for statute-based liability attribution. CADD contains 792 real-world driving recorder videos, each annotated within a novel \textbf{``Behavior--Liability--Statute''} pipeline. This framework provides \textbf{granular, symmetric behavior annotations}, clear responsibility assignments, and, uniquely, links each case to the specific \textbf{Chinese traffic law statute} violated. We demonstrate the utility of CADD through detailed analysis and establish benchmarks for liability prediction and explainable decision-making. By directly connecting perceptual data to legal consequences, CADD provides a foundational resource for developing accountable and legally-grounded autonomous systems.
