Graph Neural Networks for Road Safety Modeling: Datasets and Evaluations for Accident Analysis

Abhinav Nippani; Dongyue Li; Haotian Ju; Haris N. Koutsopoulos; Hongyang R. Zhang

TL;DR

The paper addresses predicting road-traffic accidents on road networks by constructing a large-scale, unified dataset of 9 million accident records across eight US states and integrating road graphs, traffic volume, and weather. It evaluates graph neural networks, notably GraphSAGE, for edge-level accident prediction using multitask learning across states and transfer learning to incorporate annual traffic volume as an auxiliary task. The results show $\mathrm{MAE} \approx 0.3$ and $\mathrm{AUROC} \approx 0.87$ on average, with multitask learning and volume transfer providing additional gains, and reveal that road-network structure is highly informative for risk assessment. The work also provides a public ML4RoadSafety package to facilitate reuse and cross-state analyses, highlighting practical implications for policy and safety interventions.

Abstract

We consider the problem of traffic accident analysis on a road network based on road network connections and traffic volume. Previous works have designed various deep-learning methods using historical records to predict traffic accident occurrences. However, there is a lack of consensus on how accurate existing methods are, and a fundamental issue is the lack of public accident datasets for comprehensive evaluations. This paper constructs a large-scale, unified dataset of traffic accident records from official reports of various states in the US, totaling 9 million records, accompanied by road networks and traffic volume reports. Using this new dataset, we evaluate existing deep-learning methods for predicting the occurrence of accidents on road networks. Our main finding is that graph neural networks such as GraphSAGE can accurately predict the number of accidents on roads with less than 22% mean absolute error (relative to the actual count) and whether an accident will occur or not with over 87% AUROC, averaged over states. We achieve these results by using multitask learning to account for cross-state variabilities (e.g., availability of accident labels) and transfer learning to combine traffic volume with accident prediction. Ablation studies highlight the importance of road graph-structural features, amongst other features. Lastly, we discuss the implications of the analysis and develop a package for easily using our new dataset.
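The two headline metrics in the abstract can be illustrated with a minimal Python sketch. It is not the authors' evaluation code: the function names `relative_mae` and `auroc` are hypothetical, the toy numbers are made up for illustration, and the normalization used here for "relative to the actual count" (total absolute error divided by total actual accidents) is an assumption, since this summary does not spell out the exact definition.

```python
def relative_mae(y_true, y_pred):
    """Total absolute error divided by the total actual count.

    Assumed reading of 'MAE relative to the actual count'; the
    paper's exact normalization may differ.
    """
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    total_count = sum(y_true)
    if total_count == 0:
        raise ValueError("relative MAE is undefined when no accidents occurred")
    total_abs_err = sum(abs(t - p) for t, p in zip(y_true, y_pred))
    return total_abs_err / total_count


def auroc(labels, scores):
    """Probability that a random positive outranks a random negative.

    Ties count as half a win. This O(P * N) pairwise form is meant
    for clarity, not for large road networks.
    """
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy example: six road segments (edges) with made-up values.
actual_counts = [2, 0, 1, 4, 0, 3]
predicted_counts = [1.5, 0.5, 1.0, 3.0, 0.0, 3.5]
occurred = [1 if c > 0 else 0 for c in actual_counts]
predicted_risk = [0.9, 0.6, 0.4, 0.8, 0.1, 0.7]

print(relative_mae(actual_counts, predicted_counts))  # 0.25
print(auroc(occurred, predicted_risk))                # 0.875
```

The count metric applies to the regression task (how many accidents on a road), while AUROC applies to the binary task (whether any accident occurs), which is why the sketch uses separate prediction vectors for the two.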

Paper Structure (32 sections, 1 equation, 5 figures, 10 tables)

Introduction
Methodology
Problem setup
Dataset construction
Traffic accident prediction
Experiments
Experimental setup
Experimental results
Ablation studies
Interpretations and implications
Related Work
Discussions
Conclusion
Acknowledgement
Dataset Collection Procedure
...and 17 more sections

Figures (5)

Figure 1: We note there is a clear association between accident occurrences and traffic volume. Combining accident and annual average daily traffic reports using transfer learning techniques can improve accident prediction by 4.6%. Further, road network structural features across states are most predictive of accident occurrences. We capture cross-state variability using multitask learning by combining labels of all states. This enables the transfer of information from states with rich data to those with fewer labels. We find that this outperforms learning from individual state data by 4.7%.
Figure 2: Panels fig_delaw to fig_mass show the evolution of annual accident counts across states; there is a sharp drop in 2020 due to the pandemic. Panels fig_MA_w to fig_ia_sp illustrate the seasonal pattern of accidents, where more accidents occur during winter than in spring.
Figure 3: Distribution of accidents by daily traffic volume.
Figure 4: Pairwise training vs. single-task learning.
Figure 5: We illustrate the seasonal trend of accident counts within a year. Across all eight states, we consistently observe higher accident counts during winter and fall than during spring and summer.
