The Context of Crash Occurrence: A Complexity-Infused Approach Integrating Semantic, Contextual, and Kinematic Features
Meng Wang, Zach Noonan, Pnina Gershon, Bruce Mehler, Bryan Reimer, Shannon C. Roberts
TL;DR
This work addresses predicting crash density in complex driving environments by integrating semantic scene information, contextual road attributes, and vehicle kinematics into a two-stage framework. A complexity-infused encoder extracts hidden contextual representations from multimodal features, which are then combined with original features to predict crash density, achieving $90.15\%$ accuracy on all-feature inputs compared with $87.98\%$ using original features alone. The study demonstrates that AI-generated complexity indices (via LLMs) outperform human annotations in predictive power when integrated with semantic, kinematic, and contextual data, and provides SHAP-based insights into factors driving low, medium, and high crash-density regions. These findings support real-time crash risk estimation, inform driver-assistance and roadway design, and highlight the value of AI-assisted annotation for scalable safety analytics.
Abstract
Understanding the context of crash occurrence in complex driving environments is essential for improving traffic safety and advancing automated driving. Previous studies have used statistical models and deep learning to predict crashes based on semantic, contextual, or vehicle kinematic features, but none have examined the combined influence of these factors. In this study, we term the integration of these features ``roadway complexity''. This paper introduces a two-stage framework that integrates roadway complexity features for crash prediction. In the first stage, an encoder extracts hidden contextual information from these features, generating complexity-infused features. The second stage uses both original and complexity-infused features to predict crash likelihood, achieving an accuracy of 87.98\% with original features alone and 90.15\% with the added complexity-infused features. Ablation studies confirm that a combination of semantic, kinematic, and contextual features yields the best results, which emphasize their role in capturing roadway complexity. Additionally, complexity index annotations generated by the Large Language Model outperform those by Amazon Mechanical Turk, highlighting the potential of AI-based tools for accurate, scalable crash prediction systems.
