Challenges of Data-Driven Simulation of Diverse and Consistent Human Driving Behaviors
Kalle Kujanpää, Daulet Baimukashev, Shibei Zhu, Shoaib Azam, Farzeen Munir, Gokhan Alcan, Ville Kyrki
TL;DR
The paper addresses the challenge of creating statistically realistic driving environments for autonomous vehicles by proposing a data-driven framework that accommodates the complexity, diversity, and temporal consistency of human drivers. It critically compares theory-based IDM with driver-specific parameters to a data-driven policy (XGBoost) using the NGSIM dataset, finding that per-driver IDM offers improvements but remains suboptimal, while XGBoost struggles with one-step predictions and accumulative errors. The analyses reveal that human driving styles are multimodal and non-Gaussian and that driving behavior exhibits temporal consistency, implying the need for driver-aware stochastic models in simulations. These insights pave the way for more faithful, high-fidelity AV simulators capable of capturing real-world driving variability and stability over time.
Abstract
Building simulation environments for developing and testing autonomous vehicles necessitates that the simulators accurately model the statistical realism of the real-world environment, including the interaction with other vehicles driven by human drivers. To address this requirement, an accurate human behavior model is essential to incorporate the diversity and consistency of human driving behavior. We propose a mathematical framework for designing a data-driven simulation model that simulates human driving behavior more realistically than the currently used physics-based simulation models. Experiments conducted using the NGSIM dataset validate our hypothesis regarding the necessity of considering the complexity, diversity, and consistency of human driving behavior when aiming to develop realistic simulators.
