When Context Is Not Enough: Modeling Unexplained Variability in Car-Following Behavior
Chengyuan Zhang, Zhengbing He, Cathy Wu, Lijun Sun
TL;DR
This work addresses unexplained, context-driven variability in car-following by integrating a DeepAR-based mean model with a nonstationary Gaussian process residual governed by a scenario-adaptive Gibbs kernel. The approach learns context-dependent memory and uncertainty, enabling end-to-end probabilistic car-following simulations that better reproduce observed trajectories and provide sharper uncertainty estimates. Empirical results on the HighD dataset show that the nonstationary kernel yields superior RMSE, CRPS, and Energy Score across acceleration, speed, and spacing, outperforming stationary kernels and i.i.d. baselines, with ablations highlighting the importance of temporal correlations. The framework offers interpretable indicators of driving regimes and is practically applicable to uncertainty-aware microsimulation and safety-critical connected/autonomous vehicle analyses, with clear pathways for transferability and extension to more complex traffic scenarios.
Abstract
Modeling car-following behavior is fundamental to microscopic traffic simulation, yet traditional deterministic models often fail to capture the full extent of variability and unpredictability in human driving. While many modern approaches incorporate context-aware inputs (e.g., spacing, speed, relative speed), they frequently overlook structured stochasticity that arises from latent driver intentions, perception errors, and memory effects -- factors that are not directly observable from context alone. To fill the gap, this study introduces an interpretable stochastic modeling framework that captures not only context-dependent dynamics but also residual variability beyond what context can explain. Leveraging deep neural networks integrated with nonstationary Gaussian processes (GPs), our model employs a scenario-adaptive Gibbs kernel to learn dynamic temporal correlations in acceleration decisions, where the strength and duration of correlations between acceleration decisions evolve with the driving context. This formulation enables a principled, data-driven quantification of uncertainty in acceleration, speed, and spacing, grounded in both observable context and latent behavioral variability. Comprehensive experiments on the naturalistic vehicle trajectory dataset collected from the German highway, i.e., the HighD dataset, demonstrate that the proposed stochastic simulation method within this framework surpasses conventional methods in both predictive performance and interpretable uncertainty quantification. The integration of interpretability and accuracy makes this framework a promising tool for traffic analysis and safety-critical applications.
