An Approach to Systematic Data Acquisition and Data-Driven Simulation for the Safety Testing of Automated Driving Functions
Leon Eisemann, Mirjam Fehling-Kaschek, Henrik Gommel, David Hermann, Marvin Klemp, Martin Lauer, Benjamin Lickert, Florian Luettner, Robin Moss, Nicole Neis, Maria Pohle, Simon Romanski, Daniel Stadler, Alexander Stolz, Jens Ziehn, Jingxing Zhou
TL;DR
The paper addresses the challenge of validating automated driving functions in complex open environments where real-world data are scarce. It proposes a data-driven pipeline that collects heterogeneous traffic data (aerial, infrastructure, and vehicle-based) and converts it into map-referenced trajectories to parameterize the driving-behavior model framework, implemented in a $\text{Wiedemann99}$-based simulation within PTV Vissim and a Carla-based SUT digital twin. A probabilistic, likelihood-based optimization fits observed mean velocities of cars and trucks, enabling stochastic validation and generation of diverse traffic dynamics for virtual testing, including near-miss scenarios defined for stochastic risk assessment. The work highlights the need for standardized international data formats (OpenDRIVE/OpenSCENARIO/OpenLABEL) and long-term data collection to realize scalable, quantitative virtual validation for SAE Level 4/5 automated driving systems.
Abstract
With growing complexity and criticality of automated driving functions in road traffic and their operational design domains (ODD), there is increasing demand for covering significant proportions of development, validation, and verification in virtual environments and through simulation models. If, however, simulations are meant not only to augment real-world experiments, but to replace them, quantitative approaches are required that measure to what degree and under which preconditions simulation models adequately represent reality, and thus, using their results accordingly. Especially in R&D areas related to the safety impact of the "open world", there is a significant shortage of real-world data to parameterize and/or validate simulations - especially with respect to the behavior of human traffic participants, whom automated driving functions will meet in mixed traffic. We present an approach to systematically acquire data in public traffic by heterogeneous means, transform it into a unified representation, and use it to automatically parameterize traffic behavior models for use in data-driven virtual validation of automated driving functions.
