NeuroNCAP: Photorealistic Closed-loop Safety Testing for Autonomous Driving
William Ljungbergh, Adam Tonderski, Joakim Johnander, Holger Caesar, Kalle Åström, Michael Felsberg, Christoffer Petersson
TL;DR
NeuroNCAP presents a NeRF-based photorealistic simulator for closed-loop safety testing of autonomous driving, learned from real-world sensor sequences and configurable to generate Euro NCAP-inspired safety scenarios. The framework combines a neural renderer, end-to-end AD models, a controller, and a vehicle dynamics model to create a four-step closed-loop loop that renders sensor data, predicts trajectories, applies controls, and propagates ego-state. Evaluation across stationary, frontal, and side collision scenarios reveals that state-of-the-art end-to-end planners often fail in safety-critical, closed-loop settings, even when perception appears robust, underscoring a gap between perception/prediction and planning. By releasing the simulator and a suite of safety-critical scenarios, NeuroNCAP provides a practical benchmark to stress-test and refine AD models, highlighting the need for safer, more robust end-to-end approaches and improved alignment between modules. The work also analyzes real-to-sim transfer gaps and discusses limitations, pointing to future work in expanding scenario diversity, physical realism, and neural rendering capabilities.
Abstract
We present a versatile NeRF-based simulator for testing autonomous driving (AD) software systems, designed with a focus on sensor-realistic closed-loop evaluation and the creation of safety-critical scenarios. The simulator learns from sequences of real-world driving sensor data and enables reconfigurations and renderings of new, unseen scenarios. In this work, we use our simulator to test the responses of AD models to safety-critical scenarios inspired by the European New Car Assessment Programme (Euro NCAP). Our evaluation reveals that, while state-of-the-art end-to-end planners excel in nominal driving scenarios in an open-loop setting, they exhibit critical flaws when navigating our safety-critical scenarios in a closed-loop setting. This highlights the need for advancements in the safety and real-world usability of end-to-end planners. By publicly releasing our simulator and scenarios as an easy-to-run evaluation suite, we invite the research community to explore, refine, and validate their AD models in controlled, yet highly configurable and challenging sensor-realistic environments. Code and instructions can be found at https://github.com/atonderski/neuro-ncap
