Generating Transferable Adversarial Simulation Scenarios for Self-Driving via Neural Rendering
Yasasa Abeysirigoonawardena, Kevin Xie, Chuhan Chen, Salar Hosseini, Ruiting Chen, Ruiqi Wang, Florian Shkurti
TL;DR
The paper addresses the challenge of safely evaluating self-driving systems by automatically generating adversarial, 3D-consistent scenarios. It introduces a differentiable surrogate scene built from Neural Radiance Fields (NeRF) and formulates adversarial scenario generation as a high-dimensional optimal-control problem that perturbs object textures to maximize policy deviation. Using implicit differentiation (adjoint method) and differentiable rendering, it yields gradient-based, transferable attacks that transfer from the NeRF surrogate to real deployment in both simulation and real-world tests. Gradient-based attacks outperform random baselines and can reveal safety-critical failures, offering a scalable framework for automated AV evaluation and robustness testing. Limitations include reliance on differentiable policies and potential non-smooth optimization landscapes, with future work aiming to handle non-differentiable components and broader sim-to-real transfer improvements.
Abstract
Self-driving software pipelines include components that are learned from a significant number of training examples, yet it remains challenging to evaluate the overall system's safety and generalization performance. Together with scaling up the real-world deployment of autonomous vehicles, it is of critical importance to automatically find simulation scenarios where the driving policies will fail. We propose a method that efficiently generates adversarial simulation scenarios for autonomous driving by solving an optimal control problem that aims to maximally perturb the policy from its nominal trajectory. Given an image-based driving policy, we show that we can inject new objects in a neural rendering representation of the deployment scene, and optimize their texture in order to generate adversarial sensor inputs to the policy. We demonstrate that adversarial scenarios discovered purely in the neural renderer (surrogate scene) can often be successfully transferred to the deployment scene, without further optimization. We demonstrate this transfer occurs both in simulated and real environments, provided the learned surrogate scene is sufficiently close to the deployment scene.
