Adv3D: Generating 3D Adversarial Examples for 3D Object Detection in Driving Scenarios with NeRF
Leheng Li, Qing Lian, Ying-Cong Chen
TL;DR
Adv3D tackles the vulnerability of camera-based 3D object detectors in autonomous driving to physically realizable adversarial textures. It introduces a NeRF-based adversarial framework that renders photorealistic 3D patches and optimizes their textures under an Expectation Over Transformation (EOT) regime, with primitive-aware sampling, disentangled texture/shape modeling, and semantic-guided camouflage to ensure realism and transferability across poses and scenes. The approach demonstrates significant attack effectiveness across multiple detectors on the nuScenes dataset, while also proposing a defensible training strategy via data augmentation that partially restores detector performance. The work highlights a practical threat model and provides concrete methods to both evaluate robustness and guide the design of more resilient 3D perception systems for real-world driving.
Abstract
Deep neural networks (DNNs) have been proven extremely susceptible to adversarial examples, which raises special safety-critical concerns for DNN-based autonomous driving stacks (i.e., 3D object detection). Although there are extensive works on image-level attacks, most are restricted to 2D pixel spaces, and such attacks are not always physically realistic in our 3D world. Here we present Adv3D, the first exploration of modeling adversarial examples as Neural Radiance Fields (NeRFs). Advances in NeRF provide photorealistic appearances and 3D accurate generation, yielding a more realistic and realizable adversarial example. We train our adversarial NeRF by minimizing the surrounding objects' confidence predicted by 3D detectors on the training set. Then we evaluate Adv3D on the unseen validation set and show that it can cause a large performance reduction when rendering NeRF in any sampled pose. To generate physically realizable adversarial examples, we propose primitive-aware sampling and semantic-guided regularization that enable 3D patch attacks with camouflage adversarial texture. Experimental results demonstrate that the trained adversarial NeRF generalizes well to different poses, scenes, and 3D detectors. Finally, we provide a defense method to our attacks that involves adversarial training through data augmentation. Project page: https://len-li.github.io/adv3d-web
