Challenger: Affordable Adversarial Driving Video Generation

Zhiyuan Xu; Bohan Li; Huan-ang Gao; Mingju Gao; Yong Chen; Ming Liu; Chenxu Yan; Hang Zhao; Shuo Feng; Hao Zhao

Challenger: Affordable Adversarial Driving Video Generation

Zhiyuan Xu, Bohan Li, Huan-ang Gao, Mingju Gao, Yong Chen, Ming Liu, Chenxu Yan, Hang Zhao, Shuo Feng, Hao Zhao

TL;DR

Challenger introduces a unified framework for generating photorealistic adversarial driving videos by coupling a diffusion-based trajectory generator with physics-aware planning and multiview neural rendering. The approach yields diverse, physically plausible adversarial maneuvers and renders them into six-camera videos, enabling stress-testing of state-of-the-art end-to-end autonomous driving models. Experiments on Adv-nuSc show substantial increases in collision rates across multiple models and demonstrate transferability of adversarial behaviors, highlighting shared vulnerabilities. The work provides a scalable tool for robustness evaluation while discussing limitations and directions for extending to closed-loop settings and additional modalities.

Abstract

Generating photorealistic driving videos has seen significant progress recently, but current methods largely focus on ordinary, non-adversarial scenarios. Meanwhile, efforts to generate adversarial driving scenarios often operate on abstract trajectory or BEV representations, falling short of delivering realistic sensor data that can truly stress-test autonomous driving (AD) systems. In this work, we introduce Challenger, a framework that produces physically plausible yet photorealistic adversarial driving videos. Generating such videos poses a fundamental challenge: it requires jointly optimizing over the space of traffic interactions and high-fidelity sensor observations. Challenger makes this affordable through two techniques: (1) a physics-aware multi-round trajectory refinement process that narrows down candidate adversarial maneuvers, and (2) a tailored trajectory scoring function that encourages realistic yet adversarial behavior while maintaining compatibility with downstream video synthesis. As tested on the nuScenes dataset, Challenger generates a diverse range of aggressive driving scenarios-including cut-ins, sudden lane changes, tailgating, and blind spot intrusions-and renders them into multiview photorealistic videos. Extensive evaluations show that these scenarios significantly increase the collision rate of state-of-the-art end-to-end AD models (UniAD, VAD, SparseDrive, and DiffusionDrive), and importantly, adversarial behaviors discovered for one model often transfer to others.

Challenger: Affordable Adversarial Driving Video Generation

TL;DR

Abstract

Challenger: Affordable Adversarial Driving Video Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (17)