Adversarial Examples in Environment Perception for Automated Driving (Review)
Jun Yan, Huilin Yin
TL;DR
This survey analyzes how adversarial examples threaten environment perception in automated driving and surveys two broad defense categories: empirical defenses (notably adversarial training) and certified defenses (such as randomized smoothing and formal verification). It catalogs a wide range of attack families—gradient-based, black-box, universal, and physical attacks—and maps their impact across traffic signs, vehicle/detection, road semantics, LiDAR, and trajectory prediction. The work highlights the gap between robust performance and clean accuracy, discusses SOTIF relevance, and presents practical directions for rapid, edge-friendly defense methods and robust evaluation frameworks. Overall, it argues for integrated, safety-focused strategies that combine robust training, detection, and certified guarantees to enable trustworthy autonomous driving systems.
Abstract
The renaissance of deep learning has led to the massive development of automated driving. However, deep neural networks are vulnerable to adversarial examples. The perturbations of adversarial examples are imperceptible to human eyes but can lead to the false predictions of neural networks. It poses a huge risk to artificial intelligence (AI) applications for automated driving. This survey systematically reviews the development of adversarial robustness research over the past decade, including the attack and defense methods and their applications in automated driving. The growth of automated driving pushes forward the realization of trustworthy AI applications. This review lists significant references in the research history of adversarial examples.
