AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems

Ishan Sahu; Somnath Hazra; Somak Aditya; Soumyajit Dey

AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems

Ishan Sahu, Somnath Hazra, Somak Aditya, Soumyajit Dey

TL;DR

This work addresses the robustness of end-to-end autonomous driving systems under adversarial perturbations in the visual perception pipeline. It performs closed-loop CARLA experiments with three black-box attack vectors— Poltergeist (motion blur from acoustic interference), SNAL (ghost object injection), and ESIA (electromagnetic interference)—on state-of-the-art agents and demonstrates driving-score degradations up to 99%. To mitigate such threats, it introduces AD$^2$, a lightweight, real-time detector that leverages spatial-temporal attention over multi-camera inputs to identify adversarial frames without requiring access to internal agent latents, achieving superior detection performance (higher AUC/TPR, lower FPR) and efficiency (≈1.6× faster inference and ≈20× fewer parameters vs. a strong baseline). The results underscore persistent vulnerabilities in end-to-end AD systems under visual attacks while showing that external detectors like AD$^2$ can enable safer operation, for example by triggering alarms or safe-mode controllers; future work should address adaptive adversaries and real-world deployment considerations. Key metrics include Driving Score $DS = R imes P$, Route Completion $R$, Infraction Penalty $P$, and Lane Deviation $L_{ ext{dev}}$, with $R$ and $P$ formalized in the paper and $DS$ reflecting overall safety performance.

Abstract

End-to-end autonomous driving systems have achieved significant progress, yet their adversarial robustness remains largely underexplored. In this work, we conduct a closed-loop evaluation of state-of-the-art autonomous driving agents under black-box adversarial threat models in CARLA. Specifically, we consider three representative attack vectors on the visual perception pipeline: (i) a physics-based blur attack induced by acoustic waves, (ii) an electromagnetic interference attack that distorts captured images, and (iii) a digital attack that adds ghost objects as carefully crafted bounded perturbations on images. Our experiments on two advanced agents, Transfuser and Interfuser, reveal severe vulnerabilities to such attacks, with driving scores dropping by up to 99% in the worst case, raising valid safety concerns. To help mitigate such threats, we further propose a lightweight Attack Detection model for Autonomous Driving systems (AD$^2$) based on attention mechanisms that capture spatial-temporal consistency. Comprehensive experiments across multi-camera inputs on CARLA show that our detector achieves superior detection capability and computational efficiency compared to existing approaches.

AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems

TL;DR

, a lightweight, real-time detector that leverages spatial-temporal attention over multi-camera inputs to identify adversarial frames without requiring access to internal agent latents, achieving superior detection performance (higher AUC/TPR, lower FPR) and efficiency (≈1.6× faster inference and ≈20× fewer parameters vs. a strong baseline). The results underscore persistent vulnerabilities in end-to-end AD systems under visual attacks while showing that external detectors like AD

can enable safer operation, for example by triggering alarms or safe-mode controllers; future work should address adaptive adversaries and real-world deployment considerations. Key metrics include Driving Score

, Route Completion

, Infraction Penalty

, and Lane Deviation

, with

and

formalized in the paper and

reflecting overall safety performance.

Abstract

) based on attention mechanisms that capture spatial-temporal consistency. Comprehensive experiments across multi-camera inputs on CARLA show that our detector achieves superior detection capability and computational efficiency compared to existing approaches.

Paper Structure (22 sections, 9 equations, 9 figures, 10 tables)

This paper contains 22 sections, 9 equations, 9 figures, 10 tables.

Introduction
Related Work
End-to-End Autonomous Driving Systems
Adversarial Attacks on AD Systems
Detection of Adversarial Attack
Adversarial Robustness against Black-Box Attacks for End-to-End AD Systems
Adversarial Attacks on AD Systems
Problem Formulation
Adversarial Attack Detection
Problem Formulation
Motivation and Intuition behind the Method
AD$^2$: A Real-time Adversarial Attack Detector for AD systems
Empirical Details
Attack Evaluation on End-to-End AD Systems
Findings
...and 7 more sections

Figures (9)

Figure 1: Illustration of an adversarial attack on the perception system and a low-latency detector to protect against the attack in an end-to-end closed system model for autonomous driving.
Figure 2: Adversarial attack setup for evaluation of end-to-end autonomous driving systems. Red marked labels are used to indicate the change when the system is under attack.
Figure 3: Example images from cameras of Transfuser. The 3 cameras in Transfuser, Interfuser have overlapping fields of view.
Figure 4: Proposed detection model, AD$^2$, leveraging spatial and temporal consistency.
Figure 5: $L_{\text{dev}}$ plots for AD agents. Attacks are with interval $d=1$. Left side of the lane centre is positive. We have different x-axis (Frame number or Timestep) and y-axis (Distance) scales for different configurations. This is due to the different sequence of actions undertaken by the AD system leading to varying driving times. Under strong poltergeist attack on both agents and ESIA on Transfuser, the vehicle deviation is very high. In such cases, the agent goes outside the lane and ultimately gets blocked by some obstacle. Deviation under SNAL for both agents and under ESIA for Interfuser are comparatively minor.
...and 4 more figures

AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems

TL;DR

Abstract

AD$^2$: Analysis and Detection of Adversarial Threats in Visual Perception for End-to-End Autonomous Driving Systems

Authors

TL;DR

Abstract

Table of Contents

Figures (9)