A Multi-Scale Isolation Forest Approach for Real-Time Detection and Filtering of FGSM Adversarial Attacks in Video Streams of Autonomous Vehicles
Richard Abhulimhen, Negash Begashaw, Gurcan Comert, Chunheng Zhao, Pierluigi Pisu
TL;DR
This work tackles the real-time detection and filtering of FGSM adversarial perturbations in video streams from autonomous-vehicle perception. It introduces a multi-scale Isolation Forest-based framework complemented by statistical analysis and OC-SVM, evaluated on a dataset of 60,000 images (10,000 originals plus five perturbation levels up to $ε=0.2$). The proposed pipeline achieves high detection performance (Precision $98.2\%$, Recall $97.5\%$, Accuracy $98.7\%$) and scales efficiently with parallel processing, showing strong viability for deployment in real-time perception systems. The approach demonstrates robust discrimination across perturbation strengths, with near-perfect ROC-AUC ($0.995$) and favorable comparisons to prior defenses, while providing visual and decorative tools to interpret results. Practical impact includes enhanced resilience of autonomous-vehicle perception against FGSM attacks, enabling safer and more reliable real-time decision-making.
Abstract
Deep Neural Networks (DNNs) have demonstrated remarkable success across a wide range of tasks, particularly in fields such as image classification. However, DNNs are highly susceptible to adversarial attacks, where subtle perturbations are introduced to input images, leading to erroneous model outputs. In today's digital era, ensuring the security and integrity of images processed by DNNs is of critical importance. One of the most prominent adversarial attack methods is the Fast Gradient Sign Method (FGSM), which perturbs images in the direction of the loss gradient to deceive the model. This paper presents a novel approach for detecting and filtering FGSM adversarial attacks in image processing tasks. Our proposed method evaluates 10,000 images, each subjected to five different levels of perturbation, characterized by $ε$ values of 0.01, 0.02, 0.05, 0.1, and 0.2. These perturbations are applied in the direction of the loss gradient. We demonstrate that our approach effectively filters adversarially perturbed images, mitigating the impact of FGSM attacks. The method is implemented in Python, and the source code is publicly available on GitHub for reproducibility and further research.
