On Inherent Adversarial Robustness of Active Vision Systems
Amitangshu Mukherjee, Timur Ibrayev, Kaushik Roy
TL;DR
Deep neural networks remain vulnerable to adversarial perturbations, unlike human vision which benefits from saccades and foveation. The authors propose active vision mechanisms, GFNet and FALcon, that process images through glimpses at downsampled resolutions and from multiple fixation points, and demonstrate their inherent robustness under a black-box threat model. Across ImageNet evaluations and multiple iterative attacks, GFNet and FALcon achieve $2$-$3$x higher accuracy under attack compared to a passive baseline, aided by interpretable visualizations such as Initial Fixation Point Maps (IFPM) and occlusion analyses. This work suggests that incorporating active, biologically inspired processing can enhance robustness and informs future defense strategies and bio-inspired robustness research.
Abstract
Current Deep Neural Networks are vulnerable to adversarial examples, which alter their predictions by adding carefully crafted noise. Since human eyes are robust to such inputs, it is possible that the vulnerability stems from the standard way of processing inputs in one shot by processing every pixel with the same importance. In contrast, neuroscience suggests that the human vision system can differentiate salient features by (1) switching between multiple fixation points (saccades) and (2) processing the surrounding with a non-uniform external resolution (foveation). In this work, we advocate that the integration of such active vision mechanisms into current deep learning systems can offer robustness benefits. Specifically, we empirically demonstrate the inherent robustness of two active vision methods - GFNet and FALcon - under a black box threat model. By learning and inferencing based on downsampled glimpses obtained from multiple distinct fixation points within an input, we show that these active methods achieve (2-3) times greater robustness compared to a standard passive convolutional network under state-of-the-art adversarial attacks. More importantly, we provide illustrative and interpretable visualization analysis that demonstrates how performing inference from distinct fixation points makes active vision methods less vulnerable to malicious inputs.
