PatchGuard: Adversarially Robust Anomaly Detection and Localization through Vision Transformers and Pseudo Anomalies
Mojtaba Nafez, Amirhossein Koochakian, Arad Maleki, Jafar Habibi, Mohammad Hossein Rohban
TL;DR
PatchGuard tackles the susceptibility of anomaly detection and localization to adversarial perturbations by generating foreground-aware pseudo-anomalies from normal data and training a Vision Transformer with an attention-regularized loss. The method combines Grad-CAM guided pseudo-anomaly generation, a patch-wise Attention Discriminator, and a loss that increases the ViT’s last-layer attention degree, supported by theoretical analyses. Empirically, PatchGuard achieves substantial robustness gains under PGD-1000 attacks (up to 53.2% in AD and 68.5% in AL) across eight industrial and medical datasets while preserving competitive performance in clean settings, and it outperforms adapted SOTA methods in adversarial scenarios. These results suggest a practical pathway toward reliable, pixel-precise anomaly detection and localization in high-resolution applications, with code available at the authors’ repository.
Abstract
Anomaly Detection (AD) and Anomaly Localization (AL) are crucial in fields that demand high reliability, such as medical imaging and industrial monitoring. However, current AD and AL approaches are often susceptible to adversarial attacks due to limitations in training data, which typically include only normal, unlabeled samples. This study introduces PatchGuard, an adversarially robust AD and AL method that incorporates pseudo anomalies with localization masks within a Vision Transformer (ViT)-based architecture to address these vulnerabilities. We begin by examining the essential properties of pseudo anomalies, and follow it by providing theoretical insights into the attention mechanisms required to enhance the adversarial robustness of AD and AL systems. We then present our approach, which leverages Foreground-Aware Pseudo-Anomalies to overcome the deficiencies of previous anomaly-aware methods. Our method incorporates these crafted pseudo-anomaly samples into a ViT-based framework, with adversarial training guided by a novel loss function designed to improve model robustness, as supported by our theoretical analysis. Experimental results on well-established industrial and medical datasets demonstrate that PatchGuard significantly outperforms previous methods in adversarial settings, achieving performance gains of $53.2\%$ in AD and $68.5\%$ in AL, while also maintaining competitive accuracy in non-adversarial settings. The code repository is available at https://github.com/rohban-lab/PatchGuard .
