Seeing Isn't Believing: Context-Aware Adversarial Patch Synthesis via Conditional GAN
Roie Kazoom, Alon Goldberg, Hodaya Cohen, Ofer Hadar
TL;DR
This work presents a targeted, realism-aware adversarial patch synthesis framework that operates under strict black-box conditions. By conditioning the patch generator on real input images and guiding patch placement with Grad-CAM from a surrogate model, the method achieves precise target misclassification while maintaining visual plausibility. A multi-objective loss combining adversarial, patch-consistency, and perceptual terms enables high attack success and robust realism across CNNs and Vision Transformers, with strong transferability and defense robustness. The approach exposes practical vulnerabilities in modern vision systems and establishes a new benchmark for realistic, context-aware adversarial patches.
Abstract
Adversarial patch attacks pose a severe threat to deep neural networks, yet most existing approaches rely on unrealistic white-box assumptions, untargeted objectives, or produce visually conspicuous patches that limit real-world applicability. In this work, we introduce a novel framework for fully controllable adversarial patch generation, where the attacker can freely choose both the input image x and the target class y target, thereby dictating the exact misclassification outcome. Our method combines a generative U-Net design with Grad-CAM-guided patch placement, enabling semantic-aware localization that maximizes attack effectiveness while preserving visual realism. Extensive experiments across convolutional networks (DenseNet-121, ResNet-50) and vision transformers (ViT-B/16, Swin-B/16, among others) demonstrate that our approach achieves state-of-the-art performance across all settings, with attack success rates (ASR) and target-class success (TCS) consistently exceeding 99%. Importantly, we show that our method not only outperforms prior white-box attacks and untargeted baselines, but also surpasses existing non-realistic approaches that produce detectable artifacts. By simultaneously ensuring realism, targeted control, and black-box applicability-the three most challenging dimensions of patch-based attacks-our framework establishes a new benchmark for adversarial robustness research, bridging the gap between theoretical attack strength and practical stealthiness.
