AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks
Jin Li, Ziqiang He, Anwei Luo, Jian-Fang Hu, Z. Jane Wang, Xiangui Kang
TL;DR
AdvAD reframes imperceptible adversarial attacks as a non-parametric diffusion process initiated from fixed noise, guided at each step by AMG and constrained by PC to minimize perceptual perturbations. It eliminates the need for extra networks by leveraging the attacked model itself to generate adversarial guidance, and grounds the approach in diffusion theory (DDIM/conditional sampling) with provable bounds on perturbation behavior. An enhanced AdvAD-X version introduces Dynamic Guidance Injection and CAM-based suppression to push performance toward an extreme, idealized limit, while maintaining theoretical interpretability. Extensive experiments across four architectures on an ImageNet-like dataset show AdvAD achieves near-perfect attack success with substantially lower perturbation strength and favorable perceptual metrics, outperforming existing imperceptible attack methods and demonstrating robust behavior under defenses. The work suggests a general, theoretically grounded diffusion-based attack framework with potential implications for understanding DNN robustness and guiding defense strategies, including in high-precision floating-point regimes.
Abstract
Imperceptible adversarial attacks aim to fool DNNs by adding imperceptible perturbation to the input data. Previous methods typically improve the imperceptibility of attacks by integrating common attack paradigms with specifically designed perception-based losses or the capabilities of generative models. In this paper, we propose Adversarial Attacks in Diffusion (AdvAD), a novel modeling framework distinct from existing attack paradigms. AdvAD innovatively conceptualizes attacking as a non-parametric diffusion process by theoretically exploring basic modeling approach rather than using the denoising or generation abilities of regular diffusion models requiring neural networks. At each step, much subtler yet effective adversarial guidance is crafted using only the attacked model without any additional network, which gradually leads the end of diffusion process from the original image to a desired imperceptible adversarial example. Grounded in a solid theoretical foundation of the proposed non-parametric diffusion process, AdvAD achieves high attack efficacy and imperceptibility with intrinsically lower overall perturbation strength. Additionally, an enhanced version AdvAD-X is proposed to evaluate the extreme of our novel framework under an ideal scenario. Extensive experiments demonstrate the effectiveness of the proposed AdvAD and AdvAD-X. Compared with state-of-the-art imperceptible attacks, AdvAD achieves an average of 99.9$\%$ (+17.3$\%$) ASR with 1.34 (-0.97) $l_2$ distance, 49.74 (+4.76) PSNR and 0.9971 (+0.0043) SSIM against four prevalent DNNs with three different architectures on the ImageNet-compatible dataset. Code is available at https://github.com/XianguiKang/AdvAD.
