SCA: Improve Semantic Consistent in Unrestricted Adversarial Attacks via DDPM Inversion
Zihao Pan, Lifeng Chen, Weibin Wu, Yuhang Cao, Zibin Zheng
TL;DR
The paper tackles the problem of generating unrestricted adversarial examples that alter high-level semantics while remaining photorealistic. It introduces SCA, a two-stage framework combining Semantic Fixation Inversion and Semantically Guided Perturbation to imprint rich semantic priors from MLLMs into an edit-friendly diffusion latent space, and to optimize perturbations with gradient-free, semantically guided updates. With DPM Solver++ acceleration, SCA achieves substantial efficiency (~12x faster) while maintaining or improving semantic consistency (as measured by CLIP Score and LPIPS) and maintaining competitive attack success across CNNs and ViTs. The approach produces Semantic-Consistent Adversarial Examples (SCAE) that preserve original content and scene context, enabling more covert and transferable attacks. These results advance understanding of diffusion-based adversarial vulnerabilities and highlight potential avenues for robust defenses against semantically driven manipulation.
Abstract
Systems based on deep neural networks are vulnerable to adversarial attacks. Unrestricted adversarial attacks typically manipulate the semantic content of an image (e.g., color or texture) to create adversarial examples that are both effective and photorealistic. Recent works have utilized the diffusion inversion process to map images into a latent space, where high-level semantics are manipulated by introducing perturbations. However, they often result in substantial semantic distortions in the denoised output and suffer from low efficiency. In this study, we propose a novel framework called Semantic-Consistent Unrestricted Adversarial Attacks (SCA), which employs an inversion method to extract edit-friendly noise maps and utilizes a Multimodal Large Language Model (MLLM) to provide semantic guidance throughout the process. Under the condition of rich semantic information provided by MLLM, we perform the DDPM denoising process of each step using a series of edit-friendly noise maps and leverage DPM Solver++ to accelerate this process, enabling efficient sampling with semantic consistency. Compared to existing methods, our framework enables the efficient generation of adversarial examples that exhibit minimal discernible semantic changes. Consequently, we for the first time introduce Semantic-Consistent Adversarial Examples (SCAE). Extensive experiments and visualizations have demonstrated the high efficiency of SCA, particularly in being on average 12 times faster than the state-of-the-art attacks. Our code can be found at https://github.com/Pan-Zihao/SCA.
