Pixel Is Not a Barrier: An Effective Evasion Attack for Pixel-Domain Diffusion Models
Chun-Yen Shih, Li-Xuan Peng, Jia-Wei Liao, Ernie Chu, Cheng-Fu Chou, Jun-Cheng Chen
TL;DR
This work tackles the risk of malicious diffusion-based image editing by developing AtkPDM, a pixel-domain attack against Pixel-domain Diffusion Models (PDMs). It introduces a feature-attacking loss operating on denoising UNet representations and a fidelity constraint, complemented by latent optimization via a pretrained VAE to preserve image naturalness, formulated and solved with alternating optimization. The approach achieves state-of-the-art attack performance on PDMs (and transfers to LDMs), while remaining robust to common defenses such as purification, cropping, and JPEG compression. This reveals a vulnerabilities in UNet-based diffusion models and provides a practical image-protection mechanism against diffusion-based editing, with potential implications for safety and IP protection in visual content.
Abstract
Diffusion Models have emerged as powerful generative models for high-quality image synthesis, with many subsequent image editing techniques based on them. However, the ease of text-based image editing introduces significant risks, such as malicious editing for scams or intellectual property infringement. Previous works have attempted to safeguard images from diffusion-based editing by adding imperceptible perturbations. These methods are costly and specifically target prevalent Latent Diffusion Models (LDMs), while Pixel-domain Diffusion Models (PDMs) remain largely unexplored and robust against such attacks. Our work addresses this gap by proposing a novel attack framework, AtkPDM. AtkPDM is mainly composed of a feature representation attacking loss that exploits vulnerabilities in denoising UNets and a latent optimization strategy to enhance the naturalness of adversarial images. Extensive experiments demonstrate the effectiveness of our approach in attacking dominant PDM-based editing methods (e.g., SDEdit) while maintaining reasonable fidelity and robustness against common defense methods. Additionally, our framework is extensible to LDMs, achieving comparable performance to existing approaches.
