Disrupting Diffusion-based Inpainters with Semantic Digression
Geonho Son, Juhun Lee, Simon S. Woo
TL;DR
This work addresses the rising threat of malicious edits using diffusion-based inpainters by introducing DDD, a disruption framework that performs semantic digression to immunize context images. Rather than brute-forcing through the full diffusion chain, DDD targets vulnerable early timesteps and optimizes a timestep-agnostic hidden-space loss around a multimodal semantic centroid, refined via token-projected text optimization. The key contributions are: (1) identifying a vulnerable timestep range, (2) formulating a timestep-free loss in hidden space, (3) defining a semantic centroid via Monte Carlo sampling, and (4) enabling stable, discrete token projection for text conditioning. Empirically, DDD outperforms the prior state-of-the-art Photoguard across quantitative disruption metrics and qualitative evaluations, while delivering roughly threefold reductions in compute time and memory usage, thereby democratizing image protection against unconsented edits with practical hardware requirements.
Abstract
The fabrication of visual misinformation on the web and social media has increased exponentially with the advent of foundational text-to-image diffusion models. Namely, Stable Diffusion inpainters allow the synthesis of maliciously inpainted images of personal and private figures, and copyrighted contents, also known as deepfakes. To combat such generations, a disruption framework, namely Photoguard, has been proposed, where it adds adversarial noise to the context image to disrupt their inpainting synthesis. While their framework suggested a diffusion-friendly approach, the disruption is not sufficiently strong and it requires a significant amount of GPU and time to immunize the context image. In our work, we re-examine both the minimal and favorable conditions for a successful inpainting disruption, proposing DDD, a "Digression guided Diffusion Disruption" framework. First, we identify the most adversarially vulnerable diffusion timestep range with respect to the hidden space. Within this scope of noised manifold, we pose the problem as a semantic digression optimization. We maximize the distance between the inpainting instance's hidden states and a semantic-aware hidden state centroid, calibrated both by Monte Carlo sampling of hidden states and a discretely projected optimization in the token space. Effectively, our approach achieves stronger disruption and a higher success rate than Photoguard while lowering the GPU memory requirement, and speeding the optimization up to three times faster.
