Adversarial Perturbations Cannot Reliably Protect Artists From Generative AI
Robert Hönig, Javier Rando, Nicholas Carlini, Florian Tramèr
TL;DR
This paper challenges the effectiveness of adversarial perturbation-based protections (e.g., Glaze, Mist, Anti-DreamBooth) against generative AI style mimicry. It introduces a unified, rigorous evaluation protocol and a set of simple, off-the-shelf robust mimicry methods, including Gaussian noising, DiffPure, Noisy Upscaling, and IMPRESS++, to test protections under realistic attacker conditions. Through a comprehensive user study across ten artists, it shows that all existing protections can be bypassed, with Noisy Upscaling often achieving median success rates around or above 40%, effectively making protected artworks indistinguishable from unprotected baselines. The authors argue that adversarial perturbations cannot reliably shield artists and advocate adaptive evaluation and non-technological protective measures to mitigate misuse of generative AI in the art domain.
Abstract
Artists are increasingly concerned about advancements in image generation models that can closely replicate their unique artistic styles. In response, several protection tools against style mimicry have been developed that incorporate small adversarial perturbations into artworks published online. In this work, we evaluate the effectiveness of popular protections -- with millions of downloads -- and show they only provide a false sense of security. We find that low-effort and "off-the-shelf" techniques, such as image upscaling, are sufficient to create robust mimicry methods that significantly degrade existing protections. Through a user study, we demonstrate that all existing protections can be easily bypassed, leaving artists vulnerable to style mimicry. We caution that tools based on adversarial perturbations cannot reliably protect artists from the misuse of generative AI, and urge the development of alternative non-technological solutions.
