WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models
Changhoon Kim, Kyle Min, Maitreya Patel, Sheng Cheng, Yezhou Yang
TL;DR
The paper tackles the problem of attributing responsibility for images produced by text-to-image diffusion models by introducing WOUAF, a distributor-oriented fingerprinting method that embeds per-user signals into the decoder via weight modulation. This approach achieves near-perfect attribution accuracy with negligible degradation to image quality and demonstrates robustness against diverse post-processing and deliberate removal attempts, outperforming baselines. It is computationally efficient, requiring only a single forward pass to generate a fingerprinted model, and scalable to billions of users. The work lays groundwork for accountable model distribution and practical safeguards against misuse, with code available for replication and extension to other modalities in future work.
Abstract
The rapid advancement of generative models, facilitating the creation of hyper-realistic images from textual descriptions, has concurrently escalated critical societal concerns such as misinformation. Although providing some mitigation, traditional fingerprinting mechanisms fall short in attributing responsibility for the malicious use of synthetic images. This paper introduces a novel approach to model fingerprinting that assigns responsibility for the generated images, thereby serving as a potential countermeasure to model misuse. Our method modifies generative models based on each user's unique digital fingerprint, imprinting a unique identifier onto the resultant content that can be traced back to the user. This approach, incorporating fine-tuning into Text-to-Image (T2I) tasks using the Stable Diffusion Model, demonstrates near-perfect attribution accuracy with a minimal impact on output quality. Through extensive evaluation, we show that our method outperforms baseline methods with an average improvement of 11\% in handling image post-processes. Our method presents a promising and novel avenue for accountable model distribution and responsible use. Our code is available in \url{https://github.com/kylemin/WOUAF}.
