Harnessing Frequency Spectrum Insights for Image Copyright Protection Against Diffusion Models
Zhenguang Liu, Chao Shuai, Shaojing Fan, Ziping Dong, Jinwu Hu, Zhongjie Ba, Kui Ren
TL;DR
The paper reveals that diffusion-generated images retain the spectral properties of their training data, a property leveraged to protect image copyrights via CoprGuard, a spectrum-based watermarking framework. By embedding watermarks in the DWT domain with a HiNet-based encoder and counteracting potential erosion with an Information Enhancement Module, CoprGuard enables black-box detection of unauthorized image usage during diffusion-model training or fine-tuning. Empirical results across DDIM, CF-Guidance, and Stable Diffusion on FFHQ, ImageNet, and Pokemon demonstrate 100% infringement-detection accuracy even when watermarked data comprise as little as 1% of the training set, with minimal impact on image quality and strong robustness to transformations. The work provides a practical, model-agnostic approach to safeguard IP in AI-driven image generation and opens directions for extending spectral-watermarking techniques and leakage analysis in diffusion-based systems.
Abstract
Diffusion models have achieved remarkable success in novel view synthesis, but their reliance on large, diverse, and often untraceable Web datasets has raised pressing concerns about image copyright protection. Current methods fall short in reliably identifying unauthorized image use, as they struggle to generalize across varied generation tasks and fail when the training dataset includes images from multiple sources with few identifiable (watermarked or poisoned) samples. In this paper, we present novel evidence that diffusion-generated images faithfully preserve the statistical properties of their training data, particularly reflected in their spectral features. Leveraging this insight, we introduce \emph{CoprGuard}, a robust frequency domain watermarking framework to safeguard against unauthorized image usage in diffusion model training and fine-tuning. CoprGuard demonstrates remarkable effectiveness against a wide range of models, from naive diffusion models to sophisticated text-to-image models, and is robust even when watermarked images comprise a mere 1\% of the training dataset. This robust and versatile approach empowers content owners to protect their intellectual property in the era of AI-driven image generation.
