Image Watermarking of Generative Diffusion Models
Yunzhuo Chen, Jordan Vice, Naveed Akhtar, Nur Al Hasan Haldar, Ajmal Mian
TL;DR
This paper tackles copyright protection for diffusion-generated content by embedding watermarks directly into the diffusion generation process. It introduces a Watermark Autoencoder and a Watermark Extractor to enable end-to-end embedding and recovery of imperceptible watermark features, with a differentiable framework that blends watermark loss with standard diffusion loss. A Generative Watermarked Image (GWI) dataset is presented to benchmark watermark presence, reconstruction, and model-source classification across DDIM and DDPM architectures. Experimental results show that image quality remains high while watermarks are robust to common attacks, enabling reliable ownership verification and model attribution for diffusion-based content.
Abstract
Embedding watermarks into the output of generative models is essential for establishing copyright and verifiable ownership over the generated content. Emerging diffusion model watermarking methods either embed watermarks in the frequency domain or offer limited versatility of the watermark patterns in the image space, which allows simplistic detection and removal of the watermarks from the generated content. To address this issue, we propose a watermarking technique that embeds watermark features into the diffusion model itself. Our technique enables training of a paired watermark extractor for a generative model that is learned through an end-to-end process. The extractor forces the generator, during training, to effectively embed versatile, imperceptible watermarks in the generated content while simultaneously ensuring their precise recovery. We demonstrate highly accurate watermark embedding/detection and show that it is also possible to distinguish between different watermarks embedded with our method to differentiate between generative models.
