WMAdapter: Adding WaterMark Control to Latent Diffusion Models
Hai Ci, Yiren Song, Pei Yang, Jinheng Xie, Mike Zheng Shou
TL;DR
WMAdapter presents a plug-and-play watermarking solution for latent diffusion models by introducing a lightweight contextual adapter that can imprint arbitrary watermark bits during generation without per-watermark finetuning. It leverages a two-stage training regime, including a novel hybrid finetuning that jointly tunes the adapter and a fixed VAE to suppress tiny artifacts while preserving sharpness. Empirical results show competitive bit accuracy and near-perfect tracing across large user pools, with superior image quality (PSNR/SSIM) and competitive robustness compared to post-hoc and diffusion-native baselines. The approach enables scalable, high-fidelity watermarking with potential zero-shot transfer across different VAEs and diffusion variants, albeit with some artifacts in certain finetuning settings that warrant further refinement.
Abstract
Watermarking is crucial for protecting the copyright of AI-generated images. We propose WMAdapter, a diffusion model watermark plugin that takes user-specified watermark information and allows for seamless watermark imprinting during the diffusion generation process. WMAdapter is efficient and robust, with a strong emphasis on high generation quality. To achieve this, we make two key designs: (1) We develop a contextual adapter structure that is lightweight and enables effective knowledge transfer from heavily pretrained post-hoc watermarking models. (2) We introduce an extra finetuning step and design a hybrid finetuning strategy to further improve image quality and eliminate tiny artifacts. Empirical results demonstrate that WMAdapter offers strong flexibility, exceptional image generation quality and competitive watermark robustness.
