Spread them Apart: Towards Robust Watermarking of Generated Content
Mikhail Pautov, Danil Ivanov, Andrey V. Galichin, Oleg Rogov, Ivan Oseledets
TL;DR
The paper addresses the challenge of identifying the provenance and ownership of content produced by powerful diffusion-based generators. It introduces Spread them Apart, a watermarking scheme embedded during inference that assigns each user a binary watermark and a secret, enabling simultaneous detection and attribution with a provable robustness guarantee against bounded additive perturbations. The method optimizes a pixel-pair constraint loss in the latent space to embed watermarks while preserving image quality, and provides theoretical bounds showing that erasing bits requires perturbations above certain magnitudes. Empirically, it achieves competitive image quality and strong watermark robustness against common post-processing attacks, and proposes an extension to defend against geometric transformations by embedding multiple watermarks in pixel and invariant domains. The work offers a practical, non-intrusive approach to protecting generated content and user accountability, with clear avenues for handling more transformations through invariants in future work.
Abstract
Generative models that can produce realistic images have improved significantly in recent years. The quality of the generated content has increased drastically, so sometimes it is very difficult to distinguish between the real images and the generated ones. Such an improvement comes at a price of ethical concerns about the usage of the generative models: the users of generative models can improperly claim ownership of the generated content protected by a license. In this paper, we propose an approach to embed watermarks into the generated content to allow future detection of the generated content and identification of the user who generated it. The watermark is embedded during the inference of the model, so the proposed approach does not require the retraining of the latter. We prove that watermarks embedded are guaranteed to be robust against additive perturbations of a bounded magnitude. We apply our method to watermark diffusion models and show that it matches state-of-the-art watermarking schemes in terms of robustness to different types of synthetic watermark removal attacks.
