Detection Limits and Statistical Separability of Tree Ring Watermarks in Rectified Flow-based Text-to-Image Generation Models
Ved Umrajkar, Aakash Kumar Singh
TL;DR
This work investigates Tree-Ring Watermarking in rectified-flow text-to-image models, focusing on FLUX versus SD 2.1. It analyzes watermark reconstruction via noise-latent inversion and evaluates separability between watermarked and non-watermarked images under various prompts and attacks. The study finds that FLUX's invertibility and watermark detection are highly sensitive to prompt guidance and attack conditions, while diffusion-based SD models maintain more robust separability. The results underscore the need for improved inversion techniques tailored to flow-based architectures and more robust watermarking strategies for real-world deployment.
Abstract
Tree-Ring Watermarking is a significant technique for authenticating AI-generated images. However, its effectiveness in rectified flow-based models remains unexplored, particularly given the inherent challenges of these models with noise latent inversion. Through extensive experimentation, we evaluated and compared the detection and separability of watermarks between SD 2.1 and FLUX.1-dev models. By analyzing various text guidance configurations and augmentation attacks, we demonstrate how inversion limitations affect both watermark recovery and the statistical separation between watermarked and unwatermarked images. Our findings provide valuable insights into the current limitations of Tree-Ring Watermarking in the current SOTA models and highlight the critical need for improved inversion methods to achieve reliable watermark detection and separability. The official implementation, dataset release and all experimental results are available at this \href{https://github.com/dsgiitr/flux-watermarking}{\textbf{link}}.
