WMCopier: Forging Invisible Image Watermarks on Arbitrary Images
Ziping Dong, Chao Shuai, Zhongjie Ba, Peng Cheng, Zhan Qin, Qinglong Wang, Kui Ren
TL;DR
WMCopier exposes a vulnerability in invisible watermarking for AI-generated content by introducing a diffusion-model-based no-box forgery attack. It learns the watermark distribution from watermarked data, uses shallow inversion to fuse watermark signals into clean images, and applies score-based refinement to enhance fidelity to the watermark manifold while preserving content. Empirical results show strong forgery performance across open-source schemes and a deployed Amazon system, with PSNR around 30 dB and high bit-accuracy/FPR. The work also proposes a multi-message defense to raise the bar for future watermark designs, highlighting the need for robust defenses in real-world Gen-AI deployments.
Abstract
Invisible Image Watermarking is crucial for ensuring content provenance and accountability in generative AI. While Gen-AI providers are increasingly integrating invisible watermarking systems, the robustness of these schemes against forgery attacks remains poorly characterized. This is critical, as forging traceable watermarks onto illicit content leads to false attribution, potentially harming the reputation and legal standing of Gen-AI service providers who are not responsible for the content. In this work, we propose WMCopier, an effective watermark forgery attack that operates without requiring any prior knowledge of or access to the target watermarking algorithm. Our approach first models the target watermark distribution using an unconditional diffusion model, and then seamlessly embeds the target watermark into a non-watermarked image via a shallow inversion process. We also incorporate an iterative optimization procedure that refines the reconstructed image to further trade off the fidelity and forgery efficiency. Experimental results demonstrate that WMCopier effectively deceives both open-source and closed-source watermark systems (e.g., Amazon's system), achieving a significantly higher success rate than existing methods. Additionally, we evaluate the robustness of forged samples and discuss the potential defenses against our attack.
