DiffMark: Diffusion-based Robust Watermark Against Deepfakes

Chen Sun; Haiyang Sun; Zhiqing Guo; Yunfeng Diao; Liejun Wang; Dan Ma; Gaobo Yang; Keqin Li

DiffMark: Diffusion-based Robust Watermark Against Deepfakes

Chen Sun, Haiyang Sun, Zhiqing Guo, Yunfeng Diao, Liejun Wang, Dan Ma, Gaobo Yang, Keqin Li

TL;DR

DiffMark tackles the vulnerability of facial image watermarks to Deepfake manipulation by introducing a diffusion-based proactive watermarking framework. It conditions the diffusion process on both the facial image and a binary watermark, with a scaled facial cue $x_c = \sqrt{1-\overline{\alpha}_t} x_0$ to preserve guidance early in sampling and gradually attenuate it. Key contributions include a cross information fusion module with positional-bit embedding indices, a training-time frozen autoencoder to simulate Deepfakes, and a Deepfake-resistant guidance mechanism during DDIM sampling to improve robustness. Experiments on CelebA-HQ and LFW show DiffMark achieves strong invisibility (high PSNR/SSIM, low LPIPS) and robust watermark recovery (low BER) under diverse benign distortions and Deepfake attacks, supporting practical use for authenticity verification and source tracing.

Abstract

Deepfakes pose significant security and privacy threats through malicious facial manipulations. While robust watermarking can aid in authenticity verification and source tracking, existing methods often lack the sufficient robustness against Deepfake manipulations. Diffusion models have demonstrated remarkable performance in image generation, enabling the seamless fusion of watermark with image during generation. In this study, we propose a novel robust watermarking framework based on diffusion model, called DiffMark. By modifying the training and sampling scheme, we take the facial image and watermark as conditions to guide the diffusion model to progressively denoise and generate corresponding watermarked image. In the construction of facial condition, we weight the facial image by a timestep-dependent factor that gradually reduces the guidance intensity with the decrease of noise, thus better adapting to the sampling process of diffusion model. To achieve the fusion of watermark condition, we introduce a cross information fusion (CIF) module that leverages a learnable embedding table to adaptively extract watermark features and integrates them with image features via cross-attention. To enhance the robustness of the watermark against Deepfake manipulations, we integrate a frozen autoencoder during training phase to simulate Deepfake manipulations. Additionally, we introduce Deepfake-resistant guidance that employs specific Deepfake model to adversarially guide the diffusion sampling process to generate more robust watermarked images. Experimental results demonstrate the effectiveness of the proposed DiffMark on typical Deepfakes. Our code will be available at https://github.com/vpsg-research/DiffMark.

DiffMark: Diffusion-based Robust Watermark Against Deepfakes

TL;DR

Abstract

DiffMark: Diffusion-based Robust Watermark Against Deepfakes

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)