Robust Identity Perceptual Watermark Against Deepfake Face Swapping
Tianyi Wang, Mengxiao Huang, Harry Cheng, Bin Ma, Yinglong Wang
TL;DR
This work tackles privacy risks posed by Deepfake face swapping by introducing a robust identity perceptual watermarking framework. It encodes semantically meaningful watermarks tied to image identity, protected by a chaotic encryption scheme, and jointly trains an encoder-decoder to enable detection and source tracing without ground-truth watermarks. Across GAN- and diffusion-based swaps and in cross-dataset settings, the method achieves state-of-the-art watermark recovery, high Deepfake-detection AUC, and strong source-tracing accuracies, validating practical applicability. A noted limitation is its current focus on identity-preserving swaps, with future work aimed at extending protection to face re-enactment manipulations.
Abstract
Notwithstanding offering convenience and entertainment to society, Deepfake face swapping has caused critical privacy issues with the rapid development of deep generative models. Due to imperceptible artifacts in high-quality synthetic images, passive detection models against face swapping in recent years usually suffer performance damping regarding the generalizability issue in cross-domain scenarios. Therefore, several studies have been attempted to proactively protect the original images against malicious manipulations by inserting invisible signals in advance. However, existing proactive defense approaches demonstrate unsatisfactory results with respect to visual quality, detection accuracy, and source tracing ability. In this study, to fulfill the research gap, we propose a robust identity perceptual watermarking framework that concurrently performs detection and source tracing against Deepfake face swapping proactively. We innovatively assign identity semantics regarding the image contents to the watermarks and devise an unpredictable and nonreversible chaotic encryption system to ensure watermark confidentiality. The watermarks are robustly encoded and recovered by jointly training an encoder-decoder framework along with adversarial image manipulations. For a suspect image, falsification is accomplished by justifying the consistency between the content-matched identity perceptual watermark and the recovered robust watermark, without requiring the ground-truth. Moreover, source tracing can be accomplished based on the identity semantics that the recovered watermark carries. Extensive experiments demonstrate state-of-the-art detection and source tracing performance against Deepfake face swapping with promising watermark robustness for both cross-dataset and cross-manipulation settings.
