FractalForensics: Proactive Deepfake Detection and Localization via Fractal Watermarks
Tianyi Wang, Harry Cheng, Ming-Hui Liu, Mohan Kankanhalli
TL;DR
FractalForensics addresses the challenge of proactive Deepfake detection with localization by introducing fractal watermarks generated from a parameter-driven pipeline and encrypted via chaotic maps. The method embeds a 4-bit, 4-channel watermark w^c using an entry-to-patch strategy, enabling spatially aware localization while preserving image quality through an end-to-end trainable framework. Experiments on CelebA-HQ and LFW show strong visual fidelity, high patch- and bit-wise watermark recovery, and superior AUC-based Deepfake detection compared with state-of-the-art semi-fragile and passive detectors, with explicit localization of manipulated regions. The work demonstrates the feasibility and practicality of fractal-based, semi-fragile watermarks for proactive defense and forensic explainability in real-world Deepfake scenarios, and suggests avenues for automatic fractal self-checks and extended tracing capabilities.
Abstract
Proactive Deepfake detection via robust watermarks has seen interest ever since passive Deepfake detectors encountered challenges in identifying high-quality synthetic images. However, while demonstrating reasonable detection performance, they lack localization functionality and explainability in detection results. Additionally, the unstable robustness of watermarks can significantly affect the detection performance. In this study, we propose novel fractal watermarks for proactive Deepfake detection and localization, namely FractalForensics. Benefiting from the characteristics of fractals, we devise a parameter-driven watermark generation pipeline that derives fractal-based watermarks and performs one-way encryption of the selected parameters. Subsequently, we propose a semi-fragile watermarking framework for watermark embedding and recovery, trained to be robust against benign image processing operations and fragile when facing Deepfake manipulations in a black-box setting. Moreover, we introduce an entry-to-patch strategy that implicitly embeds the watermark matrix entries into image patches at corresponding positions, achieving localization of Deepfake manipulations. Extensive experiments demonstrate satisfactory robustness and fragility of our approach against common image processing operations and Deepfake manipulations, outperforming state-of-the-art semi-fragile watermarking algorithms and passive detectors for Deepfake detection. Furthermore, by highlighting the areas manipulated, our method provides explainability for the proactive Deepfake detection results.
