Individualized Deepfake Detection Exploiting Traces Due to Double Neural-Network Operations
Mushfiqur Rahman, Runze Liu, Chau-Wai Wong, Huaiyu Dai
TL;DR
This work targets deepfake detection tailored to specific public figures by conditioning on identity and exploiting the near-idempotence of neural-network deepfake generators. It introduces a reconstruction operator $R$, an identity-aware feature extractor, an identity decoder, and a Siamese classifier trained with contrastive learning to distinguish authentic from deepfake images via changes in feature space after sequential processing. Empirical results on a curated dataset of 45 identities show a mean AUC of $0.940$ with reduced variance, outperforming strong baselines and demonstrating the value of identity conditioning and the idempotence principle. The approach offers a practical, journalist-friendly tool for verifying claims about public figures and suggests avenues for generalization across deepfake operators and broader datasets.
Abstract
In today's digital landscape, journalists urgently require tools to verify the authenticity of facial images and videos depicting specific public figures before incorporating them into news stories. Existing deepfake detectors are not optimized for this detection task when an image is associated with a specific and identifiable individual. This study focuses on the deepfake detection of facial images of individual public figures. We propose to condition the proposed detector on the identity of an identified individual, given the advantages revealed by our theory-driven simulations. While most detectors in the literature rely on perceptible or imperceptible artifacts present in deepfake facial images, we demonstrate that the detection performance can be improved by exploiting the idempotency property of neural networks. In our approach, the training process involves double neural-network operations where we pass an authentic image through a deepfake simulating network twice. Experimental results show that the proposed method improves the area under the curve (AUC) from 0.92 to 0.94 and reduces its standard deviation by 17%. To address the need for evaluating detection performance for individual public figures, we curated and publicly released a dataset of ~32k images featuring 45 public figures, as existing deepfake datasets do not meet this criterion.
