Transcending Forgery Specificity with Latent Space Augmentation for Generalizable Deepfake Detection
Zhiyuan Yan, Yuhao Luo, Siwei Lyu, Qingshan Liu, Baoyuan Wu
TL;DR
This work tackles the generalization gap in deepfake detection by addressing forgery-specific overfitting. It introduces LSDA, a latent-space data augmentation framework that enlarges the forgery space through within-domain transformations (centrifugal, affine, additive) and cross-domain Mixup, coupled with a teacher-student distillation architecture and an ArcFace-informed real-feature prior. The method yields a generalizable detector by distilling augmented forgery representations into a single student encoder with a binary classifier, optimized via Domain, Distillation, and Binary losses. Extensive experiments across FF++ and unseen datasets show improved cross-dataset performance and robustness, outperforming state-of-the-art methods and RGB-based augmentations, with insightful visualizations corroborating the broader, more transferable detection capabilities. Overall, latent-space augmentation offers a scalable, less artifact-dependent path to robust deepfake detection.
Abstract
Deepfake detection faces a critical generalization hurdle, with performance deteriorating when there is a mismatch between the distributions of training and testing data. A broadly received explanation is the tendency of these detectors to be overfitted to forgery-specific artifacts, rather than learning features that are widely applicable across various forgeries. To address this issue, we propose a simple yet effective detector called LSDA (\underline{L}atent \underline{S}pace \underline{D}ata \underline{A}ugmentation), which is based on a heuristic idea: representations with a wider variety of forgeries should be able to learn a more generalizable decision boundary, thereby mitigating the overfitting of method-specific features (see Fig.~\ref{fig:toy}). Following this idea, we propose to enlarge the forgery space by constructing and simulating variations within and across forgery features in the latent space. This approach encompasses the acquisition of enriched, domain-specific features and the facilitation of smoother transitions between different forgery types, effectively bridging domain gaps. Our approach culminates in refining a binary classifier that leverages the distilled knowledge from the enhanced features, striving for a generalizable deepfake detector. Comprehensive experiments show that our proposed method is surprisingly effective and transcends state-of-the-art detectors across several widely used benchmarks.
