A Sanity Check for Multi-In-Domain Face Forgery Detection in the Real World
Jikang Cheng, Renye Yan, Zhiyuan Yan, Yaozhong Gan, Xueyi Zhang, Zhongyuan Wang, Wei Peng, Ling Liang
TL;DR
The paper introduces Multi-In-Domain Face Forgery Detection (MID-FFD), a realistic setting in which a detector is trained on many real-fake domains and must then give a definitive real/fake judgment for each frame independently, without being told which domain the frame comes from. It proposes DevDet, a two-stage, model-agnostic framework that first amplifies forgery cues with a Face Forgery Developer (FFDev) and then adapts the detector with Dose-Adaptive Fine-Tuning (DAFT), which uses a DoseDict to maintain generalization. Experiments show improved real/fake discrimination in domain-unspecified scenarios while preserving performance on unseen data, addressing limitations of prior generalizable and incremental approaches. The components (FFDev, DAFT, DoseDict) can be plugged into existing detectors, suggesting a viable path toward reliable real-world deployment.
Abstract
Existing methods for deepfake detection aim to develop generalizable detectors. Although generalizability is the ultimate goal, it appears idealistic to expect a detector trained on limited forgeries and domains to cover entirely unseen variations, especially given the diversity of real-world deepfakes. Introducing large-scale multi-domain data for training is therefore both feasible and important for real-world applications. However, in such a multi-domain scenario, the differences between domains, rather than the subtle real/fake distinctions, dominate the feature space. As a result, although detectors can separate real from fake reasonably well within each domain (i.e., high AUC), they struggle with single-image real/fake judgments in domain-unspecified conditions (i.e., low ACC). In this paper, we first define a new research paradigm named Multi-In-Domain Face Forgery Detection (MID-FFD), which includes sufficient volumes of real-fake domains for training. The detector must then provide definitive real/fake judgments for domain-unspecified inputs, simulating the frame-by-frame independent detection scenario of the real world. To address the domain-dominance issue, we propose a model-agnostic framework termed DevDet (Developer for Detector) that amplifies real/fake differences and makes them dominant in the feature space. DevDet consists of a Face Forgery Developer (FFDev) and a Dose-Adaptive detector Fine-Tuning strategy (DAFT). Experiments demonstrate that our method is superior at real/fake prediction under the MID-FFD scenario while maintaining its original generalization ability on unseen data.
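The gap between high per-domain AUC and low domain-unspecified ACC can be illustrated with a small synthetic sketch. This is not the authors' code or data: the three domains, their score means, the noise level, and the 0.5 decision threshold are all hypothetical values chosen only so that the shift between domains is much larger than the real/fake gap within a domain.

```python
import numpy as np

def auc(scores_real, scores_fake):
    """Probability that a random fake outscores a random real (ties count half)."""
    diff = scores_fake[:, None] - scores_real[None, :]
    return float((diff > 0).mean() + 0.5 * (diff == 0).mean())

def accuracy(scores_real, scores_fake, threshold=0.5):
    """Accuracy under one fixed threshold: score > threshold is predicted 'fake'."""
    correct = (scores_real <= threshold).sum() + (scores_fake > threshold).sum()
    return float(correct / (len(scores_real) + len(scores_fake)))

rng = np.random.default_rng(0)
n = 1000  # samples per class per domain (assumed)

# Hypothetical domains: (mean score of real faces, mean score of fake faces).
# The domain offset (about 0.35) dwarfs the real/fake gap (0.10).
domains = {"A": (0.10, 0.20), "B": (0.45, 0.55), "C": (0.80, 0.90)}
scores = {
    name: (rng.normal(m_real, 0.02, n), rng.normal(m_fake, 0.02, n))
    for name, (m_real, m_fake) in domains.items()
}

# Within each domain, real and fake are ranked almost perfectly.
per_domain_auc = {name: auc(r, f) for name, (r, f) in scores.items()}

# Without a domain label, one threshold must serve all domains at once.
all_real = np.concatenate([r for r, _ in scores.values()])
all_fake = np.concatenate([f for _, f in scores.values()])
pooled_acc = accuracy(all_real, all_fake, threshold=0.5)

print("per-domain AUC:", per_domain_auc)
print("pooled ACC at threshold 0.5:", pooled_acc)
```

In this toy setup every per-domain AUC is close to 1, yet the pooled accuracy stays far lower, because the single threshold labels every fake in domain A as real and every real face in domain C as fake. Only domain B, which happens to straddle the threshold, is classified well.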
