Model Discrepancy Learning: Synthetic Faces Detection Based on Multi-Reconstruction
Qingchao Jiang, Zhishuo Xu, Zhiying Zhu, Ning Chen, Haoyue Wang, Zhongjie Ba
TL;DR
The paper tackles the challenge of detecting synthetic faces generated by GANs and diffusion models by analyzing reconstruction discrepancies across generation techniques. It introduces a Multi-Reconstruction-based Detector that inverts and reconstructs inputs with both GAN and diffusion models, then classifies using cascaded original and reconstructed images, achieving strong generalization to unseen generators. To support this line of work, the authors present the Asian Synthetic Face Dataset (ASFD), addressing underrepresentation of Asian populations in existing datasets. Empirical results demonstrate superior accuracy and robustness compared with existing detectors, including across cross-dataset scenarios and perturbations, highlighting the practical value of model-discrepancy-aware detection for real-world security tasks.
Abstract
Advances in image generation enable hyper-realistic synthetic faces but also pose risks, thus making synthetic face detection crucial. Previous research focuses on the general differences between generated images and real images, often overlooking the discrepancies among various generative techniques. In this paper, we explore the intrinsic relationship between synthetic images and their corresponding generation technologies. We find that specific images exhibit significant reconstruction discrepancies across different generative methods and that matching generation techniques provide more accurate reconstructions. Based on this insight, we propose a Multi-Reconstruction-based detector. By reversing and reconstructing images using multiple generative models, we analyze the reconstruction differences among real, GAN-generated, and DM-generated images to facilitate effective differentiation. Additionally, we introduce the Asian Synthetic Face Dataset (ASFD), containing synthetic Asian faces generated with various GANs and DMs. This dataset complements existing synthetic face datasets. Experimental results demonstrate that our detector achieves exceptional performance, with strong generalization and robustness.
