Transfer Learning of Real Image Features with Soft Contrastive Loss for Fake Image Detection
Ziyou Liang, Weifeng Liu, Run Wang, Mengjie Wu, Boheng Li, Yuyang Zhang, Lina Wang, Xinyi Yang
TL;DR
This work addresses the challenge of detecting fake images forged by unknown models by shifting focus from model-specific artifacts to stable real-image characteristics called natural traces. It introduces Natural Trace Forensics (NTF), a two-stage framework that first learns homogeneous real-trace representations via self-supervised mapping and then applies soft-contrastive transfer learning to align real traces while separating fake images. The method leverages a diverse dataset comprising 6 GANs, 6 diffusion models, and multi-step forgeries to demonstrate strong generalization, achieving a mean average precision of $96.2\%$ and commercial-model accuracy above $78.4\%$, with robustness to common image transformations. This provides a practical, model-agnostic approach to fake image detection, highlighting the value of stable real-image traces over continually shifting artifact patterns.
Abstract
In the last few years, the artifact patterns in fake images synthesized by different generative models have been inconsistent, leading to the failure of previous research that relied on spotting subtle differences between real and fake. In our preliminary experiments, we find that the artifacts in fake images always change with the development of the generative model, while natural images exhibit stable statistical properties. In this paper, we employ natural traces shared only by real images as an additional target for a classifier. Specifically, we introduce a self-supervised feature mapping process for natural trace extraction and develop a transfer learning based on soft contrastive loss to bring them closer to real images and further away from fake ones. This motivates the detector to make decisions based on the proximity of images to the natural traces. To conduct a comprehensive experiment, we built a high-quality and diverse dataset that includes generative models comprising GANs and diffusion models, to evaluate the effectiveness in generalizing unknown forgery techniques and robustness in surviving different transformations. Experimental results show that our proposed method gives 96.2% mAP significantly outperforms the baselines. Extensive experiments conducted on popular commercial platforms reveal that our proposed method achieves an accuracy exceeding 78.4%, underscoring its practicality for real-world application deployment.
