C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection
Chuangchuang Tan, Renshuai Tao, Huan Liu, Guanghua Gu, Baoyuan Wu, Yao Zhao, Yunchao Wei
TL;DR
This work investigates why CLIP-powered detectors generalize to unseen deepfakes and introduces C2P-CLIP, which injects category concepts into the image encoder through caption enhancement using category prompts. By training with a contrastive objective and a classification loss while freezing the text encoder and using LoRA on the image encoder, the method achieves substantial generalization gains without adding test-time parameters. Empirical results on UniversalFakeDetect and GenImage across 20 generation models show state-of-the-art or near-state-of-the-art performance and robust cross-model transfer, supported by qualitative analyses of logit distributions. The approach provides both a practical, parameter-efficient improvement for universal deepfake detection and insight into how CLIP features drive detection through concept-level matching rather than explicit real/fake semantics.
Abstract
This work focuses on AIGC detection to develop universal detectors capable of identifying various types of forgery images. Recent studies have found large pre-trained models, such as CLIP, are effective for generalizable deepfake detection along with linear classifiers. However, two critical issues remain unresolved: 1) understanding why CLIP features are effective on deepfake detection through a linear classifier; and 2) exploring the detection potential of CLIP. In this study, we delve into the underlying mechanisms of CLIP's detection capabilities by decoding its detection features into text and performing word frequency analysis. Our finding indicates that CLIP detects deepfakes by recognizing similar concepts (Fig. \ref{fig:fig1} a). Building on this insight, we introduce Category Common Prompt CLIP, called C2P-CLIP, which integrates the category common prompt into the text encoder to inject category-related concepts into the image encoder, thereby enhancing detection performance (Fig. \ref{fig:fig1} b). Our method achieves a 12.41\% improvement in detection accuracy compared to the original CLIP, without introducing additional parameters during testing. Comprehensive experiments conducted on two widely-used datasets, encompassing 20 generation models, validate the efficacy of the proposed method, demonstrating state-of-the-art performance. The code is available at \url{https://github.com/chuangchuangtan/C2P-CLIP-DeepfakeDetection}
