Can ChatGPT Detect DeepFakes? A Study of Using Multimodal Large Language Models for Media Forensics
Shan Jia, Reilin Lyu, Kangran Zhao, Yize Chen, Zhiyuan Yan, Yan Ju, Chuanbo Hu, Xin Li, Baoyuan Wu, Siwei Lyu
TL;DR
Problem: can multimodal LLMs detect DeepFakes without programming? Approach: evaluate GPT4V and Gemini 1.0 Pro Vision on the DF3 dataset using targeted prompts that elicit a Yes/No answer, a probability, and an artifact-based explanation; responses are averaged over multiple queries per image to compute AUC and rejection rates. Findings: GPT4V achieves an AUC of approximately 0.78–0.80 on raw and post-processed AI-generated faces, outperforming Gemini on this task, while its judgments on real faces are less reliable and its decisions are driven largely by semantic cues rather than signal-level artifacts. Significance: the results demonstrate the potential of vision-enabled LLMs as intuitive forensics tools, while highlighting their limitations (semantic reliance, robustness to post-processing, API restrictions) and the need to combine prompting improvements with traditional signal-based detection for robust DeepFake detection.
Abstract
DeepFakes, which refer to AI-generated media content, have become an increasing concern due to their use as a means of disinformation. DeepFake detection currently relies on machine learning algorithms that must be explicitly programmed. In this work, we investigate the capabilities of multimodal large language models (LLMs) in DeepFake detection. We conducted qualitative and quantitative experiments showing that multimodal LLMs can expose AI-generated images through careful experimental design and prompt engineering. This is notable because LLMs are not inherently tailored for media forensic tasks, and the process does not require programming. We discuss the limitations of multimodal LLMs for these tasks and suggest possible improvements.
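The evaluation summarized in the TL;DR (several queries per image, averaged into a score, then AUC and a rejection rate) can be illustrated with a minimal sketch. The details here are assumptions, not the authors' code: each response is taken to be 1 ("Yes, AI-generated"), 0 ("No"), or None when the model declines to answer; the function names and the toy data are hypothetical; and the AUC is computed with the standard pairwise-ranking definition.

```python
from statistics import mean

def aggregate(responses):
    """Average the usable answers for one image; None marks a rejected query."""
    usable = [r for r in responses if r is not None]
    return mean(usable) if usable else None  # None if every query was rejected

def rejection_rate(all_responses):
    """Fraction of all queries that the model declined to answer."""
    total = sum(len(r) for r in all_responses)
    rejected = sum(1 for r in all_responses for x in r if x is None)
    return rejected / total if total else 0.0

def auc(scores, labels):
    """AUC as the probability that a fake (label 1) outscores a real (label 0).

    Ties between a fake and a real count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one fake and one real image")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

if __name__ == "__main__":
    # Hypothetical toy data: five queries per image (assumed count).
    responses = [
        [1, 1, None, 1, 0],   # AI-generated face
        [0, 0, 0, 1, None],   # real face
        [1, 0, 1, 1, 1],      # AI-generated face
        [0, 0, 0, 0, 0],      # real face
    ]
    labels = [1, 0, 1, 0]

    scores = [aggregate(r) for r in responses]
    # Drop images for which every query was rejected before scoring.
    kept = [(s, y) for s, y in zip(scores, labels) if s is not None]
    print("AUC:", auc([s for s, _ in kept], [y for _, y in kept]))
    print("Rejection rate:", rejection_rate(responses))
```

Averaging repeated queries turns a binary Yes/No answer into a graded score, which is what makes a threshold-free metric such as AUC applicable to a model that otherwise returns only hard decisions.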
