LLM-GAN: Construct Generative Adversarial Network Through Large Language Models For Explainable Fake News Detection
Yifeng Wang, Zhouhong Gu, Siwei Zhang, Suhang Zheng, Tao Wang, Tianyu Li, Hongwei Feng, Yanghua Xiao
TL;DR
The paper tackles explainable fake news detection by addressing two core challenges: LLMs can be misled by plausible fake news, and explanations can be unreliable. It introduces LLM-GAN, a prompting-based framework in which an LLM acts as both Generator and Detector, trained through inter-adversary prompting and refined via self-reflection prompting to produce accurate predictions with human-friendly explanations. The approach leverages real-news augmentation, adversarial data generation, and autonomous self-improvement, achieving state-of-the-art performance on Weibo21 and GossipCop while delivering higher-quality explanations as assessed by GPT-4o. The authors further demonstrate practical impact by integrating LLM-GAN into a cloud-native platform, enabling scalable, explainable fake news detection in real-world settings.
Abstract
Explainable fake news detection predicts the authenticity of news items with annotated explanations. Today, Large Language Models (LLMs) are known for their powerful natural language understanding and explanation generation abilities. However, presenting LLMs for explainable fake news detection remains two main challenges. Firstly, fake news appears reasonable and could easily mislead LLMs, leaving them unable to understand the complex news-faking process. Secondly, utilizing LLMs for this task would generate both correct and incorrect explanations, which necessitates abundant labor in the loop. In this paper, we propose LLM-GAN, a novel framework that utilizes prompting mechanisms to enable an LLM to become Generator and Detector and for realistic fake news generation and detection. Our results demonstrate LLM-GAN's effectiveness in both prediction performance and explanation quality. We further showcase the integration of LLM-GAN to a cloud-native AI platform to provide better fake news detection service in the cloud.
