Have LLMs Reopened the Pandora's Box of AI-Generated Fake News?
Xinyu Wang, Wenbo Zhang, Sai Koneru, Hangzhi Guo, Bonam Mingole, S. Shyam Sundar, Sarah Rajtmajer, Amulya Yadav
TL;DR
The paper investigates whether LLMs reopen the Pandora's box of AI-generated fake news by running a university-scale two-phase competition that analyzes both generation and detection, including human versus LLM performance and the role of visuals. It finds that while LLMs are adept at identifying real news, both humans and LLMs struggle with AI-generated fake content, especially in collaborative human–AI generation scenarios. The work also uncovers topic- and prompt-dependent variability in detection, with visuals offering only modest gains and processing mode influencing outcomes. These findings underscore the need for robust, multi-modal, topic-aware detection strategies and careful governance of AI-assisted content creation.
Abstract
With the rise of AI-generated content spewed at scale from large language models (LLMs), genuine concerns about the spread of fake news have intensified. The perceived ability of LLMs to produce convincing fake news at scale poses new challenges for both human and automated fake news detection systems. To address this gap, this paper presents the findings from a university-level competition that aimed to explore how LLMs can be used by humans to create fake news, and to assess the ability of human annotators and AI models to detect it. A total of 110 participants used LLMs to create 252 unique fake news stories, and 84 annotators participated in the detection tasks. Our findings indicate that LLMs are ~68% more effective at detecting real news than humans. However, for fake news detection, the performance of LLMs and humans remains comparable (~60% accuracy). Additionally, we examine the impact of visual elements (e.g., pictures) in news on the accuracy of detecting fake news stories. Finally, we also examine various strategies used by fake news creators to enhance the credibility of their AI-generated content. This work highlights the increasing complexity of detecting AI-generated fake news, particularly in collaborative human-AI settings.
