Explore the Potential of LLMs in Misinformation Detection: An Empirical Study
Mengyang Chen, Lingwei Wei, Han Cao, Wei Zhou, Songlin Hu
TL;DR
This paper systematically investigates the potential of large language models for misinformation detection, separating evaluation into LLM-based detectors (direct prompting) and LLM-enhanced detectors (embedding and data augmentation). It demonstrates that LLMs can match or exceed small models on text-based misinformation when prompted effectively, but struggle with propagation-aware detection due to graph-structured data. The study shows that LLM-enhanced detectors, using LLM embeddings or generated data, often improve performance over traditional baselines, with notable gains from Chinese LLMs on Chinese datasets. Together, these findings highlight the practical value and limitations of deploying LLMs for misinformation detection and point to promising directions for integrating LLMs with existing detectors.
Abstract
Large Language Models (LLMs) have garnered significant attention for their powerful ability in natural language understanding and reasoning. In this paper, we present a comprehensive empirical study to explore the performance of LLMs on misinformation detection tasks. This study stands as the pioneering investigation into the understanding capabilities of multiple LLMs regarding both content and propagation across social media platforms. Our empirical studies on eight misinformation detection datasets show that LLM-based detectors can achieve comparable performance in text-based misinformation detection but exhibit notably constrained capabilities in comprehending propagation structure compared to existing models in propagation-based misinformation detection. Our experiments further demonstrate that LLMs exhibit great potential to enhance existing misinformation detection models. These findings highlight the potential ability of LLMs to detect misinformation.
