Is LLMs Hallucination Usable? LLM-based Negative Reasoning for Fake News Detection
Chaowei Zhang, Zongling Feng, Zewei Zhang, Jipeng Qiang, Guandong Xu, Yun Li
TL;DR
The study investigates whether LLM knowledge hallucination can be harnessed to generate negative reasoning to aid fake news detection. It introduces SR^3 to supervise LLM outputs toward high-quality positive and negative reasoning, and builds NRFE, a dual-encoder model that learns semantic consistency between news and reasoning, along with NRFE-D, a distillation-based student that operates on news content alone. Across three datasets, NRFE-D achieves state-of-the-art performance, surpassing prompting-based LLM approaches, fine-tuned SLMs, and other methods. The work demonstrates a novel use of hallucinations as adversarial signals to improve robustness and accuracy in fake news detection, with potential for further exploration using additional LLMs and multi-agent setups.
Abstract
The questionable responses caused by knowledge hallucination may lead to LLMs' unstable ability in decision-making. However, it has never been investigated whether the LLMs' hallucination is possibly usable to generate negative reasoning for facilitating the detection of fake news. This study proposes a novel supervised self-reinforced reasoning rectification approach - SR$^3$ that yields both common reasonable reasoning and wrong understandings (negative reasoning) for news via LLMs reflection for semantic consistency learning. Upon that, we construct a negative reasoning-based news learning model called - \emph{NRFE}, which leverages positive or negative news-reasoning pairs for learning the semantic consistency between them. To avoid the impact of label-implicated reasoning, we deploy a student model - \emph{NRFE-D} that only takes news content as input to inspect the performance of our method by distilling the knowledge from \emph{NRFE}. The experimental results verified on three popular fake news datasets demonstrate the superiority of our method compared with three kinds of baselines including prompting on LLMs, fine-tuning on pre-trained SLMs, and other representative fake news detection methods.
