Breaking Event Rumor Detection via Stance-Separated Multi-Agent Debate
Mingqing Zhang, Haisong Gong, Qiang Liu, Shu Wu, Liang Wang
TL;DR
This work tackles rumor detection during breaking events, where data scarcity and controversy hinder traditional methods. It introduces S2MAD, a stance-separated multi-agent debate framework that uses LLMs to first segregate comments into supportive and opposing sets, then generates initial opinions with prompts tailored to whether the claim is subjective or objective, followed by a two-agent debate and a judge when needed to decide veracity. The framework is validated in zero-shot experiments on Twitter-COVID19 and Weibo-COVID19, showing consistent gains over strong baselines and highlighting the importance of stance separation, claim categorization, and debate. The approach demonstrates practical value for rapid, diverse, and robust rumor verification in breaking news contexts.
Abstract
The rapid spread of rumors on social media platforms during breaking events severely hinders the dissemination of the truth. Previous studies reveal that the lack of annotated resources hinders the direct detection of unforeseen breaking events not covered in yesterday's news. Leveraging large language models (LLMs) for rumor detection holds significant promise. However, it is challenging for LLMs to provide comprehensive responses to complex or controversial issues due to limited diversity. In this work, we propose the Stance Separated Multi-Agent Debate (S2MAD) to address this issue. Specifically, we firstly introduce Stance Separation, categorizing comments as either supporting or opposing the original claim. Subsequently, claims are classified as subjective or objective, enabling agents to generate reasonable initial viewpoints with different prompt strategies for each type of claim. Debaters then follow specific instructions through multiple rounds of debate to reach a consensus. If a consensus is not reached, a judge agent evaluates the opinions and delivers a final verdict on the claim's veracity. Extensive experiments conducted on two real-world datasets demonstrate that our proposed model outperforms state-of-the-art methods in terms of performance and effectively improves the performance of LLMs in breaking event rumor detection.
