LLM-based Few-Shot Early Rumor Detection with Imitation Agent
Fengzhu Zeng, Qian Shao, Ling Cheng, Wei Gao, Shih-Fen Cheng, Jing Ma, Cheng Niu
TL;DR
The paper tackles early rumor detection under data scarcity by proposing a decoupled framework that uses a lightweight autonomous agent to determine when to invoke a powerful LLM detector. EARD is formulated as an MDP and learned via imitation learning using three expert trajectories (Conservative, Early-Action, Misleading) to optimize timing while avoiding false starts. Across four real-world datasets and multiple LLMs, the approach improves macro-F1 and reduces Early Rate compared to existing EARD methods and LLM-only baselines, demonstrating robust generalization to unseen events and resilience to limited supervision. The framework achieves cost-efficient, timely, and accurate rumor detection by minimizing LLM inferences to a single, strategically-timed prediction.
Abstract
Early Rumor Detection (EARD) aims to identify the earliest point at which a claim can be accurately classified based on a sequence of social media posts. This is especially challenging in data-scarce settings. While Large Language Models (LLMs) perform well in few-shot NLP tasks, they are not well-suited for time-series data and are computationally expensive for both training and inference. In this work, we propose a novel EARD framework that combines an autonomous agent and an LLM-based detection model, where the agent acts as a reliable decision-maker for \textit{early time point determination}, while the LLM serves as a powerful \textit{rumor detector}. This approach offers the first solution for few-shot EARD, necessitating only the training of a lightweight agent and allowing the LLM to remain training-free. Extensive experiments on four real-world datasets show our approach boosts performance across LLMs and surpasses existing EARD methods in accuracy and earliness.
