TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection
Hui Liu, Wenya Wang, Haoru Li, Haoliang Li
TL;DR
Teller addresses the trustworthiness gap in fake news detection by decoupling the problem into a cognition system that encodes human expertise into interpretable predicates and a decision system that learns generalizable rules through a neural-symbolic DNF Layer. The cognition system uses question templates and logic atoms answered by LLMs and tools, while the decision system aggregates these atoms into domain-invariant rules, enabling robust cross-domain performance and human controllability. Across four datasets and multiple LLMs, Teller demonstrates strong accuracy and macro-F1, with explicit explainability via extractable rules and pruning, and controllability through rule adjustments and cognition-system interventions. This framework has practical impact for trustworthy deployment of fake news detectors by combining scalable AI with transparent reasoning and human oversight, reducing reliance on opaque models. Future work may enhance the semantic grounding of predicates and broaden the decision model to further improve interpretability, controllability, and lifecyle trustworthiness.
Abstract
The proliferation of fake news has emerged as a severe societal problem, raising significant interest from industry and academia. While existing deep-learning based methods have made progress in detecting fake news accurately, their reliability may be compromised caused by the non-transparent reasoning processes, poor generalization abilities and inherent risks of integration with large language models (LLMs). To address this challenge, we propose {\methodname}, a novel framework for trustworthy fake news detection that prioritizes explainability, generalizability and controllability of models. This is achieved via a dual-system framework that integrates cognition and decision systems, adhering to the principles above. The cognition system harnesses human expertise to generate logical predicates, which guide LLMs in generating human-readable logic atoms. Meanwhile, the decision system deduces generalizable logic rules to aggregate these atoms, enabling the identification of the truthfulness of the input news across diverse domains and enhancing transparency in the decision-making process. Finally, we present comprehensive evaluation results on four datasets, demonstrating the feasibility and trustworthiness of our proposed framework. Our implementation is available at \url{https://github.com/less-and-less-bugs/Trust_TELLER}.
