A Graph-based Verification Framework for Fact-Checking
Yani Huang, Richong Zhang, Zhijie Nie, Junfan Chen, Xuefeng Zhang
TL;DR
GraphFC introduces a graph-based fact-checking framework that reformulates claims as a bipartite triplet graph and grounds evidence via corresponding graphs. Its three components—graph construction, graph-guided planning, and graph-guided checking—enable fine-grained, context-preserving verification and improved handling of unknown entities through planning-driven grounding and completion. Across HOVER, FEVEROUS, and SciFact, GraphFC yields state-of-the-art macro-F1, with particular strength on multi-hop reasoning and robustness in Open Book settings. The work highlights the importance of structured evidence and planning in complex verification tasks, while acknowledging limitations in simple-claim gains and sensitivity to the graph-construction model, suggesting avenues for lighter-weight components and efficiency optimizations.
Abstract
Fact-checking plays a crucial role in combating misinformation. Existing methods using large language models (LLMs) for claim decomposition face two key limitations: (1) insufficient decomposition, introducing unnecessary complexity to the verification process, and (2) ambiguity of mentions, leading to incorrect verification results. To address these challenges, we suggest introducing a claim graph consisting of triplets to address the insufficient decomposition problem and reduce mention ambiguity through graph structure. Based on this core idea, we propose a graph-based framework, GraphFC, for fact-checking. The framework features three key components: graph construction, which builds both claim and evidence graphs; graph-guided planning, which prioritizes the triplet verification order; and graph-guided checking, which verifies the triples one by one between claim and evidence graphs. Extensive experiments show that GraphFC enables fine-grained decomposition while resolving referential ambiguities through relational constraints, achieving state-of-the-art performance across three datasets.
