Navigating the Noisy Crowd: Finding Key Information for Claim Verification
Haisong Gong, Huanhuan Ma, Qiang Liu, Shu Wu, Liang Wang
TL;DR
The paper tackles claim verification under noisy evidence by introducing EACon, a framework that first abstracts evidence through keyword-guided extraction and summarization, then deconstructs claims into subclaims for independent verification. It combines Evidence Abstraction, Claim Deconstruction, and Subclaim Verification to reduce noise and improve LLM-based verification, using fuzzy keyword matching and selective summarization to guide evidence processing. Empirical results on HOVER and FEVEROUS-S with Vicuna-13B and Mixtral-8x7B show consistent Macro-F1 gains and robust ablation evidence for the two core components, with larger gains on more complex, multi-hop tasks. The work demonstrates a practical strategy to leverage LLMs for reliable claim verification in noisy real-world information environments, offering a scalable approach that can generalize across datasets and backbones.
Abstract
Claim verification is a task that involves assessing the truthfulness of a given claim based on multiple evidence pieces. Using large language models (LLMs) for claim verification is a promising way. However, simply feeding all the evidence pieces to an LLM and asking if the claim is factual does not yield good results. The challenge lies in the noisy nature of both the evidence and the claim: evidence passages typically contain irrelevant information, with the key facts hidden within the context, while claims often convey multiple aspects simultaneously. To navigate this "noisy crowd" of information, we propose EACon (Evidence Abstraction and Claim Deconstruction), a framework designed to find key information within evidence and verify each aspect of a claim separately. EACon first finds keywords from the claim and employs fuzzy matching to select relevant keywords for each raw evidence piece. These keywords serve as a guide to extract and summarize critical information into abstracted evidence. Subsequently, EACon deconstructs the original claim into subclaims, which are then verified against both abstracted and raw evidence individually. We evaluate EACon using two open-source LLMs on two challenging datasets. Results demonstrate that EACon consistently and substantially improve LLMs' performance in claim verification.
