Improving the fact-checking performance of language models by relying on their entailment ability

Gaurav Kumar; Debajyoti Mazumder; Ayush Garg; Jasabanta Patro

TL;DR

This work introduces an entailment-based fact-checking framework in which entailed justifications produced by generative language models (GLMs) are used to train encoder-only language models (ELMs) for veracity prediction. The pipeline has three steps: entailment classification of the evidence, generation of supporting and refuting justifications, and conditional veracity prediction with an ELM. The authors report substantial improvements over strong baselines, especially when training on entailed justifications. Across three training-based and four inference-based experiments on the LIAR-RAW and RAW-FC datasets, they show that entailed explanations improve both accuracy and interpretability, while ablations and linguistic analyses examine the role of justification content and of attention to evidence. The findings bear on deploying scalable, explainable fact-checking systems, and the authors point to multilingual generalization and open-domain evidence retrieval as future work.
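
The three steps above can be sketched as a small data flow. This is only an illustration under stated assumptions, not the authors' implementation: the label names (`support`, `refute`, `neutral`), the `[SEP]`-joined input layout, and every function name here are hypothetical, and the toy classifier and generator merely stand in for the GLM calls. The fine-tuned ELM classifier itself is left out; the sketch stops at building its input.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Justifications:
    supporting: str
    refuting: str


def split_by_entailment(claim: str, evidence: List[str],
                        classify: Callable[[str, str], str]) -> Dict[str, List[str]]:
    """Step 1: bucket each evidence sentence by its entailment label."""
    # Assumed label set; the source only names supporting/refuting outputs.
    buckets: Dict[str, List[str]] = {"support": [], "refute": [], "neutral": []}
    for sentence in evidence:
        buckets[classify(claim, sentence)].append(sentence)
    return buckets


def generate_justifications(claim: str, buckets: Dict[str, List[str]],
                            generate: Callable[[str, List[str], str], str]) -> Justifications:
    """Step 2: produce one supporting and one refuting justification."""
    return Justifications(
        supporting=generate(claim, buckets["support"], "support"),
        refuting=generate(claim, buckets["refute"], "refute"),
    )


def build_elm_input(claim: str, just: Justifications, sep: str = " [SEP] ") -> str:
    """Step 3 (input only): text an encoder-only classifier would consume."""
    # Assumed layout: claim, then supporting, then refuting justification.
    return sep.join([claim, just.supporting, just.refuting])


# Toy stand-ins for the GLM calls (hypothetical, keyword-based).
def toy_classify(claim: str, sentence: str) -> str:
    words = sentence.lower().split()
    if "not" in words or "false" in words:
        return "refute"
    if "confirmed" in words:
        return "support"
    return "neutral"


def toy_generate(claim: str, sentences: List[str], stance: str) -> str:
    return f"{stance}: " + (" ".join(sentences) if sentences else "none")
```

Passing the classifier and generator in as callables keeps the sketch independent of any particular model or API; a real system would replace the toy functions with prompts to a GLM and feed the string from `build_elm_input` to a fine-tuned ELM.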

Abstract

Automated fact-checking has been a challenging task for the research community. Past works have tried various strategies, such as end-to-end training, retrieval-augmented generation, and prompt engineering, to build robust fact-checking systems. However, their accuracy has not been high enough for real-world deployment. We propose a simple yet effective strategy in which entailed justifications generated by LLMs are used to train encoder-only language models (ELMs) for fact-checking. We conducted a rigorous set of experiments, comparing our approach with recent works and with various prompting and fine-tuning strategies, to demonstrate its superiority. Additionally, we conducted a quality analysis of model explanations, ablation studies, and an error analysis to provide a comprehensive understanding of our approach.
