Mining the Explainability and Generalization: Fact Verification Based on Self-Instruction
Guangyao Lu, Yulin Liu
TL;DR
This work tackles explainable fact-checking with open-source LLMs by introducing a self-instruction based fine-tuning framework that combines 2D data augmentation and improved DPO fine-tuning. Using Llama2-7B, it jointly models claim veracity and explanations on FEVEROUS and HOVER, achieving competitive accuracy with a fraction of trainable parameters and fluent explanations. The approach leverages counterfactual data and a difficulty-based sampling strategy to enhance generalization across challenging, multi-hop tasks, while addressing data privacy concerns. It demonstrates the practicality of privacy-preserving fact-checking in real-world settings, though it acknowledges limitations in suppressing hallucinations and multi-hop explanation quality, suggesting future distillation and open-world extensions.
Abstract
Fact-checking based on commercial LLMs has become mainstream. Although these methods offer high explainability, they fall short in accuracy compared to traditional fine-tuning approaches, and data security is also a significant concern. In this paper, we propose a self-instruction-based fine-tuning approach for fact-checking that balances accuracy and explainability. Our method consists of Data Augmentation and Improved DPO fine-tuning. The former first instructs the model to generate both positive and negative explanations from claim-evidence pairs and their labels, then samples the dataset according to our customized difficulty criteria. The latter uses our proposed improved DPO to fine-tune the model on the generated samples. We fine-tune the smallest-scale LLaMA-7B model and evaluate it on the challenging fact-checking datasets FEVEROUS and HOVER, comparing against four fine-tuning methods and three few-shot learning methods. The experiments demonstrate that our approach not only achieves accuracy comparable to, or even surpassing, traditional fine-tuning methods, but also generates fluent explanation text. Moreover, it exhibits strong generalization performance. Our method is the first to leverage self-supervised learning for fact-checking, and it combines contrastive learning with improved DPO in fine-tuning LLMs, as shown in the experiments.
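
For orientation, the preference objective that the fine-tuning stage builds on can be sketched as follows. This is a minimal sketch of the standard DPO loss for a single (positive explanation, negative explanation) pair, not the improved variant proposed here, whose modifications are not specified in the abstract. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (illustrative sketch).

    Each argument is the summed token log-probability of a full response:
    'chosen' is the positive explanation, 'rejected' the negative one,
    scored by the trainable policy and by a frozen reference model.
    All names and the default beta are assumptions, not from the paper.
    """
    # How much more (in log space) the policy favors each response
    # than the reference model does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin between the positive and negative response.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))
```

When the policy and the reference model agree on both responses, the margin is zero and the loss equals log 2; the loss decreases as the policy assigns relatively more probability to the positive explanation than to the negative one.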
