Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables
Xuzhao Geng, Haozhao Wang, Jun Wang, Wei Liu, Ruixuan Li
TL;DR
This work tackles persistent hallucinations in retrieval-augmented generation (RAG) by leveraging unlabeled conversation records through active learning to build a high-quality human-preference dataset. It introduces AL4RAG, a diversity-aware sampling framework tailored to the three-field structure of RAG data, and Retrieval-Augmented Similarity (RAS) to measure cross-sample distance more accurately. A novel preference dataset is constructed by labeling hallucinations and pairing model responses with explicit refusals, enabling Direct Preference Optimization (DPO) fine-tuning. Experiments on the RAGTruth dataset across multiple tasks show that AL4RAG and its Ras-enhanced variant consistently outperform baselines, especially under limited annotation budgets, demonstrating practical potential for safer and more reliable RAG systems.
Abstract
Retrieval-augmented generation (RAG) is a key technique for leveraging external knowledge and reducing hallucinations in large language models (LLMs). However, RAG still struggles to fully prevent hallucinated responses. To address this, it is essential to identify samples prone to hallucination or guide LLMs toward correct responses, which experts then annotate to develop high-quality datasets for refining LLMs. However, the growing scarcity of such datasets makes their creation challenging. This paper proposes using the vast amount of conversations from widespread LLM usage to build these datasets, training LLMs to avoid hallucination-prone questions while accurately responding to manageable ones. Given the impracticality of expert-annotating all conversation records, the paper introduces AL4RAG, which uses active learning to select the most suitable conversation samples for annotation, optimizing performance within an annotation budget. Additionally, recognizing that traditional active learning methods are not fully compatible with RAG due to unsuitable distance metrics, we develop a novel sample distance measurement for RAG active learning. Extensive experiments show that our method consistently outperforms baselines across multiple metrics.
