Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval
Mengjia Niu, Hao Li, Jie Shi, Hamed Haddadi, Fan Mo
TL;DR
The paper tackles hallucinations in medical QA by introducing Re-KGR, a post-generation framework that identifies high-risk tokens via token-level uncertainty, extracts and refines factual triples, retrieves corroborating KG evidence, and rectifies content to boost truthfulness. By combining entropy and layer-divergence signals with selective triple extraction, it reduces the number of necessary KG lookups while maintaining or improving factual accuracy. Evaluations on MedQuAD with LLaMA-7b and DoLa show notable improvements in truthfulness (up to 0.610 on GPT-4 scoring) and substantial reductions in retrieval time (63–75%), demonstrating practical efficiency gains. The approach emphasizes minimal retraining, domain-specific KG integration, and potential generalization to other high-stakes domains where reliable external knowledge is available.
Abstract
Large language models (LLMs) have demonstrated remarkable capabilities across various domains, although their susceptibility to hallucination poses significant challenges for their deployment in critical areas such as healthcare. To address this issue, retrieving relevant facts from knowledge graphs (KGs) is considered a promising method. Existing KG-augmented approaches tend to be resource-intensive, requiring multiple rounds of retrieval and verification for each factoid, which impedes their application in real-world scenarios. In this study, we propose Self-Refinement-Enhanced Knowledge Graph Retrieval (Re-KGR) to augment the factuality of LLMs' responses with less retrieval efforts in the medical field. Our approach leverages the attribution of next-token predictive probability distributions across different tokens, and various model layers to primarily identify tokens with a high potential for hallucination, reducing verification rounds by refining knowledge triples associated with these tokens. Moreover, we rectify inaccurate content using retrieved knowledge in the post-processing stage, which improves the truthfulness of generated responses. Experimental results on a medical dataset demonstrate that our approach can enhance the factual capability of LLMs across various foundational models as evidenced by the highest scores on truthfulness.
