Empowering GraphRAG with Knowledge Filtering and Integration
Kai Guo, Harry Shomer, Shenglai Zeng, Haoyu Han, Yu Wang, Jiliang Tang
TL;DR
This work tackles hallucination and knowledge gaps in graph retrieval-augmented generation for KGQA. It introduces GraphRAG-FI, combining a two-stage filtering module with a logits-based integration strategy to suppress noisy retrieval and to balance external knowledge with the LLM’s intrinsic reasoning. Empirical results on WebQSP and CWQ across multiple backbones show consistent improvements over baselines, with robust performance under noise and clear ablations demonstrating the contributions of filtering and integration. The approach yields a more reliable GraphRAG framework with practical impact for multi-hop knowledge graph reasoning.
Abstract
In recent years, large language models (LLMs) have revolutionized the field of natural language processing. However, they often suffer from knowledge gaps and hallucinations. Graph retrieval-augmented generation (GraphRAG) enhances LLM reasoning by integrating structured knowledge from external graphs. However, we identify two key challenges that plague GraphRAG:(1) Retrieving noisy and irrelevant information can degrade performance and (2)Excessive reliance on external knowledge suppresses the model's intrinsic reasoning. To address these issues, we propose GraphRAG-FI (Filtering and Integration), consisting of GraphRAG-Filtering and GraphRAG-Integration. GraphRAG-Filtering employs a two-stage filtering mechanism to refine retrieved information. GraphRAG-Integration employs a logits-based selection strategy to balance external knowledge from GraphRAG with the LLM's intrinsic reasoning,reducing over-reliance on retrievals. Experiments on knowledge graph QA tasks demonstrate that GraphRAG-FI significantly improves reasoning performance across multiple backbone models, establishing a more reliable and effective GraphRAG framework.
