KG-Infused RAG: Augmenting Corpus-Based RAG with External Knowledge Graphs
Dingjun Wu, Yukun Yan, Zhenghao Liu, Zhiyuan Liu, Maosong Sun
TL;DR
KG-Infused RAG introduces spreading activation over pre-existing large-scale knowledge graphs to augment corpus-based retrieval in RAG. By constructing a query-specific KG subgraph from seeds, expanding the query with KG-derived knowledge, and generating answers from multimodal evidence, the method achieves superior factual grounding and efficiency compared to ad-hoc KG approaches. The framework supports plug-in integration with existing RAG systems and is enhanced by Direct Preference Optimization to improve knowledge augmentation. Experiments on five QA benchmarks show consistent gains over vanilla RAG and competitive performance versus KG-based baselines, highlighting its practicality for open-domain QA with scalable, structured knowledge grounding.
Abstract
Retrieval-Augmented Generation (RAG) improves factual accuracy by grounding responses in external knowledge. However, existing RAG methods either rely solely on text corpora and neglect structural knowledge, or build ad-hoc knowledge graphs (KGs) at high cost and low reliability. To address these issues, we propose KG-Infused RAG, a framework that incorporates pre-existing large-scale KGs into RAG and applies spreading activation to enhance both retrieval and generation. KG-Infused RAG directly performs spreading activation over external KGs to retrieve relevant structured knowledge, which is then used to expand queries and integrated with corpus passages, enabling interpretable and semantically grounded multi-source retrieval. We further improve KG-Infused RAG through preference learning on sampled key stages of the pipeline. Experiments on five QA benchmarks show that KG-Infused RAG consistently outperforms vanilla RAG (by 3.9% to 17.8%). Compared with KG-based approaches such as GraphRAG and LightRAG, our method obtains structured knowledge at lower cost while achieving superior performance. Additionally, integrating KG-Infused RAG with Self-RAG and DeepNote yields further gains, demonstrating its effectiveness and versatility as a plug-and-play enhancement module for corpus-based RAG methods.
