A Pilot Empirical Study on When and How to Use Knowledge Graphs as Retrieval Augmented Generation

Xujie Yuan; Yongxu Liu; Shimin Di; Shiwen Wu; Libin Zheng; Rui Meng; Lei Chen; Xiaofang Zhou; Jian Yin

A Pilot Empirical Study on When and How to Use Knowledge Graphs as Retrieval Augmented Generation

Xujie Yuan, Yongxu Liu, Shimin Di, Shiwen Wu, Libin Zheng, Rui Meng, Lei Chen, Xiaofang Zhou, Jian Yin

TL;DR

This work analyzes when and how to use Knowledge Graphs within Retrieval Augmented Generation by introducing a configurable KG-RAG framework and two methods, Pilot and Meta, and evaluating 6 KG-RAG approaches across 9 datasets and 17 LLMs. It shows strong, task-dependent gains in domain-specific medical QA and exams, with more modest improvements in open-domain tasks and limited benefits in multi-turn dialogues. The study demonstrates that KG quality and configuration interactions critically affect performance, and that benefits persist across BOS-LLMs as model scales. Overall, KG-RAG can match or surpass commercial models in specialized domains, offering practical guidelines for deploying KG-augmented retrieval in real-world applications.

Abstract

The integration of Knowledge Graphs (KGs) into the Retrieval Augmented Generation (RAG) framework has attracted significant interest, with early studies showing promise in mitigating hallucinations and improving model accuracy. However, a systematic understanding and comparative analysis of the rapidly emerging KG-RAG methods are still lacking. This paper seeks to lay the foundation for systematically answering the question of when and how to use KG-RAG by analyzing their performance in various application scenarios associated with different technical configurations. After outlining the mind map using KG-RAG framework and summarizing its popular pipeline, we conduct a pilot empirical study of KG-RAG works to reimplement and evaluate 6 KG-RAG methods across 9 datasets in diverse domains and scenarios, analyzing the impact of 9 KG-RAG configurations in combination with 17 LLMs, and combining Metacognition with KG-RAG as a pilot attempt. Our results underscore the critical role of appropriate application conditions and optimal configurations of KG-RAG components.

A Pilot Empirical Study on When and How to Use Knowledge Graphs as Retrieval Augmented Generation

TL;DR

Abstract

A Pilot Empirical Study on When and How to Use Knowledge Graphs as Retrieval Augmented Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (2)

Theorems & Definitions (1)