Table of Contents
Fetching ...

Knowledge Graph-extended Retrieval Augmented Generation for Question Answering

Jasper Linders, Jakub M. Tomczak

TL;DR

This work addresses the limitations of LLMs and KGs in isolation by proposing KG-RAG, a training-free system that combines KG-based retrieval with LLM reasoning. A new question decomposition module generates a chain-of-thought followed by sub-questions to improve multi-hop retrieval and explainability, while keeping generalizability across KGs. Evaluations on the MetaQA benchmark show improved accuracy for multi-hop questions, with a small trade-off on single-hop cases, and qualitative analysis highlights enhanced transparency through explicit reasoning traces. The approach demonstrates that bridging unstructured NL understanding with structured KG retrieval can yield more interpretable QA systems without domain-specific training, with potential for broader KG-enabled AI applications.

Abstract

Large Language Models (LLMs) and Knowledge Graphs (KGs) offer a promising approach to robust and explainable Question Answering (QA). While LLMs excel at natural language understanding, they suffer from knowledge gaps and hallucinations. KGs provide structured knowledge but lack natural language interaction. Ideally, an AI system should be both robust to missing facts as well as easy to communicate with. This paper proposes such a system that integrates LLMs and KGs without requiring training, ensuring adaptability across different KGs with minimal human effort. The resulting approach can be classified as a specific form of a Retrieval Augmented Generation (RAG) with a KG, thus, it is dubbed Knowledge Graph-extended Retrieval Augmented Generation (KG-RAG). It includes a question decomposition module to enhance multi-hop information retrieval and answer explainability. Using In-Context Learning (ICL) and Chain-of-Thought (CoT) prompting, it generates explicit reasoning chains processed separately to improve truthfulness. Experiments on the MetaQA benchmark show increased accuracy for multi-hop questions, though with a slight trade-off in single-hop performance compared to LLM with KG baselines. These findings demonstrate KG-RAG's potential to improve transparency in QA by bridging unstructured language understanding with structured knowledge retrieval.

Knowledge Graph-extended Retrieval Augmented Generation for Question Answering

TL;DR

This work addresses the limitations of LLMs and KGs in isolation by proposing KG-RAG, a training-free system that combines KG-based retrieval with LLM reasoning. A new question decomposition module generates a chain-of-thought followed by sub-questions to improve multi-hop retrieval and explainability, while keeping generalizability across KGs. Evaluations on the MetaQA benchmark show improved accuracy for multi-hop questions, with a small trade-off on single-hop cases, and qualitative analysis highlights enhanced transparency through explicit reasoning traces. The approach demonstrates that bridging unstructured NL understanding with structured KG retrieval can yield more interpretable QA systems without domain-specific training, with potential for broader KG-enabled AI applications.

Abstract

Large Language Models (LLMs) and Knowledge Graphs (KGs) offer a promising approach to robust and explainable Question Answering (QA). While LLMs excel at natural language understanding, they suffer from knowledge gaps and hallucinations. KGs provide structured knowledge but lack natural language interaction. Ideally, an AI system should be both robust to missing facts as well as easy to communicate with. This paper proposes such a system that integrates LLMs and KGs without requiring training, ensuring adaptability across different KGs with minimal human effort. The resulting approach can be classified as a specific form of a Retrieval Augmented Generation (RAG) with a KG, thus, it is dubbed Knowledge Graph-extended Retrieval Augmented Generation (KG-RAG). It includes a question decomposition module to enhance multi-hop information retrieval and answer explainability. Using In-Context Learning (ICL) and Chain-of-Thought (CoT) prompting, it generates explicit reasoning chains processed separately to improve truthfulness. Experiments on the MetaQA benchmark show increased accuracy for multi-hop questions, though with a slight trade-off in single-hop performance compared to LLM with KG baselines. These findings demonstrate KG-RAG's potential to improve transparency in QA by bridging unstructured language understanding with structured knowledge retrieval.

Paper Structure

This paper contains 54 sections, 1 equation, 8 figures, 3 tables.

Figures (8)

  • Figure 1: An example of a Retrieval-Augmented Generation (RAG) system, which combines information retrieval and text generation techniques. The red block indicates processing by a text embedding model, whereas the blue block depicts processing by an LLM. The yellow block shows a selector of nearest text chunks in the database.
  • Figure 2: The architecture of the proposed system. An example of a 2-hop question is included, to give an idea of the data structures that are involved in the end-to-end process. The green color indicates processing with the KG; the red block shows the embedding model and the blue modules utilize an LLM.
  • Figure 3: A simple subgraph of triples from MetaQA Zhang. As indicated by the arrows, this KG is a directed graph, which has implications for candidate triple retrieval. If Inception were the entity we were retrieving for, each darker tint of gray shows the entities that would be reached for a hop deeper.
  • Figure 4: The distribution of degrees (triples per entity) in the MetaQA KG. (Note that the distribution is long-tailed, so the cut-off at the value of 30 is for the purpose of visualization.)
  • Figure 5: MetaQA performance results for experiment 1, over 10 samples of 100 questions for each of the three datasets. The bars show the mean Hit@1 for different parameter configurations; the error bars show the standard deviation.
  • ...and 3 more figures