Table of Contents
Fetching ...

Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA

Xiangqing Shen, Fanfan Wang, Rui Xia

TL;DR

RAR presents Reasoner, Aligner, and Responser to jointly reason over questions and ground them to KG paths. It uses EM to optimize a latent-variable model of reasoning chains and KG paths, producing interpretable, KG-aligned answers. Experiments across WebQSP, CWQ, CSQA, and MedQA show state-of-the-art results with strong zero-shot generalization and efficient inference. The approach advances reliable KGQA by tightly coupling human-like reasoning with structured knowledge and constraining decoding to valid KG paths.

Abstract

LLMs have demonstrated remarkable capabilities in complex reasoning tasks, yet they often suffer from hallucinations and lack reliable factual grounding. Meanwhile, knowledge graphs (KGs) provide structured factual knowledge but lack the flexible reasoning abilities of LLMs. In this paper, we present Reason-Align-Respond (RAR), a novel framework that systematically integrates LLM reasoning with knowledge graphs for KGQA. Our approach consists of three key components: a Reasoner that generates human-like reasoning chains, an Aligner that maps these chains to valid KG paths, and a Responser that synthesizes the final answer. We formulate this process as a probabilistic model and optimize it using the Expectation-Maximization algorithm, which iteratively refines the reasoning chains and knowledge paths. Extensive experiments on multiple benchmarks demonstrate the effectiveness of RAR, achieving state-of-the-art performance with Hit@1 scores of 93.3% and 91.0% on WebQSP and CWQ respectively. Human evaluation confirms that RAR generates high-quality, interpretable reasoning chains well-aligned with KG paths. Furthermore, RAR exhibits strong zero-shot generalization capabilities and maintains computational efficiency during inference.

Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA

TL;DR

RAR presents Reasoner, Aligner, and Responser to jointly reason over questions and ground them to KG paths. It uses EM to optimize a latent-variable model of reasoning chains and KG paths, producing interpretable, KG-aligned answers. Experiments across WebQSP, CWQ, CSQA, and MedQA show state-of-the-art results with strong zero-shot generalization and efficient inference. The approach advances reliable KGQA by tightly coupling human-like reasoning with structured knowledge and constraining decoding to valid KG paths.

Abstract

LLMs have demonstrated remarkable capabilities in complex reasoning tasks, yet they often suffer from hallucinations and lack reliable factual grounding. Meanwhile, knowledge graphs (KGs) provide structured factual knowledge but lack the flexible reasoning abilities of LLMs. In this paper, we present Reason-Align-Respond (RAR), a novel framework that systematically integrates LLM reasoning with knowledge graphs for KGQA. Our approach consists of three key components: a Reasoner that generates human-like reasoning chains, an Aligner that maps these chains to valid KG paths, and a Responser that synthesizes the final answer. We formulate this process as a probabilistic model and optimize it using the Expectation-Maximization algorithm, which iteratively refines the reasoning chains and knowledge paths. Extensive experiments on multiple benchmarks demonstrate the effectiveness of RAR, achieving state-of-the-art performance with Hit@1 scores of 93.3% and 91.0% on WebQSP and CWQ respectively. Human evaluation confirms that RAR generates high-quality, interpretable reasoning chains well-aligned with KG paths. Furthermore, RAR exhibits strong zero-shot generalization capabilities and maintains computational efficiency during inference.

Paper Structure

This paper contains 44 sections, 12 equations, 12 figures, 6 tables, 1 algorithm.

Figures (12)

  • Figure 1: The comparison between our Reason-Align-Respond framework and the existing methods for LLM-based KGQA.
  • Figure 2: Illustration of our RAR framework comprising Reasoner, Aligner, Responser with iterative EM optimization.
  • Figure 3: Impact of iteration steps of the EM algorithm.
  • Figure 4: Human evaluation of reasoning chains on CWQ.
  • Figure 5: Impact of different beam size on CWQ.
  • ...and 7 more figures