Grounding LLM Reasoning with Knowledge Graphs
Alfonso Amayuelas, Joy Sain, Simerjot Kaur, Charese Smiley
TL;DR
We address the verifiability gap in LLM outputs by grounding stepwise reasoning in domain-specific knowledge graphs. The authors propose a framework that binds each reasoning step to graph data and evaluate three reasoning strategies—CoT, ToT, and GoT—coupled with two KG interaction pipelines (Agent and Automatic Graph Exploration). The approach delivers state-of-the-art performance on GRBench (≥26.5% improvement over CoT) and provides detailed analyses of how step depth, branching, and model size shape reasoning quality. While grounding improves interpretability and reduces certain failures, it also introduces computational costs and complexities, especially for ToT and GoT, highlighting practical trade-offs for scalable, verifiable KG-grounded reasoning in real-world settings.
Abstract
Large Language Models (LLMs) excel at generating natural language answers, yet their outputs often remain unverifiable and difficult to trace. Knowledge Graphs (KGs) offer a complementary strength by representing entities and their relationships in structured form, providing a foundation for more reliable reasoning. We propose a novel framework that integrates LLM reasoning with KGs by linking each step of the reasoning process to graph-structured data. This grounding turns intermediate ``thoughts'' into interpretable traces that remain consistent with external knowledge. Our approach incorporates multiple reasoning strategies, Chain-of-Thought (CoT), Tree-of-Thought (ToT), and Graph-of-Thought (GoT), and is evaluated on GRBench, a benchmark for domain-specific graph reasoning. Our experiments show state-of-the-art (SOTA) performance, with at least 26.5\% improvement over CoT baselines. Beyond accuracy, we analyze how step depth, branching structure, and model size influence reasoning quality, offering insights into the conditions that support effective reasoning. Together, these contributions highlight how grounding LLMs in structured knowledge enables both higher accuracy and greater interpretability in complex reasoning tasks.
