Self-Reflective Planning with Knowledge Graphs: Enhancing LLM Reasoning Reliability for Question Answering
Jiajun Zhu, Ye Liu, Meikai Bao, Kai Zhang, Yanghai Zhang, Qi Liu
TL;DR
This work tackles LLM hallucinations in knowledge-grounded question answering by introducing Self-Reflective Planning (SRP), a framework that integrates LLMs with knowledge graphs through reference-guided planning, iterative retrieval, and self-reflection. SRP comprises four modules—Reference Searching, Path Planning (relation check and path generation), Knowledge Retrieval, and Reflection and Reasoning (sequence judge, path edit, and answering)—to generate and refine reliable reasoning paths and triplet sequences. Empirical results on WebQSP, CWQ, and GrailQA show SRP achieving state-of-the-art performance on two benchmarks and strong results on the third, with significantly higher reliable answering rates than baselines. The approach demonstrates that structured references and iterative self-correction can substantially improve factual grounding and reasoning reliability in KGQA without relying on fine-tuning. Limitations include higher computational cost due to repeated reasoning cycles and sensitivity to reference quality, suggesting directions for efficiency and robust reference dynamics in future work.
Abstract
Recently, large language models (LLMs) have demonstrated remarkable capabilities in natural language processing tasks, yet they remain prone to hallucinations when reasoning with insufficient internal knowledge. While integrating LLMs with knowledge graphs (KGs) provides access to structured, verifiable information, existing approaches often generate incomplete or factually inconsistent reasoning paths. To this end, we propose Self-Reflective Planning (SRP), a framework that synergizes LLMs with KGs through iterative, reference-guided reasoning. Specifically, given a question and topic entities, SRP first searches for references to guide planning and reflection. In the planning process, it checks initial relations and generates a reasoning path. After retrieving knowledge from KGs through a reasoning path, it implements iterative reflection by judging the retrieval result and editing the reasoning path until the answer is correctly retrieved. Extensive experiments on three public datasets demonstrate that SRP surpasses various strong baselines and further underscore its reliable reasoning ability.
