Beyond Chains: Bridging Large Language Models and Knowledge Bases in Complex Question Answering
Yihua Zhu, Qianying Liu, Akiko Aizawa, Hidetoshi Shimodaira
TL;DR
This paper tackles knowledge base question answering (KBQA) with large language models by addressing the limitations of chain-only KG‑RAG methods through a training‑free four‑stage framework called PDRR (Predict–Decompose–Retrieve–Reason). PDRR introduces an explicit planning module that predicts question type (chain vs parallel) and decomposes queries into structured decomposition triples, followed by a retrieval and reasoning module that grounds and executes the plan over a knowledge graph. The approach handles both chain and non‑chain (e.g., conjunction) reasoning, showing robust improvements across multiple backbones (e.g., GPT‑4o, GPT‑3.5‑turbo) and datasets (CWQ, WebQSP, SimpleQuestions, GrailQA). Empirical results demonstrate that PDRR outperforms training‑free baselines and remains competitive with training‑based methods, highlighting the value of explicit planning and principled grounding in complex KBQA. The work advances practical KBQA by delivering transparent, scalable reasoning workflows that mitigate hallucination and knowledge staleness while preserving interpretability through decomposition triples and planning.
Abstract
Knowledge Base Question Answering (KBQA) aims to answer natural language questions using structured knowledge from KBs. While LLM-only approaches offer generalization, they suffer from outdated knowledge, hallucinations, and lack of transparency. Chain-based KG-RAG methods address these issues by incorporating external KBs, but are limited to simple chain-structured questions due to the absence of planning and logical structuring. Inspired by semantic parsing methods, we propose PDRR: a four-stage framework consisting of Predict, Decompose, Retrieve, and Reason. Our method first predicts the question type and decomposes the question into structured triples. Then retrieves relevant information from KBs and guides the LLM as an agent to reason over and complete the decomposed triples. Experimental results demonstrate that PDRR consistently outperforms existing methods across various LLM backbones and achieves superior performance on both chain-structured and non-chain complex questions.
