TreeRare: Syntax Tree-Guided Retrieval and Reasoning for Knowledge-Intensive Question Answering
Boyi Zhang, Zhuo Liu, Hangfeng He
TL;DR
TreeRare tackles knowledge-intensive QA by interleaving syntax-driven question decomposition with bottom-up retrieval and reasoning. It uses a syntax tree to guide subcomponent query generation, targeted retrieval, and subcomponent answering, followed by aggregating evidence for a final answer. Across five benchmarks and multiple LLM backbones, TreeRare yields substantial improvements over state-of-the-art baselines, and the ablation studies confirm the critical role of syntax-guided retrieval and per-node reasoning. The approach reduces hallucinations and ambiguity by grounding evidence at fine-grained sub-phrases, though it introduces higher computational costs and depends on parsing quality.
Abstract
In real practice, questions are typically complex and knowledge-intensive, requiring Large Language Models (LLMs) to recognize the multifaceted nature of the question and reason across multiple information sources. Iterative and adaptive retrieval, where LLMs decide when and what to retrieve based on their reasoning, has been shown to be a promising approach to resolve complex, knowledge-intensive questions. However, the performance of such retrieval frameworks is limited by the accumulation of reasoning errors and misaligned retrieval results. To overcome these limitations, we propose TreeRare (Syntax Tree-Guided Retrieval and Reasoning), a framework that utilizes syntax trees to guide information retrieval and reasoning for question answering. Following the principle of compositionality, TreeRare traverses the syntax tree in a bottom-up fashion, and in each node, it generates subcomponent-based queries and retrieves relevant passages to resolve localized uncertainty. A subcomponent question answering module then synthesizes these passages into concise, context-aware evidence. Finally, TreeRare aggregates the evidence across the tree to form a final answer. Experiments across five question answering datasets involving ambiguous or multi-hop reasoning demonstrate that TreeRare achieves substantial improvements over existing state-of-the-art methods.
