DualRAG: A Dual-Process Approach to Integrate Reasoning and Retrieval for Multi-Hop Question Answering
Rong Cheng, Jinyi Liu, Yan Zheng, Fei Ni, Jiazhen Du, Hangyu Mao, Fuzheng Zhang, Bo Wang, Jianye Hao
TL;DR
DualRAG presents a dual-process RAG framework that tightly couples Reasoning-augmented Querying (RaQ) with progressive Knowledge Aggregation (pKA) to handle dynamic, multi-hop reasoning. RaQ actively detects knowledge gaps and generates targeted retrieval queries, while pKA constructs a progressive knowledge outline and summarizes retrieved content to support coherent reasoning. The approach yields substantial gains in MHQA accuracy and coherence across datasets, reduces the number of retrieval iterations, and remains effective for compact models via targeted fine-tuning. This work offers a scalable, noise-resilient solution for complex reasoning tasks with practical impact on real-world information-seeking AI systems.
Abstract
Multi-Hop Question Answering (MHQA) tasks permeate real-world applications, posing challenges in orchestrating multi-step reasoning across diverse knowledge domains. While existing approaches have been improved with iterative retrieval, they still struggle to identify and organize dynamic knowledge. To address this, we propose DualRAG, a synergistic dual-process framework that seamlessly integrates reasoning and retrieval. DualRAG operates through two tightly coupled processes: Reasoning-augmented Querying (RaQ) and progressive Knowledge Aggregation (pKA). They work in concert: as RaQ navigates the reasoning path and generates targeted queries, pKA ensures that newly acquired knowledge is systematically integrated to support coherent reasoning. This creates a virtuous cycle of knowledge enrichment and reasoning refinement. Through targeted fine-tuning, DualRAG preserves its sophisticated reasoning and retrieval capabilities even in smaller-scale models, demonstrating its versatility and core advantages across different scales. Extensive experiments demonstrate that this dual-process approach substantially improves answer accuracy and coherence, approaching, and in some cases surpassing, the performance achieved with oracle knowledge access. These results establish DualRAG as a robust and efficient solution for complex multi-hop reasoning tasks.
