Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs
Yangning Li, Weizhi Zhang, Yuyao Yang, Wei-Chieh Huang, Yaozu Wu, Junyu Luo, Yuanchen Bei, Henry Peng Zou, Xiao Luo, Yusheng Zhao, Chunkit Chan, Yankai Chen, Zhongfen Deng, Yinghui Li, Hai-Tao Zheng, Dongyuan Li, Renhe Jiang, Ming Zhang, Yangqiu Song, Philip S. Yu
TL;DR
This survey analyzes three evolutionary trajectories in aligning retrieval with reasoning in LLMs: Reasoning-Enhanced RAG, which injects reasoning into RAG stages; RAG-Enhanced Reasoning, which grounds long Chain-of-Thought with external knowledge; and Synergized RAG-Reasoning, where iterative, agentic loops jointly optimize search and inference. It provides a unified taxonomy across retrieval optimization, integration, and generation, then extends to external knowledge sources, in-context cues, and agent orchestration in deep research systems. Key contributions include a comprehensive framework, an extensive benchmark panorama, and open challenges for efficiency, multimodality, trust, and human collaboration. The work highlights practical implications for building robust, knowledge-grounded, and transparent LLM systems capable of complex, knowledge-intensive reasoning.
Abstract
Retrieval-Augmented Generation (RAG) lifts the factuality of Large Language Models (LLMs) by injecting external knowledge, yet it falls short on problems that demand multi-step inference; conversely, purely reasoning-oriented approaches often hallucinate or mis-ground facts. This survey synthesizes both strands under a unified reasoning-retrieval perspective. We first map how advanced reasoning optimizes each stage of RAG (Reasoning-Enhanced RAG). Then, we show how retrieved knowledge of different type supply missing premises and expand context for complex inference (RAG-Enhanced Reasoning). Finally, we spotlight emerging Synergized RAG-Reasoning frameworks, where (agentic) LLMs iteratively interleave search and reasoning to achieve state-of-the-art performance across knowledge-intensive benchmarks. We categorize methods, datasets, and open challenges, and outline research avenues toward deeper RAG-Reasoning systems that are more effective, multimodally-adaptive, trustworthy, and human-centric. The collection is available at https://github.com/DavidZWZ/Awesome-RAG-Reasoning.
