Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval

Yulong Hui; Chao Chen; Zhihang Fu; Yihao Liu; Jieping Ye; Huanchen Zhang

Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval

Yulong Hui, Chao Chen, Zhihang Fu, Yihao Liu, Jieping Ye, Huanchen Zhang

TL;DR

Interact-RAG tackles the bottleneck of black-box retrieval in retrieval-augmented generation by equipping LLM agents with a Corpus Interaction Engine and a reasoning-enhanced workflow. The system trains an autonomous end-to-end agent via supervised fine-tuning on reasoned interaction trajectories, followed by reinforcement learning with Group Relative Policy Optimization, achieving significant gains across six RAG benchmarks, especially in multi-hop tasks. This approach improves retrieval efficiency and generalization to out-of-distribution data, demonstrating the value of integrating interactive retrieval with structured reasoning. Overall, Interact-RAG offers a transparent, capable paradigm for agent-driven information seeking in RAG pipelines.

Abstract

Retrieval-Augmented Generation (RAG) has significantly enhanced LLMs by incorporating external information. However, prevailing agentic RAG approaches are constrained by a critical limitation: they treat the retrieval process as a black-box querying operation. This confines agents' actions to query issuing, hindering its ability to tackle complex information-seeking tasks. To address this, we introduce Interact-RAG, a new paradigm that elevates the LLM agent from a passive query issuer into an active manipulator of the retrieval process. We dismantle the black-box with a Corpus Interaction Engine, equipping the agent with a set of action primitives for fine-grained control over information retrieval. To further empower the agent on the entire RAG pipeline, we first develop a reasoning-enhanced workflow, which enables both zero-shot execution and the synthesis of interaction trajectories. We then leverage this synthetic data to train a fully autonomous end-to-end agent via Supervised Fine-Tuning (SFT), followed by refinement with Reinforcement Learning (RL). Extensive experiments across six benchmarks demonstrate that Interact-RAG significantly outperforms other advanced methods, validating the efficacy of our reasoning-interaction strategy.

Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval

TL;DR

Abstract

Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)