Probing-RAG: Self-Probing to Guide Language Models in Selective Document Retrieval
Ingeol Baek, Hwan Chang, Byeongjeong Kim, Jimin Lee, Hwanhee Lee
TL;DR
Probing-RAG tackles the challenge of when to retrieve external knowledge by tapping into the language model’s internal hidden states with a lightweight prober. The prober, trained on synthetic open-domain QA data, predicts whether additional retrieval will improve answer quality, enabling adaptive retrieval that avoids unnecessary external information. Across five open-domain QA datasets, Probing-RAG outperforms prior adaptive retrieval methods while reducing retrieval calls by about 50% on average, and demonstrates robust consistency and strong correlation between prober accuracy and QA performance. This approach offers a practical, efficient mechanism to mitigate external-knowledge conflicts and hallucinations by balancing internal model knowledge with retrieved content. The findings highlight the potential of internal-state-guided retrieval to enhance real-world RAG systems, with limitations related to access to hidden states and training data requirements.
Abstract
Retrieval-Augmented Generation (RAG) enhances language models by retrieving and incorporating relevant external knowledge. However, traditional retrieve-and-generate processes may not be optimized for real-world scenarios, where queries might require multiple retrieval steps or none at all. In this paper, we propose a Probing-RAG, which utilizes the hidden state representations from the intermediate layers of language models to adaptively determine the necessity of additional retrievals for a given query. By employing a pre-trained prober, Probing-RAG effectively captures the model's internal cognition, enabling reliable decision-making about retrieving external documents. Experimental results across five open-domain QA datasets demonstrate that Probing-RAG outperforms previous methods while reducing the number of redundant retrieval steps.
