Improving Neural Retrieval with Attribution-Guided Query Rewriting

Moncef Garouani; Josiane Mothe

Improving Neural Retrieval with Attribution-Guided Query Rewriting

Moncef Garouani, Josiane Mothe

TL;DR

This work tackles the brittleness of neural information retrieval when queries are underspecified by introducing an attribution-guided query rewriting framework. It computes token-level contributions from the retriever using Integrated Gradients over the top-$k$ retrieved documents and uses these scores as soft guidance in an LLM prompt to rewrite the query while preserving intent. The rewritten query is then submitted to the same retriever in a closed loop, enabling retriever-aware improvements without retraining. Empirical results on BEIR with SPLADE and TCT-ColBERT show consistent gains in $\text{nDCG}$, $\text{MAP}$, and precision, especially for implicit or domain-specific needs, demonstrating that explanations can be effectively leveraged to improve neural retrieval in practice.

Abstract

Neural retrievers are effective but brittle: underspecified or ambiguous queries can misdirect ranking even when relevant documents exist. Existing approaches address this brittleness only partially: LLMs rewrite queries without retriever feedback, and explainability methods identify misleading tokens but are used for post-hoc analysis. We close this loop and propose an attribution-guided query rewriting method that uses token-level explanations to guide query rewriting. For each query, we compute gradient-based token attributions from the retriever and then use these scores as soft guidance in a structured prompt to an LLM that clarifies weak or misleading query components while preserving intent. Evaluated on BEIR collections, the resulting rewrites consistently improve retrieval effectiveness over strong baselines, with larger gains for implicit or ambiguous information needs.

Improving Neural Retrieval with Attribution-Guided Query Rewriting

TL;DR

retrieved documents and uses these scores as soft guidance in an LLM prompt to rewrite the query while preserving intent. The rewritten query is then submitted to the same retriever in a closed loop, enabling retriever-aware improvements without retraining. Empirical results on BEIR with SPLADE and TCT-ColBERT show consistent gains in

, and precision, especially for implicit or domain-specific needs, demonstrating that explanations can be effectively leveraged to improve neural retrieval in practice.

Abstract

Paper Structure (14 sections, 1 equation, 2 tables)

This paper contains 14 sections, 1 equation, 2 tables.

Introduction
Related Work
Attribution-Guided Query Rewriting
Problem Formulation
Attribution-Guided Prompting for Query Rewriting
LLM-Based Query Reformulation
Experimental Setup
Datasets, Retrievers, and LLM
Baselines and Evaluation Metrics
Implementation Details
Results and Analysis
Comparison with Baselines
Qualitative Analysis
Conclusion

Improving Neural Retrieval with Attribution-Guided Query Rewriting

TL;DR

Abstract

Improving Neural Retrieval with Attribution-Guided Query Rewriting

Authors

TL;DR

Abstract

Table of Contents