Table of Contents
Fetching ...

Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey

Minghan Li, Xinxuan Lv, Junjie Zou, Tongna Chen, Chao Zhang, Suchao An, Ercong Nie, Guodong Zhou

TL;DR

The paper surveys the evolution of query expansion from traditional corpus and lexicon driven methods to neural methods based on pre trained language models and large language models. It presents a four axis framework encompassing point of injection, grounding and interaction, learning and alignment, and KG augmentation to organize QE research. It provides a model taxonomy and practical guidance for deploying QE across retrieval stages including first stage recall, re ranking, and retrieval augmented generation. It highlights challenges in reliability, safety, efficiency, and continual adaptation, and offers a principled blueprint for combining techniques under real world constraints.

Abstract

Modern information retrieval (IR) must reconcile short, ambiguous queries with increasingly diverse and dynamic corpora. Query expansion (QE) remains central to alleviating vocabulary mismatch, yet the design space has shifted with pre-trained and large language models (PLMs, LLMs). In this survey, we organize recent work along four complementary dimensions: the point of injection (implicit/embedding vs. selection-based explicit), grounding and interaction (from zero-grounding prompts to multi-round retrieve-expand loops), learning and alignment (SFT/PEFT/DPO), and knowledge-graph integration. A model-centric taxonomy is also outlined, spanning encoder-only, encoder-decoder, decoder-only, instruction-tuned, and domain or multilingual variants, with affordances for QE such as contextual disambiguation, controllable generation, and zero-shot or few-shot reasoning. Practice-oriented guidance specifies where neural QE helps most: first-stage retrieval, multi-query fusion, re-ranking, and retrieval-augmented generation (RAG). The survey compares traditional and neural QE across seven aspects and maps applications in web search, biomedicine, e-commerce, open-domain question answering/RAG, conversational and code search, and cross-lingual settings. The survey concludes with an agenda focused on reliable, safe, efficient, and adaptive QE, offering a principled blueprint for deploying and combining techniques under real-world constraints.

Query Expansion in the Age of Pre-trained and Large Language Models: A Comprehensive Survey

TL;DR

The paper surveys the evolution of query expansion from traditional corpus and lexicon driven methods to neural methods based on pre trained language models and large language models. It presents a four axis framework encompassing point of injection, grounding and interaction, learning and alignment, and KG augmentation to organize QE research. It provides a model taxonomy and practical guidance for deploying QE across retrieval stages including first stage recall, re ranking, and retrieval augmented generation. It highlights challenges in reliability, safety, efficiency, and continual adaptation, and offers a principled blueprint for combining techniques under real world constraints.

Abstract

Modern information retrieval (IR) must reconcile short, ambiguous queries with increasingly diverse and dynamic corpora. Query expansion (QE) remains central to alleviating vocabulary mismatch, yet the design space has shifted with pre-trained and large language models (PLMs, LLMs). In this survey, we organize recent work along four complementary dimensions: the point of injection (implicit/embedding vs. selection-based explicit), grounding and interaction (from zero-grounding prompts to multi-round retrieve-expand loops), learning and alignment (SFT/PEFT/DPO), and knowledge-graph integration. A model-centric taxonomy is also outlined, spanning encoder-only, encoder-decoder, decoder-only, instruction-tuned, and domain or multilingual variants, with affordances for QE such as contextual disambiguation, controllable generation, and zero-shot or few-shot reasoning. Practice-oriented guidance specifies where neural QE helps most: first-stage retrieval, multi-query fusion, re-ranking, and retrieval-augmented generation (RAG). The survey compares traditional and neural QE across seven aspects and maps applications in web search, biomedicine, e-commerce, open-domain question answering/RAG, conversational and code search, and cross-lingual settings. The survey concludes with an agenda focused on reliable, safe, efficient, and adaptive QE, offering a principled blueprint for deploying and combining techniques under real-world constraints.

Paper Structure

This paper contains 123 sections, 41 equations, 3 figures, 3 tables.

Figures (3)

  • Figure 1: A Taxonomy of Query Expansion Techniques: From Traditional Methods to PLM/LLM-Driven Techniques and Applications.
  • Figure 2: Application Scenarios of Query Expansion in Information Retrieval
  • Figure 3: Application Scenarios of Query Expansion in Information Retrieval