Automatic In-Domain Exemplar Construction and LLM-Based Refinement of Multi-LLM Expansions for Query Expansion
Minghan Li, Ercong Nie, Siqi Zhao, Tongna Chen, Huiping Huang, Guodong Zhou
TL;DR
This paper tackles the instability and poor scalability of LLM-driven query expansion (QE) under domain shift by introducing a fully automated, domain-adaptive QE framework. It constructs large in-domain exemplar pools via a BM25–MonoT5 pipeline, then selects diverse demonstrations with a simple clustering strategy, enabling training-free in-context learning. To exploit model complementarity, it proposes a two-LLM expansion ensemble whose outputs are synthesized by an LLM refinement module, eliminating the need for additional retrieval passes. Across TREC DL20, DBPedia-Entity, and SciFact, the approach yields robust gains over lexical baselines and single-LLM prompts. The refined two-LLM ensemble provides the strongest, often statistically significant improvements, and the benefits extend to dense retrieval setups.
Abstract
Query expansion with large language models is promising but often relies on hand-crafted prompts, manually chosen exemplars, or a single LLM, making it non-scalable and sensitive to domain shift. We present an automated, domain-adaptive QE framework that builds in-domain exemplar pools by harvesting pseudo-relevant passages using a BM25-MonoT5 pipeline. A training-free cluster-based strategy selects diverse demonstrations, yielding strong and stable in-context QE without supervision. To further exploit model complementarity, we introduce a two-LLM ensemble in which two heterogeneous LLMs independently generate expansions and a refinement LLM consolidates them into one coherent expansion. Across TREC DL20, DBPedia, and SciFact, the refined ensemble delivers consistent and statistically significant gains over BM25, Rocchio, zero-shot, and fixed few-shot baselines. The framework offers a reproducible testbed for exemplar selection and multi-LLM generation, and a practical, label-free solution for real-world QE.
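The two core steps described above, cluster-based demonstration selection and two-LLM expansion with refinement, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not name the clustering algorithm, the exemplar format, or the prompt wording, so the use of k-means over exemplar embeddings, the (query, passage) exemplar pairs, the prompt templates, and all function names (`select_diverse`, `build_prompt`, `expand_and_refine`) are assumptions made for illustration. The LLMs are passed in as plain callables, so no particular model API is implied.

```python
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Exemplar = Tuple[str, str]   # assumed format: (query, pseudo-relevant passage)
LLM = Callable[[str], str]   # hypothetical interface: prompt in, text out


def _sqdist(a: Vector, b: Vector) -> float:
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def select_diverse(vectors: List[Vector], k: int, iters: int = 10) -> List[int]:
    """Run a small k-means and return the index of the point nearest each centroid.

    k-means is an assumption; the source only says "cluster-based".
    May return fewer than k indices if two centroids share a nearest point.
    """
    if not 0 < k <= len(vectors):
        raise ValueError("k must be between 1 and the pool size")
    # Deterministic farthest-first initialisation, chosen for reproducibility.
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        far = max(vectors, key=lambda v: min(_sqdist(v, c) for c in centroids))
        centroids.append(list(far))
    for _ in range(iters):
        clusters: List[List[Vector]] = [[] for _ in range(k)]
        for v in vectors:
            nearest = min(range(k), key=lambda j: _sqdist(v, centroids[j]))
            clusters[nearest].append(v)
        for j, members in enumerate(clusters):
            if members:  # keep the old centroid if a cluster becomes empty
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    chosen = {
        min(range(len(vectors)), key=lambda i: _sqdist(vectors[i], c))
        for c in centroids
    }
    return sorted(chosen)


def build_prompt(exemplars: List[Exemplar], query: str) -> str:
    """Assemble a few-shot prompt from the selected exemplars (template is assumed)."""
    shots = "\n\n".join(f"Query: {q}\nPassage: {p}" for q, p in exemplars)
    return f"{shots}\n\nQuery: {query}\nPassage:"


def expand_and_refine(
    query: str, exemplars: List[Exemplar], llm_a: LLM, llm_b: LLM, refiner: LLM
) -> str:
    """Generate two independent expansions, then ask a third LLM to merge them."""
    prompt = build_prompt(exemplars, query)
    draft_a, draft_b = llm_a(prompt), llm_b(prompt)
    refine_prompt = (
        f"Query: {query}\nExpansion A: {draft_a}\nExpansion B: {draft_b}\n"
        "Merge these into one coherent expansion."
    )
    return refiner(refine_prompt)
```

In practice, `vectors` would hold embeddings of the harvested exemplars, and the three callables would wrap whichever LLM clients are in use. How the refined expansion is combined with the original query before retrieval is left out here because the abstract does not specify it.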
