IterCQR: Iterative Conversational Query Reformulation with Retrieval Guidance
Yunah Jang, Kang-il Lee, Hyunkyung Bae, Hwanhee Lee, Kyomin Jung
TL;DR
This work tackles conversational search by reframing context-dependent queries into stand-alone forms without relying on human rewrites. It introduces IterCQR, an iterative, IR-guided training framework that starts from an LLM-generated rewrite dataset and progressively improves via Minimum Bayes Risk training and Top-1 candidate selection, using a cosine-similarity reward between candidate rewrites and gold passages. Across TopiOCQA and QReCC, IterCQR achieves state-of-the-art results for both dense and sparse retrievers, and exhibits strong generalization to unseen datasets as well as robustness in low-resource settings. The approach reduces dependency on manually crafted rewrites while aligning reformulations with retriever behavior, offering practical gains for real-world conversational search systems.
Abstract
Conversational search aims to retrieve passages containing essential information to answer queries in a multi-turn conversation. In conversational search, reformulating context-dependent conversational queries into stand-alone forms is imperative to effectively utilize off-the-shelf retrievers. Previous methodologies for conversational query reformulation frequently depend on human-annotated rewrites. However, these manually crafted queries often result in sub-optimal retrieval performance and require high collection costs. To address these challenges, we propose Iterative Conversational Query Reformulation (IterCQR), a methodology that conducts query reformulation without relying on human rewrites. IterCQR iteratively trains the conversational query reformulation (CQR) model by directly leveraging information retrieval (IR) signals as a reward. Our IterCQR training guides the CQR model such that generated queries contain necessary information from the previous dialogue context. Our proposed method shows state-of-the-art performance on two widely-used datasets, demonstrating its effectiveness on both sparse and dense retrievers. Moreover, IterCQR exhibits superior performance in challenging settings such as generalization on unseen datasets and low-resource scenarios.
