Diversify-verify-adapt: Efficient and Robust Retrieval-Augmented Ambiguous Question Answering
Yeonjun In, Sungchul Kim, Ryan A. Rossi, Md Mehrab Tanjim, Tong Yu, Ritwik Sinha, Chanyoung Park
TL;DR
Diva tackles the practical inefficiencies and low-quality retrievals in retrieval-augmented QA for ambiguous questions by introducing Retrieval Diversification (RD) to proactively cover diverse interpretations and Adaptive Generation (AG) to verify passage quality and choose the most suitable generation strategy. RD infers pseudo-interpretations via dual LLM prompts and retrieves passages that maximize coverage of these interpretations, while AG uses a novel retrieval verification (RV) scheme that grades retrieval quality into Useful, PartialUseful, and Useless, guiding whether to answer via RAG or rely on the LLM's internal knowledge. Empirical results on ASQA and SituatedQA show Diva outperforms Iterative RAG and CRAG in both accuracy and efficiency (approximately $1.5$–$3\times$ faster), with RD consistently improving retrieval quality and AG enhancing robustness under low-quality retrieval. The framework demonstrates strong generalization across backbones (from Llama3 to GPT-4) and remains effective on unambiguous questions, indicating practical impact for real-world ambiguous QA systems. Limitations include sensitivity of the RV component to the LLM choice and the need for enhanced ambiguity classifiers and verifiers in future work.
Abstract
The retrieval augmented generation (RAG) framework addresses an ambiguity in user queries in QA systems by retrieving passages that cover all plausible interpretations and generating comprehensive responses based on the passages. However, our preliminary studies reveal that a single retrieval process often suffers from low quality results, as the retrieved passages frequently fail to capture all plausible interpretations. Although the iterative RAG approach has been proposed to address this problem, it comes at the cost of significantly reduced efficiency. To address these issues, we propose the diversify-verify-adapt (DIVA) framework. DIVA first diversifies the retrieved passages to encompass diverse interpretations. Subsequently, DIVA verifies the quality of the passages and adapts the most suitable approach tailored to their quality. This approach improves the QA systems accuracy and robustness by handling low quality retrieval issue in ambiguous questions, while enhancing efficiency.
