CLAM: Selective Clarification for Ambiguous Questions with Generative Language Models
Lorenz Kuhn, Yarin Gal, Sebastian Farquhar
TL;DR
CLAM tackles the problem of ambiguous user questions in large language models by enabling selective clarification: detect ambiguity, generate clarifying questions, and answer with the clarification. The authors introduce Ambiguous TriviaQA and an automatic evaluation protocol using an oracle model to simulate user clarifications, showing substantial gains in end-to-end QA accuracy on ambiguous inputs while preserving performance on unambiguous ones. The work frames meta-cognition as a practical strategy for safer model deployment and provides a data-generating evaluation methodology to scale research in multi-turn dialogues.
Abstract
Users often ask dialogue systems ambiguous questions that require clarification. We show that current language models rarely ask users to clarify ambiguous questions and instead provide incorrect answers. To address this, we introduce CLAM: a framework for getting language models to selectively ask for clarification about ambiguous user questions. In particular, we show that we can prompt language models to detect whether a given question is ambiguous, generate an appropriate clarifying question to ask the user, and give a final answer after receiving clarification. We also show that we can simulate users by providing language models with privileged information. This lets us automatically evaluate multi-turn clarification dialogues. Finally, CLAM significantly improves language models' accuracy on mixed ambiguous and unambiguous questions relative to SotA.
