Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions
Alberto Testoni, Raquel Fernández
TL;DR
This work examines whether model uncertainty aligns with human uncertainty in a collaborative CoDraw task and finds a weak correspondence between the two. It demonstrates that relying on human clarification decisions as supervision may not optimally resolve model uncertainty, and introduces QDrawer, an uncertainty-based clarification module that generates template-based questions when the model is uncertain. QDrawer achieves substantial gains in task success and calibration (e.g., size accuracy up to 87.3% and SS of 3.40, with improved ECE and Brier scores) compared to baselines. The results support incorporating self-assessed uncertainty into dialogue systems to improve grounding and clarify underspecifications, with implications for broader vision-language collaboration tasks.
Abstract
Clarification questions are an essential dialogue tool to signal misunderstanding, ambiguities, and under-specification in language use. While humans are able to resolve uncertainty by asking questions since childhood, modern dialogue systems struggle to generate effective questions. To make progress in this direction, in this work we take a collaborative dialogue task as a testbed and study how model uncertainty relates to human uncertainty -- an as yet under-explored problem. We show that model uncertainty does not mirror human clarification-seeking behavior, which suggests that using human clarification questions as supervision for deciding when to ask may not be the most effective way to resolve model uncertainty. To address this issue, we propose an approach to generating clarification questions based on model uncertainty estimation, compare it to several alternatives, and show that it leads to significant improvements in terms of task success. Our findings highlight the importance of equipping dialogue systems with the ability to assess their own uncertainty and exploit in interaction.
