Active Task Disambiguation with LLMs
Katarzyna Kobalczyk, Nicolas Astorga, Tennison Liu, Mihaela van der Schaar
TL;DR
This work addresses how large language models handle ambiguously specified tasks by introducing a formal notion of task ambiguity and a Bayesian Experimental Design (BED) based framework for active task disambiguation. The method explicitly samples candidate solutions and candidate clarifying questions, then selects the question that maximizes the expected information gain while accounting for cost, thereby concentrating the solution distribution ${p}_{\phi_h}(\cdot|{\mathcal S})$ toward the true viable set ${\mathcal H}^*$. Empirical results from a 20-questions game and from open-ended code-generation tasks show that BED-based question generation (notably EIG-uniform) significantly outperforms baselines that rely on implicit reasoning about questions, with open questions generally offering higher information gains than yes/no questions. The findings suggest that shifting some reasoning to explicit evaluation over the space of candidate solutions improves task disambiguation and that this approach has broad applicability to interactive AI systems requiring user-guided clarification and more reliable outputs.
Abstract
Despite the impressive performance of large language models (LLMs) across various benchmarks, their ability to address ambiguously specified problems--frequent in real-world interactions--remains underexplored. To address this gap, we introduce a formal definition of task ambiguity and frame the problem of task disambiguation through the lens of Bayesian Experimental Design. By posing clarifying questions, LLM agents can acquire additional task specifications, progressively narrowing the space of viable solutions and reducing the risk of generating unsatisfactory outputs. Yet, generating effective clarifying questions requires LLM agents to engage in a form of meta-cognitive reasoning, an ability LLMs may presently lack. Our proposed approach of active task disambiguation enables LLM agents to generate targeted questions maximizing the information gain. Effectively, this approach shifts the load from implicit to explicit reasoning about the space of viable solutions. Empirical results demonstrate that this form of question selection leads to more effective task disambiguation in comparison to approaches relying on reasoning solely within the space of questions.
