Identifying & Interactively Refining Ambiguous User Goals for Data Visualization Code Generation

Mert İnan; Anthony Sicilia; Alex Xie; Saujas Vaduguru; Daniel Fried; Malihe Alikhani

TL;DR

Ambiguity in natural-language-to-visualization code generation is quantified via the difference $E(I, \mathbb{C}(U)) - E(I, \mathbb{C}(U^*))$, motivating a director-coder framework that treats code generation as a cooperative dialogue. The authors develop a multimodal taxonomy of plotting-domain ambiguity and a set of automatic metrics to identify it, showing that these metrics align with human ambiguity annotations better than standard uncertainty baselines. They evaluate pragmatics-inspired dialogue strategies (Cooperative, Discoursive, Inquisitive) with GPT-4o on DS-1000 Matplotlib problems, showing that interactive dialogue improves code accuracy and targets ambiguities more effectively than non-dialogue baselines. The work provides a principled approach to aligning user goals with generated visualization code and points to practical implications for next-generation interactive coding assistants.

Abstract

Establishing shared goals is a fundamental step in human-AI communication. However, ambiguities can lead to outputs that seem correct but fail to reflect the speaker's intent. In this paper, we explore this issue with a focus on the data visualization domain, where ambiguities in natural language impact the generation of code that visualizes data. The availability of multiple views of the context (e.g., the intended plot and the code rendering the plot) allows for a unique and comprehensive analysis of diverse ambiguity types. We develop a taxonomy of the types of ambiguity that arise in this task and propose metrics to quantify them. Using Matplotlib problems from the DS-1000 dataset, we demonstrate that our ambiguity metrics correlate better with human annotations than uncertainty baselines do. Our work also explores how multi-turn dialogue can reduce ambiguity and thereby improve code accuracy by better matching user goals. We evaluate three pragmatic models to inform our dialogue strategies: Gricean Cooperativity, Discourse Representation Theory, and Questions under Discussion. A simulated user study reveals how pragmatic dialogues reduce ambiguity and enhance code accuracy, highlighting the value of multi-turn exchanges in code generation.
