Breathing New Life into Existing Visualizations: A Natural Language-Driven Manipulation Framework
Can Liu, Jiacheng Yu, Yuhan Guo, Jiayi Zhuang, Yuchu Luo, Xiaoru Yuan
TL;DR
The paper introduces a natural language–driven framework to manipulate existing visualizations by translating user queries into a hierarchical set of visualization tasks and then into a sequence of atomic manipulations performed in situ. It defines a four-level, seven-type manipulation space and a hierarchical task design, powered by a two-stage NL–task translator and manipulation parser, with a knowledge-distillation training approach that uses a large LLM to curate data for a smaller T5 model. A cross-domain NL–task dataset (486 attribute combinations, 65 topics, 5,867 pairs) and extensive evaluation demonstrate high parsing accuracy and effective, smooth visualization transformations validated through a user study. The framework enables flexible, intuitive, in-situ interaction with diverse visualizations, offering practical benefits for data exploration and interpretation.
Abstract
We propose an approach to manipulate existing interactive visualizations to answer users' natural language queries. We analyze the natural language tasks and propose a design space of a hierarchical task structure, which allows for a systematic decomposition of complex queries. We introduce a four-level visualization manipulation space to facilitate in-situ manipulations for visualizations, enabling a fine-grained control over the visualization elements. Our methods comprise two essential components: the natural language-to-task translator and the visualization manipulation parser. The natural language-to-task translator employs advanced NLP techniques to extract structured, hierarchical tasks from natural language queries, even those with varying degrees of ambiguity. The visualization manipulation parser leverages the hierarchical task structure to streamline these tasks into a sequence of atomic visualization manipulations. To illustrate the effectiveness of our approach, we provide real-world examples and experimental results. The evaluation highlights the precision of our natural language parsing capabilities and underscores the smooth transformation of visualization manipulations.
