DeepVIS: Bridging Natural Language and Data Visualization Through Step-wise Reasoning
Zhihao Shuai, Boyan Li, Siyu Yan, Yuyu Luo, Weikai Yang
TL;DR
This work tackles transparency and refinement challenges in NL2VIS by integrating Chain-of-Thought reasoning into the NL2VIS pipeline. It introduces nvBench-CoT to capture explicit step-by-step reasoning and DeepVIS, an interactive interface that exposes and lets users edit the reasoning chain, enabling targeted refinements. Quantitative results show state-of-the-art All Acc and strong axis/data accuracy, while user studies confirm enhanced trust and usability through reasoning transparency. Overall, the approach advances human–AI collaboration in visualization by coupling automated generation with interpretable, controllable reasoning processes across data-to-visualization tasks.
Abstract
Although data visualization is powerful for revealing patterns and communicating insights, creating effective visualizations requires familiarity with authoring tools and often disrupts the analysis flow. While large language models show promise for automatically converting analysis intent into visualizations, existing methods function as black boxes without transparent reasoning processes, which prevents users from understanding design rationales and refining suboptimal outputs. To bridge this gap, we propose integrating Chain-of-Thought (CoT) reasoning into the Natural Language to Visualization (NL2VIS) pipeline. First, we design a comprehensive CoT reasoning process for NL2VIS and develop an automatic pipeline to equip existing datasets with structured reasoning steps. Second, we introduce nvBench-CoT, a specialized dataset capturing detailed step-by-step reasoning from ambiguous natural language descriptions to finalized visualizations, which enables state-of-the-art performance when used for model fine-tuning. Third, we develop DeepVIS, an interactive visual interface that tightly integrates with the CoT reasoning process, allowing users to inspect reasoning steps, identify errors, and make targeted adjustments to improve visualization outcomes. Quantitative benchmark evaluations, two use cases, and a user study collectively demonstrate that our CoT framework effectively enhances NL2VIS quality while providing insightful reasoning steps to users.
