Table of Contents
Fetching ...

LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

Victor Dibia

TL;DR

LIDA tackles automatic visualization generation by orchestrating four modules—summarizer, goal explorer, visGenerator, and infographer—to convert data into grounding language, propose goals, produce executable visualization code across multiple grammars, and render data-faithful, stylized infographics. It leverages LLMS for goal reasoning and code construction and IGMs for styling, all within a grammar-agnostic pipeline, and introduces ver and sevq as reliability and quality metrics. The authors demonstrate a low ver (~3.5%) across 57 Vega-derived datasets, with ablation showing benefits from enriched data summaries and multi-grammar support. The work provides an open-source Python API and UI, positioning LIDA as a modular building block for automated data exploration, storytelling, accessibility, and chart QA.

Abstract

Systems that support users in the automatic creation of visualizations must address several subtasks - understand the semantics of data, enumerate relevant visualization goals and generate visualization specifications. In this work, we pose visualization generation as a multi-stage generation problem and argue that well-orchestrated pipelines based on large language models (LLMs) such as ChatGPT/GPT-4 and image generation models (IGMs) are suitable to addressing these tasks. We present LIDA, a novel tool for generating grammar-agnostic visualizations and infographics. LIDA comprises of 4 modules - A SUMMARIZER that converts data into a rich but compact natural language summary, a GOAL EXPLORER that enumerates visualization goals given the data, a VISGENERATOR that generates, refines, executes and filters visualization code and an INFOGRAPHER module that yields data-faithful stylized graphics using IGMs. LIDA provides a python api, and a hybrid user interface (direct manipulation and multilingual natural language) for interactive chart, infographics and data story generation. Learn more about the project here - https://microsoft.github.io/lida/

LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

TL;DR

LIDA tackles automatic visualization generation by orchestrating four modules—summarizer, goal explorer, visGenerator, and infographer—to convert data into grounding language, propose goals, produce executable visualization code across multiple grammars, and render data-faithful, stylized infographics. It leverages LLMS for goal reasoning and code construction and IGMs for styling, all within a grammar-agnostic pipeline, and introduces ver and sevq as reliability and quality metrics. The authors demonstrate a low ver (~3.5%) across 57 Vega-derived datasets, with ablation showing benefits from enriched data summaries and multi-grammar support. The work provides an open-source Python API and UI, positioning LIDA as a modular building block for automated data exploration, storytelling, accessibility, and chart QA.

Abstract

Systems that support users in the automatic creation of visualizations must address several subtasks - understand the semantics of data, enumerate relevant visualization goals and generate visualization specifications. In this work, we pose visualization generation as a multi-stage generation problem and argue that well-orchestrated pipelines based on large language models (LLMs) such as ChatGPT/GPT-4 and image generation models (IGMs) are suitable to addressing these tasks. We present LIDA, a novel tool for generating grammar-agnostic visualizations and infographics. LIDA comprises of 4 modules - A SUMMARIZER that converts data into a rich but compact natural language summary, a GOAL EXPLORER that enumerates visualization goals given the data, a VISGENERATOR that generates, refines, executes and filters visualization code and an INFOGRAPHER module that yields data-faithful stylized graphics using IGMs. LIDA provides a python api, and a hybrid user interface (direct manipulation and multilingual natural language) for interactive chart, infographics and data story generation. Learn more about the project here - https://microsoft.github.io/lida/
Paper Structure (26 sections, 1 equation, 11 figures, 1 table)

This paper contains 26 sections, 1 equation, 11 figures, 1 table.

Figures (11)

  • Figure 1: lida generates visualizations and infographics across 4 modules - data summarization, goal exploration, visualization generation and infographics generations. Example output from each module is shown.
  • Figure 2: Example data-faithful infographics and associated style prompts generated with lida.
  • Figure 3: The summarizer module constructs a nl summary from extracted data properties (atomic types, field statistics) and an optional LLM enrichment (predicted field descriptions, semantic types).
  • Figure 4: A goal generated by lida is a JSON data structure that contains a question, a visualization and a rationale.
  • Figure 5: The visGenerator module constructs visualization code scaffolds, fills a constrained section ($<stub>$) and executes the scaffold.
  • ...and 6 more figures