VizAbility: Enhancing Chart Accessibility with LLM-based Conversational Interaction
Joshua Gorniak, Yoon Kim, Donglai Wei, Nam Wook Kim
TL;DR
VizAbility addresses the data-visualization accessibility gap for blind and low-vision users by marrying keyboard-navigable chart encodings with an LLM-based conversational QA system. It leverages Olli's text-based chart tree, Vega-Lite data, and context-aware prompting to support visual, analytical, contextual, and navigation queries, with proactive and reactive mechanisms to mitigate LLM failures. The approach is evaluated through a Q&A benchmark (87.39% classification accuracy; strong human/LLM agreement with Kendall's $\tau$=$0.5526$, $p<0.001$) and a qualitative user study with six BLV participants, showing meaningful usability gains and specific improvement opportunities. Findings indicate VizAbility outperforms baselines (including GPT-4V), enhances transparency about sources and reasoning, and demonstrates potential for integration into existing visualization workflows, while identifying directions for richer benchmarks and vision integration to broaden applicability.
Abstract
Traditional accessibility methods like alternative text and data tables typically underrepresent data visualization's full potential. Keyboard-based chart navigation has emerged as a potential solution, yet efficient data exploration remains challenging. We present VizAbility, a novel system that enriches chart content navigation with conversational interaction, enabling users to use natural language for querying visual data trends. VizAbility adapts to the user's navigation context for improved response accuracy and facilitates verbal command-based chart navigation. Furthermore, it can address queries for contextual information, designed to address the needs of visually impaired users. We designed a large language model (LLM)-based pipeline to address these user queries, leveraging chart data & encoding, user context, and external web knowledge. We conducted both qualitative and quantitative studies to evaluate VizAbility's multimodal approach. We discuss further opportunities based on the results, including improved benchmark testing, incorporation of vision models, and integration with visualization workflows.
