Accessible Text Descriptions for UpSet Plots
Andrew McNutt, Maggie K McCracken, Ishrat Jahan Eliza, Daniel Hajas, Jake Wagoner, Nate Lanza, Jack Wilburn, Sarah Creem-Regehr, Alexander Lex
TL;DR
The paper tackles the accessibility barrier of UpSet plots for blind and low-vision users by developing an automatic, pattern-driven text description system. It starts with a survey of UpSet usage to identify describable patterns, then designs long and short descriptions grounded in semantic levels of context, implemented via a JSON-based standard and a web API. The authors validate the approach through BLV interviews and two contextual experiments with sighted users and LLM baselines, finding that text descriptions can be informative and, when paired with visuals, offer a curb-cut benefit to multiple user groups. The work contributes an open-source description generator, a scalable evaluation framework, and a pathway to extending text-based accessibility to other nonstandard chart types, advancing inclusive scientific communication.
Abstract
Data visualizations are typically not accessible to blind and low-vision (BLV) users. Automatically generating text descriptions offers an enticing mechanism for democratizing access to the information held in complex scientific charts, yet appropriate procedures for generating those texts remain elusive. Pursuing this issue, we study a single complex chart form: UpSet plots. UpSet Plots are a common way to analyze set data, an area largely unexplored by prior accessibility literature. By analyzing the patterns present in real-world examples, we develop a system for automatically captioning any UpSet plot. We evaluated the utility of our captions via semi-structured interviews with (N=11) BLV users and found that BLV users find them informative. In extensions, we find that sighted users can use our texts similarly to UpSet plots and that they are better than naive LLM usage.
