Accessible Color Sequences for Data Visualization
Matthew A. Petroff
TL;DR
This work addresses the need for color sequences in data visualization that are both aesthetically pleasing and accessible to individuals with color-vision deficiencies. It combines a crowdsourced aesthetic-preference framework with rigorous perceptual constraints in CAM02-UCS and deficiency simulations to generate six-, eight-, and ten-color sequences, including considerations for grayscale readability and color naming. A conjoined neural-network approach trained on pairwise survey data yields scores used to select near-optimal sequences, which are further refined by a sequence-accessibility metric that accounts for perceptual and lightness distances. The resulting color sequences, shown to outperform many common defaults in accessibility, provide robust defaults for plotting libraries while remaining interpretable and describable in verbal and written descriptions.
Abstract
Color sequences, ordered sets of colors for data visualization, that balance aesthetics with accessibility considerations are presented. In order to model aesthetic preference, data were collected with an online survey, and the results were used to train a machine-learning model. To ensure accessibility, this model was combined with minimum-perceptual-distance constraints, including for simulated color-vision deficiencies, as well as with minimum-lightness-distance constraints for grayscale printing, maximum-lightness constraints for maintaining contrast with a white background, and scores from a color-saliency model for ease of use of the colors in verbal and written descriptions. Optimal color sequences containing six, eight, and ten colors were generated using the data-driven aesthetic-preference model and accessibility constraints. Due to the balance of aesthetics and accessibility considerations, the resulting color sequences can serve as reasonable defaults in data-plotting codes, e.g., for use in scatter plots and line plots.
