Investigating Color Blind User Interface Accessibility via Simulated Interfaces
Amaan Jamil, Gyorgy Denes
TL;DR
This paper addresses how color vision deficiency (CVD) affects UI usability and aesthetics by employing a simulation-based protocol that uses physiologically grounded CVD models. Non-CVD observers evaluated 20 popular UIs under three CVD simulants across standard and high-contrast modes, yielding 120 stimuli and mean opinion scores (MOS) for functionality alongside binary aesthetics judgments. The study finds a positive correlation between functionality and aesthetics ($r=0.74$) and shows that OS-wide high-contrast mode generally degrades both metrics, though some UIs (e.g., Slack with built-in CVD themes) can mitigate these effects. These results offer practical guidance for UI designers to anticipate CVD-related usability issues and advocate for targeted, CVD-aware design, while providing a dataset for further research and a simple AAA/AA/AAA classification framework to benchmark interfaces.
Abstract
Over 300 million people who live with color vision deficiency (CVD) have a decreased ability to distinguish between colors, limiting their ability to interact with websites and software packages. User interface designers have taken various approaches to tackle the issue with most offering a high contrast mode. The Web Content Accessibility Guidelines (WCAG) outline some best practices for maintaining accessibility that have been adopted and recommended by several governments; however, it is currently uncertain how this impacts perceived user functionality and if this could result in a reduced aesthetic look. In the absence of subjective data, we aim to investigate how a CVD observer might rate the functionality and aesthetics of existing UIs. However, the design of a comparative study of CVD vs. non-CVD populations is inherently hard, therefore we build on the successful field of physiologically-based CVD models, and propose a novel simulation-based experimental protocol, where non-CVD observers rate the relative aesthetics and functionality of screenshots of 20 popular websites as seen in full color vs. with simulated CVD. Our results show that relative aesthetics and functionality correlate positively and that an operating-system-wide high contrast mode can reduce both aesthetics and functionality. While our results are only valid in the context of simulated CVD screenshots, the approach has the benefit of being easily deployable, and can help to spot a number of common pitfalls in production. Finally, we propose a AAA-A classification of the interfaces we analyzed.
