ACCESS: Prompt Engineering for Automated Web Accessibility Violation Corrections
Calista Huang, Alyssa Ma, Suchir Vyasamudri, Eugenie Puype, Sayem Kamal, Juan Belza Garcia, Salar Cheema, Michael Lutz
TL;DR
This work targets the pervasive problem of WCAG violations by introducing ACCESS, a pipeline that automatically corrects web accessibility violations through real-time DOM modifications guided by foundation models. It constructs the ACCESS Benchmark to quantitatively evaluate corrections, using a five-level severity scale and aggregate scores, and explores three prompt-engineering strategies (ReAct, Few-Shot Guided, and Transformer-based prompts) to generate corrected HTML that is applied to the DOM. The results show a substantial reduction in accessibility violations, with a best-case reduction of about 51% in severity using ReAct prompting on GPT-3.5-turbo-16K, and a GPT-4 baseline offering additional gains on a smaller subset. The work highlights practical implications for end-user accessibility tools and outlines future directions, including dataset expansion and multimodal approaches, to further automate and generalize web accessibility repairs.
Abstract
With the increasing need for inclusive and user-friendly technology, web accessibility is crucial to ensuring equal access to online content for individuals with disabilities, including visual, auditory, cognitive, or motor impairments. Despite the existence of accessibility guidelines and standards such as Web Content Accessibility Guidelines (WCAG) and the Web Accessibility Initiative (W3C), over 90% of websites still fail to meet the necessary accessibility requirements. For web users with disabilities, there exists a need for a tool to automatically fix web page accessibility errors. While research has demonstrated methods to find and target accessibility errors, no research has focused on effectively correcting such violations. This paper presents a novel approach to correcting accessibility violations on the web by modifying the document object model (DOM) in real time with foundation models. Leveraging accessibility error information, large language models (LLMs), and prompt engineering techniques, we achieved greater than a 51% reduction in accessibility violation errors after corrections on our novel benchmark: ACCESS. Our work demonstrates a valuable approach toward the direction of inclusive web content, and provides directions for future research to explore advanced methods to automate web accessibility.
