Design Considerations for Automatic Musical Soundscapes of Visual Art for People with Blindness or Low Vision
Stephen James Krol, Maria Teresa Llano, Matthew Butler, Cagatay Goncu
TL;DR
The paper tackles the challenge of making visual art accessible to blind and low-vision audiences by automating musical soundscapes that convey mood and context. It introduces a two-stage prototype system that first generates emotive music from paintings and then augments it with contextual sound effects, guided by object detection; the system is evaluated through a pilot with sighted and BLV participants and an in-depth qualitative study with BLV participants. Key contributions include an emotion-classification pipeline mapped to the Valence-Arousal space, a Magenta Music Transformer-based music generator, and a context-aware sound-effects database, all analyzed to derive design considerations emphasizing narrative feel, diversity and accuracy of contextual cues, and integration with other accessibility methods. The findings suggest automated soundscapes can produce meaningful aesthetic experiences for BLV users while highlighting design principles such as narrative framing, customization, historical accuracy, ambient immersion, and careful audio-reproduction to ensure representativeness and minimize misrepresentation, with potential for scalable museum deployment.
Abstract
Music has been identified as a promising medium to enhance the accessibility and experience of visual art for people who are blind or have low vision (BLV). However, composing music and designing soundscapes for visual art is a time-consuming, resource intensive process - limiting its scalability for large exhibitions. In this paper, we investigate the use of automated soundscapes to increase the accessibility of visual art. We built a prototype system and ran a qualitative study to evaluate the aesthetic experience provided by the automated soundscapes with 10 BLV participants. From the study, we identified a set of design considerations that reveal requirements from BLV people for the development of automated soundscape systems, setting new directions in which creative systems could enrich the aesthetic experience conveyed by these.
