Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis
Stefan Bleeck
TL;DR
This work tackles the gap between traditional audiometry and real-world speech understanding in presbycusis by introducing an Automatic Speech Recognition (ASR)–based frequency-specific speech test. The approach simulates hearing loss by acoustically degrading speech and analyzes phoneme-level confusions to generate a detailed impairment profile, enabling granular mapping of frequency-specific deficits. Through a two-phase item curation and diagnostic simulation, the method demonstrates discriminative power between normal-hearing and hearing-impaired listeners in a controlled setting and curates a 200-item test battery focused on high-frequency cues. The framework offers a scalable, objective avenue for deeper speech perception diagnostics and sets the stage for human validation and integration with advanced AI models to enhance clinical precision and efficiency.
Abstract
Traditional audiometry often fails to fully characterize the functional impact of hearing loss on speech understanding, particularly supra-threshold deficits and frequency-specific perception challenges in conditions like presbycusis. This paper presents the development and simulated evaluation of a novel Automatic Speech Recognition (ASR)-based frequency-specific speech test designed to provide granular diagnostic insights. Our approach leverages ASR to simulate the perceptual effects of moderate sloping hearing loss by processing speech stimuli under controlled acoustic degradation and subsequently analyzing phoneme-level confusion patterns. Key findings indicate that simulated hearing loss introduces specific phoneme confusions, predominantly affecting high-frequency consonants (e.g., alveolar/palatal to labiodental substitutions) and leading to significant phoneme deletions, consistent with the acoustic cues degraded in presbycusis. A test battery curated from these ASR-derived confusions demonstrated diagnostic value, effectively differentiating between simulated normal-hearing and hearing-impaired listeners in a comprehensive simulation. This ASR-driven methodology offers a promising avenue for developing objective, granular, and frequency-specific hearing assessment tools that complement traditional audiometry. Future work will focus on validating these findings with human participants and exploring the integration of advanced AI models for enhanced diagnostic precision.
