Hearing Loss Detection from Facial Expressions in One-on-one Conversations
Yufeng Yin, Ishwarya Ananthabhotla, Vamsi Krishna Ithapu, Stavros Petridis, Yu-Hsiang Wu, Christi Miller
TL;DR
The paper addresses detecting hearing loss from a subject’s facial expressions in one-on-one conversations. It introduces a self-supervised variation modeling approach to capture within-subject expression changes across noise levels, and employs adversarial representation learning via a gradient reversal layer to mitigate age-related bias. The method leverages a Marlin-based facial encoder, a variation encoder, and an HL classifier, achieving superior F1-scores on a large egocentric dataset (RLR-CHAT) compared to baselines. Findings show that age bias can degrade performance for younger subjects, which is effectively mitigated by the proposed ABM strategy, enabling more accurate real-time hearing loss detection in social interactions. Overall, this work provides a practical framework for nonverbal-behavior-based monitoring to support timely interventions such as communication strategies or hearing-aid adjustments.
Abstract
Individuals with impaired hearing experience difficulty in conversations, especially in noisy environments. This difficulty often manifests as a change in behavior and may be captured via facial expressions, such as the expression of discomfort or fatigue. In this work, we build on this idea and introduce the problem of detecting hearing loss from an individual's facial expressions during a conversation. Building machine learning models that can represent hearing-related facial expression changes is a challenge. In addition, models need to disentangle spurious age-related correlations from hearing-driven expressions. To this end, we propose a self-supervised pre-training strategy tailored for the modeling of expression variations. We also use adversarial representation learning to mitigate the age bias. We evaluate our approach on a large-scale egocentric dataset with real-world conversational scenarios involving subjects with hearing loss and show that our method for hearing loss detection achieves superior performance over baselines.
