Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design
Ming Gao, Hang Chen, Jun Du, Xin Xu, Hongxiao Guo, Hui Bu, Jianxing Yang, Ming Li, Chin-Hui Lee
TL;DR
The paper tackles inclusive wake-up word spotting (WWS) for dysarthric speakers by releasing the Mandarin Dysarthria Speech Corpus (MDSC) and designing a customized WWS system. The dataset contains 18,630 recordings (17 hours) from 21 dysarthric and 25 control speakers, along with intelligibility annotations and enrollment data, and the paper uses it to demonstrate the limitations of conventional systems on dysarthric speech. A three-tier WWS framework (SIC, SID, SDD) is proposed, with a DS-TCN baseline and data augmentation. Enrollment-based speaker customization yields substantial improvements, especially for moderately intelligible users, while highly unintelligible cases remain challenging. The work advances practical accessibility for dysarthric users in smart-home contexts and lays groundwork for language- and speaker-aware WWS research, with potential societal impact in reducing exclusion from voice-controlled technologies.
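As a rough sanity check on the corpus figures quoted above, the following sketch derives a few averages from them. The function and variable names are hypothetical, the hour count is rounded in the source, and the per-speaker averages assume recordings are spread evenly across speakers, which the summary does not state.

```python
# Figures quoted in the TL;DR; the hour count is rounded, so results are approximate.
TOTAL_RECORDINGS = 18_630
TOTAL_HOURS = 17
DYSARTHRIC_SPEAKERS = 21
CONTROL_SPEAKERS = 25


def corpus_summary(recordings, hours, n_dysarthric, n_control):
    """Derive simple averages from headline corpus statistics.

    Per-speaker values assume an even split across speakers (an assumption,
    not something the source confirms).
    """
    speakers = n_dysarthric + n_control
    return {
        "speakers": speakers,
        "mean_seconds_per_recording": hours * 3600 / recordings,
        "mean_recordings_per_speaker": recordings / speakers,
        "mean_minutes_per_speaker": hours * 60 / speakers,
    }


summary = corpus_summary(
    TOTAL_RECORDINGS, TOTAL_HOURS, DYSARTHRIC_SPEAKERS, CONTROL_SPEAKERS
)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the average recording is a little over three seconds long, which is in line with short wake-up word utterances.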
Abstract
Smart home technology has gained widespread adoption, facilitating effortless control of devices through voice commands. However, individuals with dysarthria, a motor speech disorder, face challenges due to the variability of their speech. This paper addresses the wake-up word spotting (WWS) task for dysarthric individuals, aiming to integrate them into real-world applications. To support this, we release the open-source Mandarin Dysarthria Speech Corpus (MDSC), a dataset designed for dysarthric individuals in home environments. MDSC includes information on age, gender, and disease type, along with intelligibility evaluations. Furthermore, we perform a comprehensive experimental analysis on MDSC, highlighting the challenges encountered. We also develop a customized dysarthria WWS system that is robust to variation in intelligibility and achieves exceptional performance. MDSC will be released at https://www.aishelltech.com/AISHELL_6B.
