EMALG: An Enhanced Mandarin Lombard Grid Corpus with Meaningful Sentences
Baifeng Li, Qingmu Liu, Yuhong Yang, Hongyang Chen, Weiping Tu, Song Lin
TL;DR
This work introduces EMALG, an enhanced Mandarin Lombard grid corpus with meaningful sentences to overcome the limitations of MALG and enable robust Lombard-effect analysis in Mandarin. The authors collect data from 34 native Mandarin speakers producing 10,200 meaningful utterances across three noise levels, extract phonetic and acoustic parameters, and compare results with the English Lombard Grid. They show that meaningful sentences elicit stronger Lombard effects, with notable gender differences (females showing larger adjustments) and Mandarin-specific patterns in vowel and consonant adjustments. The findings corroborate prior English–Mandarin comparisons while emphasizing language-specific adaptations, with potential benefits for speech recognition, enhancement, and voice-conversion systems in noisy environments.
Abstract
This study investigates the Lombard effect, where individuals adapt their speech in noisy environments. We introduce an enhanced Mandarin Lombard grid (EMALG) corpus with meaningful sentences , enhancing the Mandarin Lombard grid (MALG) corpus. EMALG features 34 speakers and improves recording setups, addressing challenges faced by MALG with nonsense sentences. Our findings reveal that in Mandarin, meaningful sentences are more effective in enhancing the Lombard effect. Additionally, we uncover that female exhibit a more pronounced Lombard effect than male when uttering meaningful sentences. Moreover, our results reaffirm the consistency in the Lombard effect comparison between English and Mandarin found in previous research.
