Sustained Vowels for Pre- vs Post-Treatment COPD Classification
Andreas Triantafyllopoulos, Anton Batliner, Wolfgang Mayr, Markus Fendler, Florian Pokorny, Maurice Gerczuk, Shahin Amiriparian, Thomas Berghaus, Björn Schuller
TL;DR
This study investigates whether sustained vowels can augment automatic COPD monitoring by distinguishing pre- vs post-treatment states. Using a cohort of 50 COPD patients producing five cardinal vowels and a German read passage, the authors extract 88 acoustic features (eGeMAPS) across whole vowels and segmented onset/centre/offset portions, with both mismatched and matched duration strategies. A nested leave-one-speaker-out SVM with speaker normalisation demonstrates that sustained vowels add complementary information to read speech, achieving a best $UAR = 79\%$ via late fusion of vowel, MPT, and phrase-based predictions, highlighting the role of voice quality, loudness dynamics, and formant dispersion in COPD manifestations. The work provides interpretable feature insights and supports integrating multiple vocal modalities for noninvasive COPD assessment, while noting the limitation of a relatively small dataset and arguing for broader multimodal future work.
Abstract
Chronic obstructive pulmonary disease (COPD) is a serious inflammatory lung disease affecting millions of people around the world. Due to an obstructed airflow from the lungs, it also becomes manifest in patients' vocal behaviour. Of particular importance is the detection of an exacerbation episode, which marks an acute phase and often requires hospitalisation and treatment. Previous work has shown that it is possible to distinguish between a pre- and a post-treatment state using automatic analysis of read speech. In this contribution, we examine whether sustained vowels can provide a complementary lens for telling apart these two states. Using a cohort of 50 patients, we show that the inclusion of sustained vowels can improve performance to up to 79\% unweighted average recall, from a 71\% baseline using read speech. We further identify and interpret the most important acoustic features that characterise the manifestation of COPD in sustained vowels.
