A Transversal Study of Fundamental Frequency Contours in Parkinsonian Voices
Pablo Rodriguez-Perez, Ruben Fraile, Miguel Garcia-Escrig, Nicolas Saenz-Lechon, Juana M. Gutierrez-Arriola, Victor Osma-Ruiz
TL;DR
The paper addresses how Parkinson's disease alters speech prosody, focusing on fundamental frequency contours during read speech. It combines traditional $f_o$ statistics with a modulation-spectrum analysis to capture slow and fast pitch variations, and it accounts for inter-subject variability by analyzing men and women separately. The study finds that relative pitch range $\sigma_{f_o}/\mu_{f_o}$ correlates with PD stage, especially in women, while modulation-band energies ($LFER$, $MFER$, $HFER$) also relate to disease progression; however, regression models achieve modest explanatory power ($R^2$ about 0.37 overall, higher in women, lower in men) and yield only moderate diagnostic utility (AUC ~0.634, EER ~36%). Overall, intonation descriptors offer some PD-relevant information but are insufficient for reliable early diagnosis without additional markers, and they underscore the need for sex-specific analyses in PD voice studies.
Abstract
A transversal study of the pitch variability of parkinsonian voices in read speech is presented. 30 patients suffering from Parkinson's disease (PD) and 32 healthy speakers were recorded while reading a text without voiceless phonemes. The fundamental frequency contours were calculated from the recordings, and the following measures were used for describing them: mean, minimum, maximum, and standard deviation of the estimated fundamental frequencies. Results based on these measures indicate that the influence of PD on some aspects of intonation can be masked by the effects of aging, especially for male voices. However, some parameters such as the relative fundamental frequency range exhibit lower correlations with age than with PD stage, as evaluated using the Hoehn and Yahr scale. These correlations between relative fundamental frequency range and PD stage reach moderate-to-high values in the case of women. Additionally, three parameters describing the form of the fundamental frequency modulation spectrum were investigated for correlation with age and PD stage. The study of this modulation spectrum provides some insight into the ability of the speakers to plan the intonation of full phrases. For both male and female populations, significant correlations were found between parameters obtained from the modulation spectrum of fundamental frequency and the PD stage. Nevertheless, the quantitative assessment of the performance of regression models built from these modulation parameters and fundamental frequency range suggests that such measures are likely to be of limited value in the early diagnosis of PD due to inter-speaker variability.
