Table of Contents
Fetching ...

The use of vocal biomarkers in the detection of Parkinson's disease: a robust statistical performance comparison of classic machine learning models

Katia Pires Nascimento do Sacramento, Elliot Q. C. Garcia, Nicéias Silva Vilela, Vinicius P. Sacramento, Tiago A. E. Ferreira

TL;DR

This study addresses early detection of Parkinson's disease using vocal biomarkers derived from MFCC features. It conducts a rigorous, multi-run comparison of a deep neural network against traditional ML models on two public datasets, ensuring robustness with $1000$ independent runs. The DNN consistently achieves the highest accuracies (Italian dataset: $0.9865$, Telemonitoring: $0.9211$) and demonstrates low variability, with nonparametric tests confirming significant differences from most baselines. The findings highlight the potential of DNN-based voice analysis as a non-invasive, scalable screening tool for PD, while also calling for validation on more heterogeneous, multilingual datasets to ensure broad applicability.

Abstract

Parkinson's disease (PD) is a progressive neurodegenerative disorder that, in addition to directly impairing functional mobility, is frequently associated with vocal impairments such as hypophonia and dysarthria, which typically manifest in the early stages. The use of vocal biomarkers to support the early diagnosis of PD presents a non-invasive, low-cost, and accessible alternative in clinical settings. Thus, the objective of this cross-sectional study was to consistently evaluate the effectiveness of a Deep Neural Network (DNN) in distinguishing individuals with Parkinson's disease from healthy controls, in comparison with traditional Machine Learning (ML) methods, using vocal biomarkers. Two publicly available voice datasets were used. Mel-frequency cepstral coefficients (MFCCs) were extracted from the samples, and model robustness was assessed using a validation strategy with 1000 independent random executions. Performance was evaluated using classification statistics. Since normality assumptions were not satisfied, non-parametric tests (Kruskal-Wallis and Bonferroni post-hoc tests) were applied to verify whether the tested classification models were similar or different in the classification of PD. With an average accuracy of $98.65\%$ and $92.11\%$ on the Italian Voice dataset and Parkinson's Telemonitoring dataset, respectively, the DNN demonstrated superior performance and efficiency compared to traditional ML models, while also achieving competitive results when benchmarked against relevant studies. Overall, this study confirms the efficiency of DNNs and emphasizes their potential to provide greater accuracy and reliability for the early detection of neurodegenerative diseases using voice-based biomarkers.

The use of vocal biomarkers in the detection of Parkinson's disease: a robust statistical performance comparison of classic machine learning models

TL;DR

This study addresses early detection of Parkinson's disease using vocal biomarkers derived from MFCC features. It conducts a rigorous, multi-run comparison of a deep neural network against traditional ML models on two public datasets, ensuring robustness with independent runs. The DNN consistently achieves the highest accuracies (Italian dataset: , Telemonitoring: ) and demonstrates low variability, with nonparametric tests confirming significant differences from most baselines. The findings highlight the potential of DNN-based voice analysis as a non-invasive, scalable screening tool for PD, while also calling for validation on more heterogeneous, multilingual datasets to ensure broad applicability.

Abstract

Parkinson's disease (PD) is a progressive neurodegenerative disorder that, in addition to directly impairing functional mobility, is frequently associated with vocal impairments such as hypophonia and dysarthria, which typically manifest in the early stages. The use of vocal biomarkers to support the early diagnosis of PD presents a non-invasive, low-cost, and accessible alternative in clinical settings. Thus, the objective of this cross-sectional study was to consistently evaluate the effectiveness of a Deep Neural Network (DNN) in distinguishing individuals with Parkinson's disease from healthy controls, in comparison with traditional Machine Learning (ML) methods, using vocal biomarkers. Two publicly available voice datasets were used. Mel-frequency cepstral coefficients (MFCCs) were extracted from the samples, and model robustness was assessed using a validation strategy with 1000 independent random executions. Performance was evaluated using classification statistics. Since normality assumptions were not satisfied, non-parametric tests (Kruskal-Wallis and Bonferroni post-hoc tests) were applied to verify whether the tested classification models were similar or different in the classification of PD. With an average accuracy of and on the Italian Voice dataset and Parkinson's Telemonitoring dataset, respectively, the DNN demonstrated superior performance and efficiency compared to traditional ML models, while also achieving competitive results when benchmarked against relevant studies. Overall, this study confirms the efficiency of DNNs and emphasizes their potential to provide greater accuracy and reliability for the early detection of neurodegenerative diseases using voice-based biomarkers.

Paper Structure

This paper contains 15 sections, 4 figures, 4 tables.

Figures (4)

  • Figure 1: Workflow of the experimental protocol used for predictive classification of Parkinson’s disease using vocal biomarkers.
  • Figure 2: Distribution of model performance across 1,000 randomized training runs on the Italian voice dataset.
  • Figure 3: Distribution of model performance across 1,000 randomized training runs on the Parkinson’s Telemonitoring dataset.
  • Figure 4: Post-hoc pairwise comparison using Dunn test (Bonferroni correction).