Table of Contents
Fetching ...

Evaluating Parkinson's Disease Detection in Anonymized Speech: A Performance and Acoustic Analysis

Carlos Franzreb, Francisco Teixeira, Ben Luks, Sebastian Möller, Alberto Abad

TL;DR

This work assesses the trade-off between privacy and PD detection for two anonymizers (STT-TTS and kNN-VC) using two Spanish datasets to demonstrate that privacy-preserving PD detection is viable when using appropriate anonymization.

Abstract

Automatic detection of Parkinson's disease (PD) from speech is a promising non-invasive diagnostic tool, but it raises significant privacy concerns. Speaker anonymization mitigates these risks, but it may suppress the pathological information necessary for PD detection. We assess the trade-off between privacy and PD detection for two anonymizers (STT-TTS and kNN-VC) using two Spanish datasets. STT-TTS provides better privacy but severely degrades PD detection by eradicating prosodic information. kNN-VC preserves macro-prosodic features such as duration and F0 contours, achieving F1 scores only 3-7\% lower than original baselines, demonstrating that privacy-preserving PD detection is viable when using appropriate anonymization. Finally, an acoustic distortion analysis characterizes specific weaknesses in kNN-VC, offering insights for designing anonymizers that better preserve PD information.

Evaluating Parkinson's Disease Detection in Anonymized Speech: A Performance and Acoustic Analysis

TL;DR

This work assesses the trade-off between privacy and PD detection for two anonymizers (STT-TTS and kNN-VC) using two Spanish datasets to demonstrate that privacy-preserving PD detection is viable when using appropriate anonymization.

Abstract

Automatic detection of Parkinson's disease (PD) from speech is a promising non-invasive diagnostic tool, but it raises significant privacy concerns. Speaker anonymization mitigates these risks, but it may suppress the pathological information necessary for PD detection. We assess the trade-off between privacy and PD detection for two anonymizers (STT-TTS and kNN-VC) using two Spanish datasets. STT-TTS provides better privacy but severely degrades PD detection by eradicating prosodic information. kNN-VC preserves macro-prosodic features such as duration and F0 contours, achieving F1 scores only 3-7\% lower than original baselines, demonstrating that privacy-preserving PD detection is viable when using appropriate anonymization. Finally, an acoustic distortion analysis characterizes specific weaknesses in kNN-VC, offering insights for designing anonymizers that better preserve PD information.
Paper Structure (11 sections, 1 figure, 2 tables)

This paper contains 11 sections, 1 figure, 2 tables.

Figures (1)

  • Figure 1: EMD and MI per feature, for the monologue task of PC-GITA.