The VoicePrivacy 2024 Challenge Evaluation Plan
Natalia Tomashenko, Xiaoxiao Miao, Pierre Champion, Sarina Meyer, Xin Wang, Emmanuel Vincent, Michele Panariello, Nicholas Evans, Junichi Yamagishi, Massimiliano Todisco
TL;DR
This paper presents the VoicePrivacy 2024 Challenge Evaluation Plan, outlining an objective framework for evaluating utterance-level voice anonymization that conceals speaker identity while preserving linguistic content and emotional state. The approach combines a rigorous privacy-utility tradeoff framework with a strong attacker model, fixed downstream task models, and a suite of public baselines (B1–B6) to benchmark progress. It provides a comprehensive data ecosystem, training resources, and a detailed, multilayer evaluation protocol using privacy (EER) and utility (WER, UAR) metrics across multiple operating points, all within an accessible, Python-centered workflow. The plan aims to standardize comparisons, accelerate progress in practical privacy-preserving speech technologies, and enable reproducible assessments in real-world GDPR-like contexts.
Abstract
The task of the challenge is to develop a voice anonymization system for speech data which conceals the speaker's voice identity while protecting linguistic content and emotional states. The organizers provide development and evaluation datasets and evaluation scripts, as well as baseline anonymization systems and a list of training resources formed on the basis of the participants' requests. Participants apply their developed anonymization systems, run evaluation scripts and submit evaluation results and anonymized speech data to the organizers. Results will be presented at a workshop held in conjunction with Interspeech 2024 to which all participants are invited to present their challenge systems and to submit additional workshop papers.
