Improving the Speaker Anonymization Evaluation's Robustness to Target Speakers with Adversarial Learning
Carlos Franzreb, Arnab Das, Tim Polzehl, Sebastian Möller
TL;DR
The paper addresses overestimated privacy in speaker anonymization evaluations that arise when same-gender target selection is used, due to leaked target information in anonymized speech. It introduces a target classifier trained alongside the source recognizer and employs a gradient reversal layer to adversarially suppress target information, improving the reliability of privacy assessments. Experiments across two anonymizers and TSAs show that target information is disproportionately encoded and that the proposed adversarial approach yields more consistent privacy estimates, especially under same-gender TSA; results generalize across evaluation setups. This approach provides a practical, low-overhead improvement to privacy evaluation, enabling more trustworthy assessments and better-informed deployment decisions for anonymization systems.
Abstract
The current privacy evaluation for speaker anonymization often overestimates privacy when a same-gender target selection algorithm (TSA) is used, although this TSA leaks the speaker's gender and should hence be more vulnerable. We hypothesize that this occurs because the evaluation does not account for the fact that anonymized speech contains information from both the source and target speakers. To address this, we propose to add a target classifier that measures the influence of target speaker information in the evaluation, which can also be removed with adversarial learning. Experiments demonstrate that this approach is effective for multiple anonymizers, particularly when using a same-gender TSA, leading to a more reliable assessment.
