Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning
Haibin Wu, Xu Li, Andy T. Liu, Zhiyong Wu, Helen Meng, Hung-yi Lee
TL;DR
This paper tackles the vulnerability of ASV systems to adversarial attacks by proposing a self-supervised learning-based reformer (SSLR) framework that operates without knowledge of attack algorithms. It introduces two complementary defenses: adversarial perturbation purification via cascaded SSLR models and adversarial perturbation detection via score-variation analysis across SSLR cascades, supported by a formal evaluation framework. Empirical results show substantial reductions in adversarial success rates and competitive preservation of genuine-sample performance, with ASV fine-tuning further mitigating collateral impact. The work demonstrates practical robustness gains on VoxCeleb data and provides benchmarks to guide future adversarial defenses in speaker verification.
Abstract
Previous works have shown that automatic speaker verification (ASV) is seriously vulnerable to malicious spoofing attacks, such as replay, synthetic speech, and recently emerged adversarial attacks. Great efforts have been dedicated to defending ASV against replay and synthetic speech; however, only a few approaches have been explored to deal with adversarial attacks. All the existing approaches to tackle adversarial attacks for ASV require the knowledge for adversarial samples generation, but it is impractical for defenders to know the exact attack algorithms that are applied by the in-the-wild attackers. This work is among the first to perform adversarial defense for ASV without knowing the specific attack algorithms. Inspired by self-supervised learning models (SSLMs) that possess the merits of alleviating the superficial noise in the inputs and reconstructing clean samples from the interrupted ones, this work regards adversarial perturbations as one kind of noise and conducts adversarial defense for ASV by SSLMs. Specifically, we propose to perform adversarial defense from two perspectives: 1) adversarial perturbation purification and 2) adversarial perturbation detection. Experimental results show that our detection module effectively shields the ASV by detecting adversarial samples with an accuracy of around 80%. Moreover, since there is no common metric for evaluating the adversarial defense performance for ASV, this work also formalizes evaluation metrics for adversarial defense considering both purification and detection based approaches into account. We sincerely encourage future works to benchmark their approaches based on the proposed evaluation framework.
