Continual Audio Deepfake Detection via Universal Adversarial Perturbation
Wangjie Li, Lin Li, Qingyang Hong
TL;DR
This work tackles the problem of continual audio deepfake detection under evolving spoofing attacks by introducing Universal Adversarial Perturbation (UAP) into a continual learning framework. The method generates a UAP from the previous model and uses it as a pseudo-spoofed signal during fine-tuning of a pre-trained self-supervised audio model (WavLM), with distillation to preserve prior distributions on real samples. The authors show that feature-level UAPs outperform waveform-level perturbations in maintaining memory of past spoofing distributions, achieving better cross-domain generalization across multiple public datasets (e.g., ASVspoof and CFAD) while avoiding storing historical data. Overall, the approach provides an efficient, data-leakage-free pathway to robust continual learning in audio anti-spoofing, with significant gains over standard sequence fine-tuning. The work also highlights the practical impact of adversarial perturbations as a discriminative feature for spoofing in the context of evolving threats.
Abstract
The rapid advancement of speech synthesis and voice conversion technologies has raised significant security concerns in multimedia forensics. Although current detection models demonstrate impressive performance, they struggle to maintain effectiveness against constantly evolving deepfake attacks. Additionally, continually fine-tuning these models using historical training data incurs substantial computational and storage costs. To address these limitations, we propose a novel framework that incorporates Universal Adversarial Perturbation (UAP) into audio deepfake detection, enabling models to retain knowledge of historical spoofing distribution without direct access to past data. Our method integrates UAP seamlessly with pre-trained self-supervised audio models during fine-tuning. Extensive experiments validate the effectiveness of our approach, showcasing its potential as an efficient solution for continual learning in audio deepfake detection.
