Spoofing attack augmentation: can differently-trained attack models improve generalisation?
Wanying Ge, Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Nicholas Evans
TL;DR
This work addresses the vulnerability of deepfake/spoofing countermeasures to unpredictable, machine-generated attacks. It analyzes a VITS-based spoofing strategy against three countermeasures (AASIST, RawNet2, SSL-AASIST) and demonstrates that artefacts vary with attack training conditions, potentially degrading detection. The authors propose attack-augmentation—training CM models with spoof data from multiple, differently configured attack algorithms—and show it improves generalisation, especially for AASIST and SSL-AASIST. The findings highlight attack-augmentation as a practical tool to bolster robustness and motivate broader evaluation across diverse attack paradigms.
Abstract
A reliable deepfake detector or spoofing countermeasure (CM) should be robust in the face of unpredictable spoofing attacks. To encourage the learning of more generaliseable artefacts, rather than those specific only to known attacks, CMs are usually exposed to a broad variety of different attacks during training. Even so, the performance of deep-learning-based CM solutions are known to vary, sometimes substantially, when they are retrained with different initialisations, hyper-parameters or training data partitions. We show in this paper that the potency of spoofing attacks, also deep-learning-based, can similarly vary according to training conditions, sometimes resulting in substantial degradations to detection performance. Nevertheless, while a RawNet2 CM model is vulnerable when only modest adjustments are made to the attack algorithm, those based upon graph attention networks and self-supervised learning are reassuringly robust. The focus upon training data generated with different attack algorithms might not be sufficient on its own to ensure generaliability; some form of spoofing attack augmentation at the algorithm level can be complementary.
