Are Deep Speech Denoising Models Robust to Adversarial Noise?
Will Schwarzer, Philip S. Thomas, Andrea Fanelli, Xiaoyu Liu
TL;DR
This study demonstrates that four prominent deep-noise-suppression models are vulnerable to imperceptible adversarial perturbations, capable of making outputs gibberish or near-targeted speech under white-box conditions and even in simulated over-the-air settings. The authors develop an attack framework based on psychoacoustic masking and projected gradient descent, using STOI as the optimization objective, and show that defenses like simple Gaussian noise provide only limited protection. They analyze four DNS architectures (Demucs, FSN+, FRCRN, MP-SENet) across untargeted and targeted attacks, revealing that model-transferability of attacks is generally weak and that FSN+ exhibits pseudo-robustness due to gradient instability. The work highlights a practical security concern for DNS systems in real-world use (e.g., communication, hearing aids) and emphasizes the need for stronger defenses and broader threat-model testing. Overall, the paper expands adversarial evaluation beyond ASR to generative audio denoising, demonstrating both the feasibility and the limitations of current attacks and defenses.
Abstract
Deep noise suppression (DNS) models enjoy widespread use throughout a variety of high-stakes speech applications. However, in this paper, we show that four recent DNS models can each be reduced to outputting unintelligible gibberish through the addition of imperceptible adversarial noise. Furthermore, our results show the near-term plausibility of targeted attacks, which could induce models to output arbitrary utterances, and over-the-air attacks. While the success of these attacks varies by model and setting, and attacks appear to be strongest when model-specific (i.e., white-box and non-transferable), our results highlight a pressing need for practical countermeasures in DNS systems.
