Audio Adversarial Examples: Targeted Attacks on Speech-to-Text
Nicholas Carlini, David Wagner
TL;DR
Targeted audio adversarial examples are shown to be feasible against state-of-the-art speech-to-text systems by perturbing input audio in a white-box, end-to-end optimization framework. The method operates on raw audio and through MFCC preprocessing and CTC-based decoding, achieving exact target transcriptions with substantial imperceptibility and outputs up to 50 characters per second. The authors compare initial and improved loss formulations, demonstrate robustness challenges and limitations, and explore non-speech to speech and silence-targeting attacks. This work establishes audio as a new domain for adversarial research, prompting development of defenses and transferability studies.
Abstract
We construct targeted audio adversarial examples on automatic speech recognition. Given any audio waveform, we can produce another that is over 99.9% similar, but transcribes as any phrase we choose (recognizing up to 50 characters per second of audio). We apply our white-box iterative optimization-based attack to Mozilla's implementation DeepSpeech end-to-end, and show it has a 100% success rate. The feasibility of this attack introduce a new domain to study adversarial examples.
