Trading Devil: Robust backdoor attack via Stochastic investment models and Bayesian approach
Orson Mengara
TL;DR
This paper addresses backdoor attacks on audio speech recognition by introducing MarketBack, a robust clean-label backdoor that leverages stochastic investment models to encode malicious content in audio. It fuses Bayesian diffusion sampling with drift functions from the Vasicek, Hull-White, and Longstaff-Schwartz models to design and sample poisoned data, enabling attacks that survive outsourced training. Experiments on the GTZAN dataset across seven Hugging Face transformer models show MarketBack achieving near-100% attack success while preserving benign accuracy at poisoning rates below 1%. The work highlights security risks for audio systems relying on outsourced data and provides a Bayesian-diffusion framework for evaluating and understanding audio backdoors under drift-driven dynamics.
Abstract
With the growing use of voice-activated systems and speech recognition technologies, the danger of backdoor attacks on audio data has grown significantly. This research looks at a specific type of attack, known as a Stochastic investment-based backdoor attack (MarketBack), in which adversaries strategically manipulate the stylistic properties of audio to fool speech recognition systems. The security and integrity of machine learning models are seriously threatened by backdoor attacks, in order to maintain the reliability of audio applications and systems, the identification of such attacks becomes crucial in the context of audio data. Experimental results demonstrated that MarketBack is feasible to achieve an average attack success rate close to 100% in seven victim models when poisoning less than 1% of the training data.
