BadSNN: Backdoor Attacks on Spiking Neural Networks via Adversarial Spiking Neuron
Abdullah Arafat Miah, Kevin Vu, Yu Bi
TL;DR
This work addresses backdoor threats in Spiking Neural Networks by exploiting the hyperparameters of spiking neurons, notably $V_{\text{thr}}$ and $\tau$, to embed a backdoor without input data poisoning. The authors propose BadSNN, a two-part approach combining malicious spike poisoning during training with a trigger-optimization pipeline (via $\mathcal{T}_p$ and $\mathcal{T}_o$) to activate the backdoor on inference, extended to neuromorphic data with a temporally varying trigger $\mathcal{T}_s$. Empirical results across four datasets and multiple architectures show strong attack effectiveness (high ASR and controlled CA degradation) and robustness to several defenses, with ablations illustrating parameter regimes that balance stealth and impact. The findings underscore a new threat surface in SNNs and emphasize the need for defenses that account for spiking-neuron hyperparameters and trigger-learning dynamics.
Abstract
Spiking Neural Networks (SNNs) are energy-efficient counterparts of Deep Neural Networks (DNNs) with high biological plausibility, as information is transmitted through temporal spiking patterns. The core element of an SNN is the spiking neuron, which converts input data into spikes following the Leaky Integrate-and-Fire (LIF) neuron model. This model includes several important hyperparameters, such as the membrane potential threshold and membrane time constant. Both the DNNs and SNNs have proven to be exploitable by backdoor attacks, where an adversary can poison the training dataset with malicious triggers and force the model to behave in an attacker-defined manner. Yet, how an adversary can exploit the unique characteristics of SNNs for backdoor attacks remains underexplored. In this paper, we propose \textit{BadSNN}, a novel backdoor attack on spiking neural networks that exploits hyperparameter variations of spiking neurons to inject backdoor behavior into the model. We further propose a trigger optimization process to achieve better attack performance while making trigger patterns less perceptible. \textit{BadSNN} demonstrates superior attack performance on various datasets and architectures, as well as compared with state-of-the-art data poisoning-based backdoor attacks and robustness against common backdoor mitigation techniques. Codes can be found at https://github.com/SiSL-URI/BadSNN.
