Accuracy-Robustness Trade Off via Spiking Neural Network Gradient Sparsity Trail
Luu Trong Nhan, Luu Trung Duong, Pham Ngoc Nam, Truong Cong Thang
TL;DR
The paper investigates adversarial robustness in Spiking Neural Networks (SNNs) and uncovers a natural gradient sparsity phenomenon that can yield state-of-the-art robustness without explicit defenses under certain architectures. Through empirical benchmarking on CIFAR and event-based datasets, it reveals a robustness-generalization trade-off driven by gradient density, with architectural choices like pooling shaping gradient sparsity. The authors provide theoretical bounds linking input-gradient sparsity to reduced weight-gradient density, and show that reducing gradient sparsity (e.g., via average pooling) can improve clean generalization at the cost of robustness. These findings offer a principled perspective on designing sparse-gradient SNNs for robust, energy-efficient vision systems and suggest future sparsity-aware architectural strategies.
Abstract
Spiking Neural Networks (SNNs) have attracted growing interest in both computational neuroscience and artificial intelligence, primarily due to their inherent energy efficiency and compact memory footprint. However, achieving adversarial robustness in SNNs, (particularly for vision-related tasks) remains a nascent and underexplored challenge. Recent studies have proposed leveraging sparse gradients as a form of regularization to enhance robustness against adversarial perturbations. In this work, we present a surprising finding: under specific architectural configurations, SNNs exhibit natural gradient sparsity and can achieve state-of-the-art adversarial defense performance without the need for any explicit regularization. Further analysis reveals a trade-off between robustness and generalization: while sparse gradients contribute to improved adversarial resilience, they can impair the model's ability to generalize; conversely, denser gradients support better generalization but increase vulnerability to attacks. Our findings offer new insights into the dual role of gradient sparsity in SNN training.
