Efficient Training of Spiking Neural Networks by Spike-aware Data Pruning

Chenxiang Ma; Xinyi Chen; Yujie Wu; Kay Chen Tan; Jibin Wu

TL;DR

This work addresses the high training cost of spiking neural networks by introducing spike-aware data pruning (SADP). SADP optimizes data usage by selecting examples with probabilities proportional to an upper-bound proxy of their gradient norms, called the spike-aware importance score, and adds smoothing and a dynamic pruning schedule to stabilize training. The approach yields substantial training speedups while preserving accuracy across diverse datasets and architectures, including large-scale ImageNet experiments, and proves compatible with online, local, and efficient-inference settings. By reducing gradient variance and avoiding expensive per-example gradient computations, SADP offers a data-centric route to scaling SNNs to bigger models and datasets with practical efficiency gains.

Abstract

Spiking neural networks (SNNs), recognized as an energy-efficient alternative to traditional artificial neural networks (ANNs), have advanced rapidly through the scaling of models and datasets. However, such scaling incurs considerable training overhead, posing challenges for researchers with limited computational resources and hindering the sustained development of SNNs. Data pruning is a promising strategy for accelerating training by retaining the most informative examples and discarding redundant ones, but it remains largely unexplored in SNNs. Directly applying ANN-based data pruning methods to SNNs fails to capture the intrinsic importance of examples and suffers from high gradient variance. To address these challenges, we propose a novel spike-aware data pruning (SADP) method. SADP reduces gradient variance by setting each example's selection probability proportional to its gradient norm, while avoiding the high cost of direct gradient computation through an efficient upper bound, termed the spike-aware importance score. This score accounts for the influence of all-or-nothing spikes on the gradient norm and can be computed with negligible overhead. Extensive experiments across diverse datasets and architectures demonstrate that SADP consistently outperforms data pruning baselines and achieves training speedups close to the theoretical maxima at different pruning ratios. Notably, SADP reduces training time by 35% on ImageNet while maintaining accuracy comparable to that of full-data training. This work therefore establishes a data-centric paradigm for efficient SNN training and paves the way for scaling SNNs to larger models and datasets. The source code will be released publicly after the review process.
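
The selection rule described in the abstract, sampling each example with probability proportional to an importance score, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the placeholder scores, and the choice of sampling with replacement are all assumptions made for illustration, and the spike-aware importance score itself is not reproduced here. The per-example weight 1 / (N * p_i * k) is the standard importance-sampling correction that keeps the weighted gradient estimate unbiased.

```python
import random


def selection_probabilities(scores):
    """Normalize non-negative importance scores into sampling probabilities.

    `scores` stands in for any per-example importance proxy (hypothetical
    values here, not the paper's spike-aware importance score).
    """
    total = sum(scores)
    if total <= 0:
        # Degenerate case: fall back to uniform sampling.
        return [1.0 / len(scores)] * len(scores)
    return [s / total for s in scores]


def sample_with_weights(scores, num_keep, rng):
    """Draw `num_keep` indices with replacement, proportional to score.

    Returns (index, weight) pairs with weight = 1 / (N * p_i * num_keep),
    so that the weighted sum of per-example gradients over the draws is an
    unbiased estimate of the mean gradient over all N examples, provided
    every example with a non-zero gradient has a non-zero score.
    """
    n = len(scores)
    probs = selection_probabilities(scores)
    indices = rng.choices(range(n), weights=probs, k=num_keep)
    return [(i, 1.0 / (n * probs[i] * num_keep)) for i in indices]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    placeholder_scores = [0.5, 1.5, 2.0, 4.0]  # hypothetical scores
    for index, weight in sample_with_weights(placeholder_scores, 2, rng):
        print(index, weight)
```

Examples with larger scores are drawn more often but receive smaller weights, which is what lets a pruned subset stand in for the full dataset in expectation. The smoothing and dynamic pruning schedule mentioned in the TL;DR are not modeled in this sketch.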
