Spikingformer: A Key Foundation Model for Spiking Neural Networks
Chenlin Zhou, Liutao Yu, Zhaokun Zhou, Han Zhang, Jiaqi Wang, Huihui Zhou, Zhengyu Ma, Yonghong Tian
TL;DR
Spikingformer introduces a spike-driven Transformer backbone by integrating MS Residual with Self-Attention to eliminate non-spike computations that hinder energy efficiency in Spikformer. The approach preserves global modeling capabilities via Spiking Self Attention and a Spiking Tokenizer, achieving strong performance across ImageNet, CIFAR, neuromorphic data, and GLUE with significantly reduced energy consumption. The authors provide theoretical energy analysis, extensive experiments on 13 datasets, and detailed Appendix resources to support deployment on neuromorphic hardware, positioning Spikingformer as a robust foundation model for energy-efficient AI. The work demonstrates that spike-driven residuals can sustain high accuracy while maintaining event-driven computation, advancing practical SNNs for diverse tasks.
Abstract
Spiking neural networks (SNNs) offer a promising energy-efficient alternative to artificial neural networks, due to their event-driven spiking computation. However, some foundation SNN backbones (including Spikformer and SEW ResNet) suffer from non-spike computations (integer-float multiplications) caused by the structure of their residual connections. These non-spike computations increase SNNs' power consumption and make them unsuitable for deployment on mainstream neuromorphic hardware. In this paper, we analyze the spike-driven behavior of the residual connection methods in SNNs. We then present Spikingformer, a novel spiking transformer backbone that merges the MS Residual connection with Self-Attention in a biologically plausible way to address the non-spike computation challenge in Spikformer while maintaining global modeling capabilities. We evaluate Spikingformer across 13 datasets spanning large static images, neuromorphic data, and natural language tasks, and demonstrate the effectiveness and universality of Spikingformer, setting a vital benchmark for spiking neural networks. In addition, with the spike-driven features and global modeling capabilities, Spikingformer is expected to become a more efficient general-purpose SNN backbone towards energy-efficient artificial intelligence. Code: https://github.com/TheBrainLab/Spikingformer
