Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

Yongqi Ding; Lin Zuo; Mengmeng Jing; Pei He; Yongjun Xiao

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

Yongqi Ding, Lin Zuo, Mengmeng Jing, Pei He, Yongjun Xiao

TL;DR

This work tackles the latency–accuracy tension in neuromorphic object recognition by introducing Shrinking SNN (SSNN), which splits the network into stages with progressively shrinking timesteps and a temporal transformer to preserve information across temporal scales. It augments training with multiple early classifiers to provide immediate gradient feedback, mitigating surrogate-gradient mismatch and gradient vanishing/exploding without inflating inference cost. Empirical results on CIFAR10-DVS, N-Caltech101, and DVS-Gesture show substantial gains at low average timesteps (e.g., 5), including 73.63% on CIFAR10-DVS without augmentation and up to 90.74% on DVS-Gesture, outperforming several state-of-the-art approaches at similar latencies. The findings demonstrate the effectiveness of heterogeneous temporal scales for achieving high-performance, low-latency SNNs and offer practical guidance for designing efficient neuromorphic recognition systems.

Abstract

Neuromorphic object recognition with spiking neural networks (SNNs) is the cornerstone of low-power neuromorphic computing. However, existing SNNs suffer from significant latency, utilizing 10 to 40 timesteps or more, to recognize neuromorphic objects. At low latencies, the performance of existing SNNs is drastically degraded. In this work, we propose the Shrinking SNN (SSNN) to achieve low-latency neuromorphic object recognition without reducing performance. Concretely, we alleviate the temporal redundancy in SNNs by dividing SNNs into multiple stages with progressively shrinking timesteps, which significantly reduces the inference latency. During timestep shrinkage, the temporal transformer smoothly transforms the temporal scale and preserves the information maximally. Moreover, we add multiple early classifiers to the SNN during training to mitigate the mismatch between the surrogate gradient and the true gradient, as well as the gradient vanishing/exploding, thus eliminating the performance degradation at low latency. Extensive experiments on neuromorphic datasets, CIFAR10-DVS, N-Caltech101, and DVS-Gesture have revealed that SSNN is able to improve the baseline accuracy by 6.55% ~ 21.41%. With only 5 average timesteps and without any data augmentation, SSNN is able to achieve an accuracy of 73.63% on CIFAR10-DVS. This work presents a heterogeneous temporal scale SNN and provides valuable insights into the development of high-performance, low-latency SNNs.

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

TL;DR

Abstract

Paper Structure (30 sections, 11 equations, 8 figures, 5 tables, 1 algorithm)

This paper contains 30 sections, 11 equations, 8 figures, 5 tables, 1 algorithm.

Introduction
Background
Spiking Neuron
Surrogate Gradient-Based SNN Training Method
Method
Timestep Shrinkage
Temporal Transformer
Average Timestep and Overhead Analysis
Early Classifier
Additional Overhead Analysis
The SSNN Framework
Experiments
Ablation Studies
Comparison with Existing Methods
CIFAR10-DVS
...and 15 more sections

Figures (8)

Figure 1: Comparative results of the proposed SSNN and existing SNNs (with an average timestep of 5). Results show that SSNN exceeds existing SNNs by a large margin.
Figure 2: Overview of SSNN. The SSNN is divided into $n$ stages with gradually shrinking timesteps $\{T_1,T_2,\cdot \cdot \cdot, T_n\}$, and the temporal transformer transforms the temporal scale of the information. During training, an early classifier is added after each stage of the SSNN except the last one. The predictions generated by each early classifier are used to facilitate the optimization of the parameters by calculating the losses along with the ground truth (green arrows).
Figure 3: Influence of $\lambda$ on performance. The accuracy remains at [68.70%,70.36%] as long as $\lambda_1$, $\lambda_2$ and $\lambda_3$ are not zero, indicating that our method is not sensitive to $\lambda$.
Figure 4: Influence of the temporal transformer (TT). The performance of SNNs is severely degraded and inferior to the baseline without the temporal transformer.
Figure 5: Influence of stage division and stage-wise timesteps, S$n\{t_1,t_2,... ,t_n\}$ denotes division into $n$ stages and stage-wise timestep of $\{t_1,t_2,... ,t_n\}$. Results indicate that our method is not sensitive to different stage divisions and stage-wise timesteps.
...and 3 more figures

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

TL;DR

Abstract

Shrinking Your TimeStep: Towards Low-Latency Neuromorphic Object Recognition with Spiking Neural Network

Authors

TL;DR

Abstract

Table of Contents

Figures (8)