Efficient Logit-based Knowledge Distillation of Deep Spiking Neural Networks for Full-Range Timestep Deployment
Chengting Yu, Xiaochen Zhao, Lei Liu, Shu Yang, Gaoang Wang, Erping Li, Aili Wang
TL;DR
This work tackles the rigidity of SNN deployment caused by fixed inference timesteps and accuracy gaps to ANNs. It introduces a temporal-wise logits-based distillation framework that decouples targets across timesteps and augments learning with final ensemble self-distillation, backed by convergence proofs. Empirically, it achieves state-of-the-art performance among distillation-based SNN methods on CIFAR-10/100, ImageNet, and CIFAR10-DVS, while preserving training efficiency comparable to standard KD. The approach enables a single trained model to perform robustly across a full range of timesteps, facilitating flexible, energy-efficient deployment on neuromorphic hardware. Overall, the method advances SNN usability by providing theoretical guarantees and practical benefits for full-range timestep deployment.
Abstract
Spiking Neural Networks (SNNs) are emerging as a brain-inspired alternative to traditional Artificial Neural Networks (ANNs), prized for their potential energy efficiency on neuromorphic hardware. Despite this, SNNs often suffer from accuracy degradation compared to ANNs and face deployment challenges due to fixed inference timesteps, which require retraining for adjustments, limiting operational flexibility. To address these issues, our work considers the spatio-temporal property inherent in SNNs, and proposes a novel distillation framework for deep SNNs that optimizes performance across full-range timesteps without specific retraining, enhancing both efficacy and deployment adaptability. We provide both theoretical analysis and empirical validations to illustrate that training guarantees the convergence of all implicit models across full-range timesteps. Experimental results on CIFAR-10, CIFAR-100, CIFAR10-DVS, and ImageNet demonstrate state-of-the-art performance among distillation-based SNNs training methods. Our code is available at https://github.com/Intelli-Chip-Lab/snn\_temporal\_decoupling\_distillation.
