Inference-Scale Complexity in ANN-SNN Conversion for High-Performance and Low-Power Applications
Tong Bu, Maohua Li, Zhaofei Yu
TL;DR
The paper addresses the cost of converting pre-trained ANNs into high-performance SNNs by introducing a conversion framework with only inference-scale complexity. It combines a theoretical error bound with practical threshold optimization (local and channel-wise) and a delayed evaluation strategy that mitigates spike-delay effects. The approach demonstrates strong performance on image classification and extends to semantic segmentation, object detection, and video tasks, achieving notable energy efficiency (e.g., ~622 FPS/W) while requiring far less training data and compute than retraining-based methods. It offers a practical path for deploying SNNs on neuromorphic hardware with negligible performance loss relative to ANN baselines.
Abstract
Spiking Neural Networks (SNNs) have emerged as a promising substitute for Artificial Neural Networks (ANNs) due to their advantages of fast inference and low power consumption. However, the lack of efficient training algorithms has hindered their widespread adoption. Even efficient ANN-SNN conversion methods require quantized training of the ANN to make the conversion effective, which incurs additional training costs. To address these challenges, we propose an efficient ANN-SNN conversion framework with only inference-scale complexity. The conversion framework includes a local threshold balancing algorithm, which enables efficient calculation of the optimal thresholds and fine-grained adjustment of the threshold values through channel-wise scaling. We also introduce an effective delayed evaluation strategy to mitigate the influence of spike propagation delays. We demonstrate the scalability of our framework on typical computer vision tasks: image classification, semantic segmentation, object detection, and video classification. Our algorithm outperforms existing methods, highlighting its practical applicability and efficiency. Moreover, we evaluate the energy consumption of the converted SNNs, demonstrating their low-power advantage over conventional ANNs. This approach simplifies the deployment of SNNs by leveraging open-source pre-trained ANN models, enabling fast, low-power inference with negligible performance reduction. Code is available at https://github.com/putshua/Inference-scale-ANN-SNN.
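To make the ingredients named in the abstract concrete, the following is a minimal sketch of a generic rate-coded integrate-and-fire layer, not the authors' implementation. It assumes soft-reset neurons driven by a constant input, takes per-channel thresholds as given (the paper's local threshold balancing algorithm, which selects them, is not reproduced here), and models delayed evaluation simply as excluding the first `delay` time steps from the spike count. The function name `if_neuron_rates` and all parameter names are hypothetical.

```python
import numpy as np

def if_neuron_rates(currents, thresholds, timesteps, delay=0):
    """Simulate soft-reset integrate-and-fire neurons with constant input.

    currents:   shape (channels,), input added to the membrane each step
    thresholds: shape (channels,), one firing threshold per channel
    timesteps:  total number of simulated time steps
    delay:      initial time steps excluded from the spike count
                (an illustrative stand-in for delayed evaluation)
    Returns threshold-scaled firing rates, comparable to ANN activations.
    """
    if not 0 <= delay < timesteps:
        raise ValueError("delay must satisfy 0 <= delay < timesteps")
    currents = np.asarray(currents, dtype=float)
    thresholds = np.asarray(thresholds, dtype=float)
    membrane = np.zeros_like(currents)
    spike_count = np.zeros_like(currents)
    for t in range(timesteps):
        membrane += currents                 # integrate the input
        spikes = membrane >= thresholds      # fire where the threshold is reached
        membrane -= spikes * thresholds      # soft reset: subtract the threshold
        if t >= delay:                       # count spikes only after the delay
            spike_count += spikes
    return spike_count * thresholds / (timesteps - delay)

# Four channels with two different thresholds (values chosen for illustration).
inputs = np.array([0.25, 0.5, -0.3, 3.0])
theta = np.array([1.0, 2.0, 1.0, 2.0])
snn_out = if_neuron_rates(inputs, theta, timesteps=64, delay=4)
ann_out = np.clip(inputs, 0.0, theta)  # clipped ReLU, the ANN-side reference
```

Under these assumptions the scaled firing rate approaches a ReLU clipped at the channel threshold, which is why the choice of threshold governs the conversion error. In a multi-layer network, early time steps carry transient spikes from upstream layers, and that is the situation a delayed counting window is meant to address; the single-layer example above only shows the mechanism.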
