An Asynchronous Multi-core Accelerator for SNN inference

Zhuo Chen; De Ma; Xiaofei Jin; Qinghui Xing; Ouwen Jin; Xin Du; Shuibing He; Gang Pan

An Asynchronous Multi-core Accelerator for SNN inference

Zhuo Chen, De Ma, Xiaofei Jin, Qinghui Xing, Ouwen Jin, Xin Du, Shuibing He, Gang Pan

TL;DR

The paper tackles under-utilization in time-driven SNN accelerators caused by global synchronization. It introduces DepAsync, an asynchronous, time-accurate architecture that uses compile-time core dependencies and runtime dependency tracing to allow per-core progress without waiting for all cores. Across five SNN workloads, DepAsync achieves substantial improvements in speed and energy efficiency over synchronization-based baselines and demonstrates strong scalability, while noting sensitivity to mapping and network conditions. The approach preserves exact spike results while reducing idle time, enabling more efficient brain-inspired computation in multi-core accelerators.

Abstract

Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy of SNNs often necessitates frequent explicit synchronization among all cores, which presents a challenge to overall efficiency. In this paper, we propose an asynchronous architecture for Spiking Neural Networks (SNNs) that eliminates the need for inter-core synchronization, thus enhancing speed and energy efficiency. This approach leverages the pre-determined dependencies of neuromorphic cores established during compilation. Each core is equipped with a scheduler that monitors the status of its dependencies, allowing it to safely advance to the next timestep without waiting for other cores. This eliminates the necessity for global synchronization and minimizes core waiting time despite inherent workload imbalances. Comprehensive evaluations using five different SNN workloads show that our architecture achieves a 1.86x speedup and a 1.55x increase in energy efficiency compared to state-of-the-art synchronization architectures.

An Asynchronous Multi-core Accelerator for SNN inference

TL;DR

Abstract

Paper Structure (31 sections, 3 equations, 26 figures, 4 tables, 2 algorithms)

This paper contains 31 sections, 3 equations, 26 figures, 4 tables, 2 algorithms.

Introduction
Background
Time-driven SNN Inference
Many-core Time-driven SNN Accelerators
Under-utilization Problem
Imbalance in SNN Workloads
Under-utilization Caused by Synchronization
Related Work
DepAsync Design
Ignored Inter-Core Dependencies
Tracing Dependencies and Running States
DepAsync Implementation
DepAsync Scheduler
Circular Spike Buffer
Package Generator
...and 16 more sections

Figures (26)

Figure 1: Different neuron models between ANN and SNN.
Figure 2: Deploy pipeline on SNN accelerators.
Figure 3: Block diagram of many-core SNN accelerators.
Figure 4: SNN inference on time-driven SNN accelerators.
Figure 5: Imbalance in SNN workloads.
...and 21 more figures

An Asynchronous Multi-core Accelerator for SNN inference

TL;DR

Abstract

An Asynchronous Multi-core Accelerator for SNN inference

Authors

TL;DR

Abstract

Table of Contents

Figures (26)