Table of Contents
Fetching ...

An Asynchronous Multi-core Accelerator for SNN inference

Zhuo Chen, De Ma, Xiaofei Jin, Qinghui Xing, Ouwen Jin, Xin Du, Shuibing He, Gang Pan

TL;DR

The paper tackles under-utilization in time-driven SNN accelerators caused by global synchronization. It introduces DepAsync, an asynchronous, time-accurate architecture that uses compile-time core dependencies and runtime dependency tracing to allow per-core progress without waiting for all cores. Across five SNN workloads, DepAsync achieves substantial improvements in speed and energy efficiency over synchronization-based baselines and demonstrates strong scalability, while noting sensitivity to mapping and network conditions. The approach preserves exact spike results while reducing idle time, enabling more efficient brain-inspired computation in multi-core accelerators.

Abstract

Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy of SNNs often necessitates frequent explicit synchronization among all cores, which presents a challenge to overall efficiency. In this paper, we propose an asynchronous architecture for Spiking Neural Networks (SNNs) that eliminates the need for inter-core synchronization, thus enhancing speed and energy efficiency. This approach leverages the pre-determined dependencies of neuromorphic cores established during compilation. Each core is equipped with a scheduler that monitors the status of its dependencies, allowing it to safely advance to the next timestep without waiting for other cores. This eliminates the necessity for global synchronization and minimizes core waiting time despite inherent workload imbalances. Comprehensive evaluations using five different SNN workloads show that our architecture achieves a 1.86x speedup and a 1.55x increase in energy efficiency compared to state-of-the-art synchronization architectures.

An Asynchronous Multi-core Accelerator for SNN inference

TL;DR

The paper tackles under-utilization in time-driven SNN accelerators caused by global synchronization. It introduces DepAsync, an asynchronous, time-accurate architecture that uses compile-time core dependencies and runtime dependency tracing to allow per-core progress without waiting for all cores. Across five SNN workloads, DepAsync achieves substantial improvements in speed and energy efficiency over synchronization-based baselines and demonstrates strong scalability, while noting sensitivity to mapping and network conditions. The approach preserves exact spike results while reducing idle time, enabling more efficient brain-inspired computation in multi-core accelerators.

Abstract

Spiking Neural Networks (SNNs) are extensively utilized in brain-inspired computing and neuroscience research. To enhance the speed and energy efficiency of SNNs, several many-core accelerators have been developed. However, maintaining the accuracy of SNNs often necessitates frequent explicit synchronization among all cores, which presents a challenge to overall efficiency. In this paper, we propose an asynchronous architecture for Spiking Neural Networks (SNNs) that eliminates the need for inter-core synchronization, thus enhancing speed and energy efficiency. This approach leverages the pre-determined dependencies of neuromorphic cores established during compilation. Each core is equipped with a scheduler that monitors the status of its dependencies, allowing it to safely advance to the next timestep without waiting for other cores. This eliminates the necessity for global synchronization and minimizes core waiting time despite inherent workload imbalances. Comprehensive evaluations using five different SNN workloads show that our architecture achieves a 1.86x speedup and a 1.55x increase in energy efficiency compared to state-of-the-art synchronization architectures.
Paper Structure (31 sections, 3 equations, 26 figures, 4 tables, 2 algorithms)

This paper contains 31 sections, 3 equations, 26 figures, 4 tables, 2 algorithms.

Figures (26)

  • Figure 1: Different neuron models between ANN and SNN.
  • Figure 2: Deploy pipeline on SNN accelerators.
  • Figure 3: Block diagram of many-core SNN accelerators.
  • Figure 4: SNN inference on time-driven SNN accelerators.
  • Figure 5: Imbalance in SNN workloads.
  • ...and 21 more figures