Flexi-NeurA: A Configurable Neuromorphic Accelerator with Adaptive Bit-Precision Exploration for Edge SNNs

Mohammad Farahani; Mohammad Rasoul Roshanshah; Saeed Safari

Flexi-NeurA: A Configurable Neuromorphic Accelerator with Adaptive Bit-Precision Exploration for Edge SNNs

Mohammad Farahani, Mohammad Rasoul Roshanshah, Saeed Safari

TL;DR

Comprehensive evaluations across MNIST, SHD, and DVS benchmarks demonstrate that the Flexi-NeurA and Flex-plorer co-framework achieves substantial improvements in accuracy, latency, and energy efficiency.

Abstract

Neuromorphic accelerators promise unparalleled energy efficiency and computational density for spiking neural networks (SNNs), especially in edge intelligence applications. However, most existing platforms exhibit rigid architectures with limited configurability, restricting their adaptability to heterogeneous workloads and diverse design objectives. To address these limitations, we present Flexi-NeurA -- a parameterizable neuromorphic accelerator (core) that unifies configurability, flexibility, and efficiency. Flexi-NeurA allows users to customize neuron models, network structures, and precision settings at design time. By pairing these design-time configurability and flexibility features with a time-multiplexed and event-driven processing approach, Flexi-NeurA substantially reduces the required hardware resources and total power while preserving high efficiency and low inference latency. Complementing this, we introduce Flex-plorer, a heuristic-guided design-space exploration (DSE) tool that determines cost-effective fixed-point precisions for critical parameters -- such as decay factors, synaptic weights, and membrane potentials -- based on user-defined trade-offs between accuracy and resource usage. Based on the configuration selected through the Flex-plorer process, RTL code is configured to match the specified design. Comprehensive evaluations across MNIST, SHD, and DVS benchmarks demonstrate that the Flexi-NeurA and Flex-plorer co-framework achieves substantial improvements in accuracy, latency, and energy efficiency. A three-layer 256--128--10 fully connected network with LIF neurons mapped onto two processing cores achieves 97.23% accuracy on MNIST with 1.1~ms inference latency, utilizing only 1,623 logic cells, 7 BRAMs, and 111~mW of total power -- establishing Flexi-NeurA as a scalable, edge-ready neuromorphic platform.

Flexi-NeurA: A Configurable Neuromorphic Accelerator with Adaptive Bit-Precision Exploration for Edge SNNs

TL;DR

Abstract

Paper Structure (15 sections, 7 equations, 11 figures, 2 tables)

This paper contains 15 sections, 7 equations, 11 figures, 2 tables.

Introduction
Background
Spiking Neural Networks
Neuron Models
Related Work
Proposed Hardware
Hardware Design Components
Configurable Neuron Unit (CNU)
Coefficient Generator (CG)
Serial Peripheral Interface (SPI)
AER Management Unit (AMU)
Controller
Exploration
Results and Comparisons
Conclusion

Figures (11)

Figure 1: Spiking neural network topologies: (A) Fully-connected Feedforward, (B) Recurrent All-to-All False, and (C) Recurrent All-to-All.
Figure 2: Spiking neuron models: (A) IF (B) LIF (C) Synaptic.
Figure 3: Top-level architecture of the Flexi-NeurA processing core and its main functional units.
Figure 4: Multi-core system organization showing AER-based inter-layer communication and layer-to-core mapping.
Figure 5: Coefficient Generator (CG) structure based on a shift-and-add network. The 9-bit DecayRate[8:0] configures selectable shifts to approximate any leakage factor in the $[0,1]$ range with $1/256$ resolution.
...and 6 more figures

Flexi-NeurA: A Configurable Neuromorphic Accelerator with Adaptive Bit-Precision Exploration for Edge SNNs

TL;DR

Abstract

Flexi-NeurA: A Configurable Neuromorphic Accelerator with Adaptive Bit-Precision Exploration for Edge SNNs

Authors

TL;DR

Abstract

Table of Contents

Figures (11)