Efficient Deployment of Spiking Neural Networks on SpiNNaker2 for DVS Gesture Recognition Using Neuromorphic Intermediate Representation
Sirine Arfa, Bernhard Vogginger, Chen Liu, Johannes Partzsch, Mark Schone, Christian Mayr
TL;DR
This work tackles efficient deployment of Spiking Neural Networks (SNNs) for DVS gesture recognition on the SpiNNaker2 neuromorphic platform under tight memory constraints. It presents an end-to-end deployment pipeline that uses the neuromorphic intermediate representation (NIR) and compares two quantization strategies—Post-Training Quantization (PTQ) and Quantization Aware Training (QAT)—to achieve 8-bit on-chip inference with accuracy close to full-precision. The results show that QAT yields smaller accuracy losses (about 0.56% on-chip for the Q-SNN) than PTQ (up to ~1.07%), with on-chip accuracies around 94.13% for the quantized model and 94.0% for the FP32 baseline, while delivering energy-efficient inference (≈0.459 J per gesture). The study demonstrates SpiNNaker2’s viability for low-energy edge recognition and provides an end-to-end, open-source deployment workflow that can be extended to multi-chip systems and real-time spiking sensing tasks.
Abstract
Spiking Neural Networks (SNNs) are highly energy-efficient during inference, making them particularly suitable for deployment on neuromorphic hardware. Their ability to process event-driven inputs, such as data from dynamic vision sensors (DVS), further enhances their applicability to edge computing tasks. However, the resource constraints of edge hardware necessitate techniques like weight quantization, which reduce the memory footprint of SNNs while preserving accuracy. Despite its importance, existing quantization methods typically focus on synaptic weights quantization without taking account of other critical parameters, such as scaling neuron firing thresholds. To address this limitation, we present the first benchmark for the DVS gesture recognition task using SNNs optimized for the many-core neuromorphic chip SpiNNaker2. Our study evaluates two quantization pipelines for fixed-point computations. The first approach employs post training quantization (PTQ) with percentile-based threshold scaling, while the second uses quantization aware training (QAT) with adaptive threshold scaling. Both methods achieve accurate 8-bit on-chip inference, closely approximating 32-bit floating-point performance. Additionally, our baseline SNNs perform competitively against previously reported results without specialized techniques. These models are deployed on SpiNNaker2 using the neuromorphic intermediate representation (NIR). Ultimately, we achieve 94.13% classification accuracy on-chip, demonstrating the SpiNNaker2's potential for efficient, low-energy neuromorphic computing.
