Exploring the Limitations of Layer Synchronization in Spiking Neural Networks
Roel Koopman, Amirreza Yousefzadeh, Mahyar Shahsavari, Guangzhi Tang, Manolis Sifalakis
TL;DR
This paper investigates the mismatch between layer-synchronized training and end-to-end asynchronous inference in spiking neural networks (SNNs). It introduces unlayered backpropagation, a training approach that incorporates asynchronous neuron scheduling and vectorized execution to prepare models for true event-driven processing. Across multiple spatio-temporal benchmarks, asynchronous training can recover or improve accuracy, dramatically reduce spike counts (up to $50\%$), and halve inference latency (up to $2\times$ faster), albeit with substantial training-time costs. The work highlights the need for co-design of training algorithms and neuromorphic hardware to unlock real energy- and latency-efficient AI systems.
Abstract
Neural-network processing in machine learning applications relies on layer synchronization. This is practiced even in artificial Spiking Neural Networks (SNNs), which are touted as consistent with neurobiology, in spite of processing in the brain being in fact asynchronous. A truly asynchronous system however would allow all neurons to evaluate concurrently their threshold and emit spikes upon receiving any presynaptic current. Omitting layer synchronization is potentially beneficial, for latency and energy efficiency, but asynchronous execution of models previously trained with layer synchronization may entail a mismatch in network dynamics and performance. We present and quantify this problem, and show that models trained with layer synchronization either perform poorly in absence of the synchronization, or fail to benefit from any energy and latency reduction, when such a mechanism is in place. We then explore a potential solution direction, based on a generalization of backpropagation-based training that integrates knowledge about an asynchronous execution scheduling strategy, for learning models suitable for asynchronous processing. We experiment with two asynchronous neuron execution scheduling strategies in datasets that encode spatial and temporal information, and we show the potential of asynchronous processing to use less spikes (up to 50%), complete inference faster (up to 2x), and achieve competitive or even better accuracy (up to 10% higher). Our exploration affirms that asynchronous event-based AI processing can be indeed more efficient, but we need to rethink how we train our SNN models to benefit from it. (Source code available at: https://github.com/RoelMK/asynctorch)
