DelGrad: Exact event-based gradients for training delays and weights on spiking neuromorphic hardware
Julian Göltz, Jimmy Weber, Laura Kriener, Sebastian Billaudelle, Peter Lake, Johannes Schemmel, Melika Payvand, Mihai A. Petrovici
TL;DR
DelGrad presents an exact, event-based gradient framework for jointly training transmission delays and synaptic weights in spiking neural networks, leveraging spike times exclusively and avoiding membrane-potential tracking. It derives analytic spike-time gradients, extends to multi-layer architectures, and integrates delay mechanisms (axonal, dendritic, synaptic) into the learning process, enabling end-to-end optimization with minimal hardware overhead. Empirically, DelGrad improves accuracy and parameter efficiency on a Yin–Yang classification task and demonstrates memory- and energy-friendly chip-in-the-loop training on BrainScaleS-2, with hardware-aware simulations capturing noise-induced gaps. This work enables precise temporal learning for neuromorphic hardware, reducing parameter counts while enhancing robustness to hardware variability and paving the way for delay-aware end-to-end SNN training.
Abstract
Spiking neural networks (SNNs) inherently rely on the timing of signals for representing and processing information. Incorporating trainable transmission delays, alongside synaptic weights, is crucial for shaping these temporal dynamics. While recent methods have shown the benefits of training delays and weights in terms of accuracy and memory efficiency, they rely on discrete time, approximate gradients, and full access to internal variables like membrane potentials. This limits their precision, efficiency, and suitability for neuromorphic hardware due to increased memory requirements and I/O bandwidth demands. To address these challenges, we propose DelGrad, an analytical, event-based method to compute exact loss gradients for both synaptic weights and delays. The inclusion of delays in the training process emerges naturally within our proposed formalism, enriching the model's search space with a temporal dimension. Moreover, DelGrad, grounded purely in spike timing, eliminates the need to track additional variables such as membrane potentials. To showcase this key advantage, we demonstrate the functionality and benefits of DelGrad on the BrainScaleS-2 neuromorphic platform, by training SNNs in a chip-in-the-loop fashion. For the first time, we experimentally demonstrate the memory efficiency and accuracy benefits of adding delays to SNNs on noisy mixed-signal hardware. Additionally, these experiments also reveal the potential of delays for stabilizing networks against noise. DelGrad opens a new way for training SNNs with delays on neuromorphic hardware, which results in fewer required parameters, higher accuracy and ease of hardware training.
