Beyond Rate Coding: Surrogate Gradients Enable Spike Timing Learning in Spiking Neural Networks

Ziqiao Yu; Pengfei Sun; Danyal Akarca; Dan F. M. Goodman

Beyond Rate Coding: Surrogate Gradients Enable Spike Timing Learning in Spiking Neural Networks

Ziqiao Yu, Pengfei Sun, Danyal Akarca, Dan F. M. Goodman

TL;DR

This work interrogates whether surrogate-gradient trained SNNs exploit spike timing beyond firing rate by using synthetic timing benchmarks and timing-normalized speech datasets. It demonstrates that Surrogate GD can learn fine-grained timing features such as ISIs, cross-channel ISIs, and coincidences, and that incorporating trainable axonal delays further enhances learning, especially for long timescales. In realistic datasets (SHD/SSC), timing information persists even after rate normalization, and delay-based networks show increased sensitivity to temporal order and cross-channel cues, underscoring the value of temporal coding in SNNs. The authors also provide timing-focused data resources to foster future exploration of temporal-spike coding in neuromorphic computing.

Abstract

The surrogate gradient descent algorithm enabled spiking neural networks to be trained to carry out challenging sensory processing tasks, an important step in understanding how spikes contribute to neural computations. However, it is unclear the extent to which these algorithms fully explore the space of possible spiking solutions to problems. We investigated whether spiking networks trained with surrogate gradient descent can learn to make use of information that is only encoded in the timing and not the rate of spikes. We constructed synthetic datasets with a range of types of spike timing information (interspike intervals, spatio-temporal spike patterns or polychrony, and coincidence codes). We find that surrogate gradient descent training can extract all of these types of information. In more realistic speech-based datasets, both timing and rate information is present. We therefore constructed variants of these datasets in which all rate information is removed, and find that surrogate gradient descent can still perform well. We tested all networks both with and without trainable axonal delays. We find that delays can give a significant increase in performance, particularly for more challenging tasks. To determine what types of spike timing information are being used by the networks trained on the speech-based tasks, we test these networks on time-reversed spikes which perturb spatio-temporal spike patterns but leave interspike intervals and coincidence information unchanged. We find that when axonal delays are not used, networks perform well under time reversal, whereas networks trained with delays perform poorly. This suggests that spiking neural networks with delays are better able to exploit temporal structure. To facilitate further studies of temporal coding, we have released our modified speech-based datasets.

Beyond Rate Coding: Surrogate Gradients Enable Spike Timing Learning in Spiking Neural Networks

TL;DR

Abstract

Beyond Rate Coding: Surrogate Gradients Enable Spike Timing Learning in Spiking Neural Networks

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)