Learning Delays in Spiking Neural Networks using Dilated Convolutions with Learnable Spacings
Ilyass Hammouamri, Ismail Khalfaoui-Hassani, Timothée Masquelier
TL;DR
Spiking neural networks rely on delays to detect temporal spike coincidences, but learning these delays alongside weights has been challenging. The authors recast delays as learnable positions within a 1D Gaussian-interpolated convolution (DCLS) and train both weights and delays end-to-end offline, progressively narrowing the Gaussian width $\sigma$ to produce discrete delays $d_{ij}^{(l)}$. This approach yields state-of-the-art accuracy on temporal benchmarks SHD, SSC, and GSC-35 while using far fewer parameters and avoiding recurrent connections, making it attractive for neuromorphic implementation. The results, supported by ablations, demonstrate the value of jointly optimizing delays and weights and the effectiveness of gradually shrinking $\sigma$ to balance long-range dependencies and precision.
Abstract
Spiking Neural Networks (SNNs) are a promising research direction for building power-efficient information processing systems, especially for temporal tasks such as speech recognition. In SNNs, delays refer to the time needed for one spike to travel from one neuron to another. These delays matter because they influence the spike arrival times, and it is well-known that spiking neurons respond more strongly to coincident input spikes. More formally, it has been shown theoretically that plastic delays greatly increase the expressivity in SNNs. Yet, efficient algorithms to learn these delays have been lacking. Here, we propose a new discrete-time algorithm that addresses this issue in deep feedforward SNNs using backpropagation, in an offline manner. To simulate delays between consecutive layers, we use 1D convolutions across time. The kernels contain only a few non-zero weights - one per synapse - whose positions correspond to the delays. These positions are learned together with the weights using the recently proposed Dilated Convolution with Learnable Spacings (DCLS). We evaluated our method on three datasets: the Spiking Heidelberg Dataset (SHD), the Spiking Speech Commands (SSC) and its non-spiking version Google Speech Commands v0.02 (GSC) benchmarks, which require detecting temporal patterns. We used feedforward SNNs with two or three hidden fully connected layers, and vanilla leaky integrate-and-fire neurons. We showed that fixed random delays help and that learning them helps even more. Furthermore, our method outperformed the state-of-the-art in the three datasets without using recurrent connections and with substantially fewer parameters. Our work demonstrates the potential of delay learning in developing accurate and precise models for temporal data processing. Our code is based on PyTorch / SpikingJelly and available at: https://github.com/Thvnvtos/SNN-delays
