Random Feature Spiking Neural Networks
Maximilian Gollwitzer, Felix Dietrich
TL;DR
The paper tackles the challenge of training Spiking Neural Networks (SNNs) by introducing S-SWIM, a data-driven Random Feature Method that transfers the SWIM paradigm to Spike Response Model (SRM) networks, enabling gradient-free end-to-end training and effective initialization for gradient-based methods. It provides a rigorous mathematical framework and a modular algorithm that samples weights and temporal parameters from data-driven distributions, while solving a linear problem for output weights. The results on time-series forecasting show that S-SWIM achieves high accuracy with substantial speedups over surrogate-gradient training, and ablation studies reinforce the importance of data-driven weight construction and temporal parameter diversification. The approach is interpretable and flexible, but its current limitations include performance on deep networks and the need for robust sampling strategies, suggesting clear avenues for future work and broader applicability.
Abstract
Spiking Neural Networks (SNNs) as Machine Learning (ML) models have recently received a lot of attention as a potentially more energy-efficient alternative to conventional Artificial Neural Networks. The non-differentiability and sparsity of the spiking mechanism can make these models very difficult to train with algorithms based on propagating gradients through the spiking non-linearity. We address this problem by adapting the paradigm of Random Feature Methods (RFMs) from Artificial Neural Networks (ANNs) to Spike Response Model (SRM) SNNs. This approach allows training of SNNs without approximation of the spike function gradient. Concretely, we propose a novel data-driven, fast, high-performance, and interpretable algorithm for end-to-end training of SNNs inspired by the SWIM algorithm for RFM-ANNs, which we coin S-SWIM. We provide a thorough theoretical discussion and supplementary numerical experiments showing that S-SWIM can reach high accuracies on time series forecasting as a standalone strategy and serve as an effective initialisation strategy before gradient-based training. Additional ablation studies show that our proposed method performs better than random sampling of network weights.
