Exact Gradients for Stochastic Spiking Neural Networks Driven by Rough Signals
Christian Holberg, Cristopher Salvi
TL;DR
This work addresses gradient-based training of stochastic spiking neural networks (SSNNs) whose dynamics and spike timings are both driven by rough, potentially discontinuous noise. Using rough path theory, it models SSNNs as Event SDEs, extends the framework to Marcus RDEs, and derives exact pathwise gradient formulas for both trajectories and spike times, enabling end-to-end autodifferentiation via a differentiable solver implemented in $\texttt{diffrax}$. A novel Marcus signature kernel is introduced to define a loss on càdlàg paths, yielding a robust maximum mean discrepancy (MMD) objective for training SSNNs as generative models. The approach supports online gradient updates and provides practical tools for input and weight estimation, with implications for bioplausible learning and neuromorphic hardware.
Abstract
We introduce a mathematically rigorous framework based on rough path theory to model stochastic spiking neural networks (SSNNs) as stochastic differential equations with event discontinuities (Event SDEs) driven by càdlàg rough paths. Our formalism is general enough to allow for jumps both in the solution trajectories and in the driving noise. We then identify a set of sufficient conditions ensuring the existence of pathwise gradients of solution trajectories and event times with respect to the network's parameters and show that these gradients satisfy a recursive relation. Furthermore, we introduce a general-purpose loss function defined by means of a new class of signature kernels indexed on càdlàg rough paths and use it to train SSNNs as generative models. We provide an end-to-end autodifferentiable solver for Event SDEs and make its implementation available as part of the $\texttt{diffrax}$ library. Our framework is, to our knowledge, the first to enable gradient-based training of SSNNs with noise affecting both the spike timing and the network's dynamics.
