StochEP: Stochastic Equilibrium Propagation for Spiking Convergent Recurrent Neural Networks
Jiaqi Lin, Yi Jiang, Abhronil Sengupta
TL;DR
This paper introduces Stochastic Equilibrium Propagation (StochEP), a framework that trains spiking neural networks with probabilistic spiking neurons inside EP to stabilize learning and enable deep, convolutional CRNNs. By proving a mean-field equivalence between the stochastic energy and the deterministic EP energy, StochEP inherits convergence guarantees while smoothing the optimization landscape through stochasticity. Empirically, StochEP achieves competitive performance against BPTT-trained SNNs and EP-trained non-spiking networks on MNIST, CIFAR-10, and DVS Gesture, while delivering substantial memory and energy savings and enabling processing of time-varying inputs. The work highlights stochasticity as both biologically plausible and practically advantageous for neuromorphic, on-chip learning, with clear directions for scaling and hardware evaluation.
Abstract
Spiking Neural Networks (SNNs) promise energy-efficient, sparse, biologically inspired computation. Training them with Backpropagation Through Time (BPTT) and surrogate gradients achieves strong performance but remains biologically implausible. Equilibrium Propagation (EP) provides a more local and biologically grounded alternative. However, existing EP frameworks, primarily based on deterministic neurons, either require complex mechanisms to handle discontinuities in spiking dynamics or fail to scale beyond simple visual tasks. Inspired by the stochastic nature of biological spiking mechanism and recent hardware trends, we propose a stochastic EP framework that integrates probabilistic spiking neurons into the EP paradigm. This formulation smoothens the optimization landscape, stabilizes training, and enables scalable learning in deep convolutional spiking convergent recurrent neural networks (CRNNs). We provide theoretical guarantees showing that the proposed stochastic EP dynamics approximate deterministic EP under mean-field theory, thereby inheriting its underlying theoretical guarantees. The proposed framework narrows the gap to both BPTT-trained SNNs and EP-trained non-spiking CRNNs in vision benchmarks while preserving locality, highlighting stochastic EP as a promising direction for neuromorphic and on-chip learning.
