Sign Gradient Descent-based Neuronal Dynamics: ANN-to-SNN Conversion Beyond ReLU Network
Hyunseok Oh, Youngki Lee
TL;DR
The paper tackles the limited nonlinearity support and high latency in ANN-to-SNN conversion by introducing a sign gradient-descent (signGD) based neuronal dynamics that links discrete spiking behavior to optimization updates. It proves that IF/LIF neurons with rate or EMA coding emulate subgradient methods on a crafted objective, then extends this to design signGD-based neurons that can approximate unary and multi-operand nonlinearities beyond ReLU, including max pooling and layer normalization. Empirically, the approach enables state-of-the-art ANN-to-SNN conversion on large datasets and supports contemporary architectures such as ConvNext, MLP-Mixer, and ResMLP with competitive latency (e.g., 64–256 time-steps on ImageNet). The findings offer a practical, theory-grounded path to energy-efficient SNN inference without sacrificing accuracy, and point to broader implications for neuromorphic hardware and neuroscience-inspired computation. Overall, signGD-based neuronal dynamics broaden the scope and speed of SNN-based inference while preserving the core benefits of spike-based computation.”
Abstract
Spiking neural network (SNN) is studied in multidisciplinary domains to (i) enable order-of-magnitudes energy-efficient AI inference and (ii) computationally simulate neuro-scientific mechanisms. The lack of discrete theory obstructs the practical application of SNN by limiting its performance and nonlinearity support. We present a new optimization-theoretic perspective of the discrete dynamics of spiking neurons. We prove that a discrete dynamical system of simple integrate-and-fire models approximates the sub-gradient method over unconstrained optimization problems. We practically extend our theory to introduce a novel sign gradient descent (signGD)-based neuronal dynamics that can (i) approximate diverse nonlinearities beyond ReLU and (ii) advance ANN-to-SNN conversion performance in low time steps. Experiments on large-scale datasets show that our technique achieves (i) state-of-the-art performance in ANN-to-SNN conversion and (ii) is the first to convert new DNN architectures, e.g., ConvNext, MLP-Mixer, and ResMLP. We publicly share our source code at https://github.com/snuhcs/snn_signgd .
