A Low Latency Adaptive Coding Spiking Framework for Deep Reinforcement Learning
Lang Qin, Rui Yan, Huajin Tang
TL;DR
The paper addresses high latency and limited versatility in spiking reinforcement learning (SRL) by introducing the adaptive coding spike framework (ACSF), which uses learnable matrices to encode and decode spikes through Spike Encoders/Decoders. By employing iterative LIF neurons and surrogate-gradient training, ACSF supports both online and offline DRL with a directly trained SNN, achieving ultra-low latency (as low as $0.8\%$ of prior SRL methods) and up to $5\times$ energy efficiency over DNNs. Empirical results on Atari and MuJoCo show ACSF either matching or surpassing baselines while significantly reducing latency, and ablations demonstrate the value of adaptive coders. Overall, ACSF broadens SRL applicability, enables efficient neuromorphic deployment, and provides a unified, end-to-end framework for low-latency reinforcement learning.
Abstract
In recent years, spiking neural networks (SNNs) have been used in reinforcement learning (RL) due to their low power consumption and event-driven features. However, spiking reinforcement learning (SRL), which suffers from fixed coding methods, still faces the problems of high latency and poor versatility. In this paper, we use learnable matrix multiplication to encode and decode spikes, improving the flexibility of the coders and thus reducing latency. Meanwhile, we train the SNNs using the direct training method and use two different structures for online and offline RL algorithms, which gives our model a wider range of applications. Extensive experiments have revealed that our method achieves optimal performance with ultra-low latency (as low as 0.8% of other SRL methods) and excellent energy efficiency (up to 5X the DNNs) in different algorithms and different environments.
