Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control

Zijie Xu; Tong Bu; Zecheng Hao; Jianhao Ding; Zhaofei Yu

Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control

Zijie Xu, Tong Bu, Zecheng Hao, Jianhao Ding, Zhaofei Yu

TL;DR

The paper addresses the instability of RL with discrete SNNs caused by the mismatch with continuous target-network updates. It introduces a proxy target framework that uses a differentiable proxy actor during training to enable smooth target updates, while preserving SNN energy efficiency during deployment. The approach achieves stable learning, faster convergence, and up to 32% higher average performance across multiple spiking neuron models and continuous control tasks, with simple LIF neurons sometimes surpassing ANN baselines. This work demonstrates the value of SNN-tailored RL algorithms and points to practical, energy-efficient neuromorphic controllers for edge devices.

Abstract

Spiking Neural Networks (SNNs) offer low-latency and energy-efficient decision making on neuromorphic hardware, making them attractive for Reinforcement Learning (RL) in resource-constrained edge devices. However, most RL algorithms for continuous control are designed for Artificial Neural Networks (ANNs), particularly the target network soft update mechanism, which conflicts with the discrete and non-differentiable dynamics of spiking neurons. We show that this mismatch destabilizes SNN training and degrades performance. To bridge the gap between discrete SNNs and continuous-control algorithms, we propose a novel proxy target framework. The proxy network introduces continuous and differentiable dynamics that enable smooth target updates, stabilizing the learning process. Since the proxy operates only during training, the deployed SNN remains fully energy-efficient with no additional inference overhead. Extensive experiments on continuous control benchmarks demonstrate that our framework consistently improves stability and achieves up to $32\%$ higher performance across various spiking neuron models. Notably, to the best of our knowledge, this is the first approach that enables SNNs with simple Leaky Integrate and Fire (LIF) neurons to surpass their ANN counterparts in continuous control. This work highlights the importance of SNN-tailored RL algorithms and paves the way for neuromorphic agents that combine high performance with low power consumption. Code is available at https://github.com/xuzijie32/Proxy-Target.

Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control

TL;DR

Abstract

Proxy Target: Bridging the Gap Between Discrete Spiking Neural Networks and Continuous Control

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (3)