Ultralight Polarity-Split Neuromorphic SNN for Event-Stream Super-Resolution
Chuanzhi Xu, Haoxian Zhou, Langyi Chen, Yuk Ying Chung, Qiang Qu
TL;DR
This work tackles the challenge of coarse spatial resolution in event cameras by proposing an ultra-lightweight, event-to-event super-resolution network based on Spiking Neural Networks. It introduces a Dual-Forward Polarity-Split Encoding strategy to separately process positive and negative events with a shared SNN, and a Learnable Spatio-temporal Polarity-aware Loss (LearnSTPLoss) to balance temporal, spatial, and polarity fidelity. The approach yields competitive SR performance while dramatically reducing model size and inference time, enabling real-time, on-device deployment and improved downstream tasks such as image reconstruction and object recognition. Extensive experiments across multiple datasets and real-world deployment validate the effectiveness and efficiency of the method, with notable parameter reductions and fast inference on resource-constrained hardware.
Abstract
Event cameras offer unparalleled advantages such as high temporal resolution, low latency, and high dynamic range. However, their limited spatial resolution poses challenges for fine-grained perception tasks. In this work, we propose an ultra-lightweight, stream-based event-to-event super-resolution method based on Spiking Neural Networks (SNNs), designed for real-time deployment on resource-constrained devices. To further reduce model size, we introduce a novel Dual-Forward Polarity-Split Event Encoding strategy that decouples positive and negative events into separate forward paths through a shared SNN. Furthermore, we propose a Learnable Spatio-temporal Polarity-aware Loss (LearnSTPLoss) that adaptively balances temporal, spatial, and polarity consistency using learnable uncertainty-based weights. Experimental results demonstrate that our method achieves competitive super-resolution performance on multiple datasets while significantly reducing model size and inference time. The lightweight design enables embedding the module into event cameras or using it as an efficient front-end preprocessing for downstream vision tasks.
