Driving in Spikes: An Entropy-Guided Object Detector for Spike Cameras
Ziyan Liu, Qi Su, Lulu Tang, Zhaofei Yu, Tiejun Huang
TL;DR
This work tackles object detection for spike cameras in high-speed driving, where traditional frame-based detectors struggle under motion and lighting extremes. It introduces EASD, a dual-branch, end-to-end spike detector that combines a Temporal-Based Texture branch with a global fusion pathway and an Entropy Selective Attention branch for object-centric refinement, enabling robust detection from sparse spike streams. To close the data gap, the authors build DSEC-Spike, a spike-based driving benchmark, and demonstrate state-of-the-art performance on both synthetic spike data and real spike streams, with notable simulation-to-real transfer. The results show that spike cameras, when coupled with carefully designed spatiotemporal and attention mechanisms, can achieve accurate, efficient multi-object detection in challenging autonomous-driving scenarios, highlighting practical viability for ultra-fast perception systems.
Abstract
Object detection in autonomous driving suffers from motion blur and saturation under fast motion and extreme lighting. Spike cameras, offer microsecond latency and ultra high dynamic range for object detection by using per pixel asynchronous integrate and fire. However, their sparse, discrete output cannot be processed by standard image-based detectors, posing a critical challenge for end to end spike stream detection. We propose EASD, an end to end spike camera detector with a dual branch design: a Temporal Based Texture plus Feature Fusion branch for global cross slice semantics, and an Entropy Selective Attention branch for object centric details. To close the data gap, we introduce DSEC Spike, the first driving oriented simulated spike detection benchmark.
