Spiking CenterNet: A Distillation-boosted Spiking Neural Network for Object Detection
Lennard Bodden, Franziska Schwaiger, Duc Bach Ha, Lars Kreuzberg, Sven Behnke
TL;DR
Spiking CenterNet tackles energy-efficient object detection on event data by introducing a fully spiking adaptation of CenterNet paired with an M2U-Net decoder. The method combines simple, binary-spiking building blocks, a train-from-scratch approach, and knowledge distillation from a non-spiking teacher, achieving competitive mAP on Prophesee GEN1 while reducing energy per inference. The study provides an energy model for SNNs and demonstrates strong robustness via time-step ablations, showing that multiple time steps can be downsampled via temporal averaging without sacrificing performance. Overall, the work delivers a practical, distillation-boosted spiking detector with clear potential for neuromorphic, edge deployments and future extensions to RGB data, 3D bounding boxes, and pose estimation.
Abstract
In the era of AI at the edge, self-driving cars, and climate change, the need for energy-efficient, small, embedded AI is growing. Spiking Neural Networks (SNNs) are a promising approach to address this challenge, with their event-driven information flow and sparse activations. We propose Spiking CenterNet for object detection on event data. It combines an SNN CenterNet adaptation with an efficient M2U-Net-based decoder. Our model significantly outperforms comparable previous work on Prophesee's challenging GEN1 Automotive Detection Dataset while using less than half the energy. Distilling the knowledge of a non-spiking teacher into our SNN further increases performance. To the best of our knowledge, our work is the first approach that takes advantage of knowledge distillation in the field of spiking object detection.
