Neural Ganglion Sensors: Learning Task-specific Event Cameras Inspired by the Neural Circuit of the Human Retina
Haley M. So, Gordon Wetzstein
TL;DR
This work introduces Neural Ganglion Sensors, a retina-inspired extension of event cameras that learns task-specific spatio-temporal retinal kernels (RGC events) to improve perception tasks while reducing event bandwidth. By formulating a differentiable RGC event model with learnable kernels and thresholds, and enabling differentiable binning through a closed-form mapping, the approach bridges bio-inspired sensing with end-to-end learning. The framework supports multiple RGC channels and center-surround configurations, and is evaluated on video interpolation and optical flow, showing superior performance and bandwidth efficiency compared with traditional DVS/CSDVS baselines. These results demonstrate the potential of RGC-inspired event sensing for edge devices and real-time, low-power vision applications, with clear directions for hardware integration and future exploration of non-binary nonlinearities and richer kernel families.
Abstract
Inspired by the data-efficient spiking mechanism of neurons in the human eye, event cameras were created to achieve high temporal resolution with minimal power and bandwidth requirements by emitting asynchronous, per-pixel intensity changes rather than conventional fixed-frame rate images. Unlike retinal ganglion cells (RGCs) in the human eye, however, which integrate signals from multiple photoreceptors within a receptive field to extract spatio-temporal features, conventional event cameras do not leverage local spatial context when deciding which events to fire. Moreover, the eye contains around 20 different kinds of RGCs operating in parallel, each attuned to different features or conditions. Inspired by this biological design, we introduce Neural Ganglion Sensors, an extension of traditional event cameras that learns task-specific spatio-temporal retinal kernels (i.e., RGC "events"). We evaluate our design on two challenging tasks: video interpolation and optical flow. Our results demonstrate that our biologically inspired sensing improves performance relative to conventional event cameras while reducing overall event bandwidth. These findings highlight the promise of RGC-inspired event sensors for edge devices and other low-power, real-time applications requiring efficient, high-resolution visual streams.
