Event-based vision for egomotion estimation using precise event timing
Hugh Greatorex, Michele Mastella, Madison Cotteret, Ole Richter, Elisabetta Chicca
TL;DR
This work addresses the challenge of accurate, low-latency egomotion estimation for robotics by replacing frame-based, energy-intensive pipelines with a fully event-based approach. It introduces a Time Difference Encoder (TDE) and a shallow spiking neural network that directly processes asynchronous event streams to extract local optical flow, enabling on-chip readouts of egomotion with minimal latency and power. Silicon-level measurements of a cognigr1 TDE circuit and on-chip network emulation, complemented by scaled-up simulations with up to 178,880 units, demonstrate low drift and state-of-the-art ARRE on MVSEC compared to prior methods. The approach promises real-time, power-efficient navigation for micro-drones and edge devices, with ready integration into neuromorphic vision stacks and future hardware accelerators.
Abstract
Egomotion estimation is crucial for applications such as autonomous navigation and robotics, where accurate and real-time motion tracking is required. However, traditional methods relying on inertial sensors are highly sensitive to external conditions, and suffer from drifts leading to large inaccuracies over long distances. Vision-based methods, particularly those utilising event-based vision sensors, provide an efficient alternative by capturing data only when changes are perceived in the scene. This approach minimises power consumption while delivering high-speed, low-latency feedback. In this work, we propose a fully event-based pipeline for egomotion estimation that processes the event stream directly within the event-based domain. This method eliminates the need for frame-based intermediaries, allowing for low-latency and energy-efficient motion estimation. We construct a shallow spiking neural network using a synaptic gating mechanism to convert precise event timing into bursts of spikes. These spikes encode local optical flow velocities, and the network provides an event-based readout of egomotion. We evaluate the network's performance on a dedicated chip, demonstrating strong potential for low-latency, low-power motion estimation. Additionally, simulations of larger networks show that the system achieves state-of-the-art accuracy in egomotion estimation tasks with event-based cameras, making it a promising solution for real-time, power-constrained robotics applications.
