Event-based Visual Inertial Velometer
Xiuyuan Lu, Yi Zhou, Junkai Niu, Sheng Zhong, Shaojie Shen
TL;DR
The paper tackles robust state estimation under aggressive ego motion by replacing pose estimation with instantaneous linear velocity estimation, using a map-free, event-based visual-inertial velometer. It fuses stereo event camera data and IMU measurements in a continuous-time framework, modeling velocity with a cubic B-spline and enforcing both event-derived normal-flow constraints and IMU pre-integration corrections. Empirical results on synthetic and real spinning datasets show metric-scale velocity with low latency and improved dead-reckoning over frame-based VO baselines, highlighting robustness to motion blur and high-speed dynamics. By aligning estimation with the differential nature of event data, the approach reduces dependence on data association and offers a practical path toward reliable high-speed navigation with neuromorphic vision.
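To make the cubic B-spline velocity model concrete, here is a minimal sketch of how a uniform cubic B-spline over 3-D control points can be evaluated for velocity and its time derivative. The function names (`spline_velocity`, `spline_acceleration`), the uniform knot spacing `dt`, and the start time `t0` are assumptions made for illustration; the paper's actual parameterization may differ.

```python
import numpy as np

# Standard basis matrix of a uniform cubic B-spline.
M = np.array([[ 1.0,  4.0,  1.0, 0.0],
              [-3.0,  0.0,  3.0, 0.0],
              [ 3.0, -6.0,  3.0, 0.0],
              [-1.0,  3.0, -3.0, 1.0]]) / 6.0

def _segment(ctrl, t, t0, dt):
    """Return the segment index i and local parameter u for time t."""
    s = (t - t0) / dt
    i = min(max(int(np.floor(s)), 0), len(ctrl) - 4)
    return i, s - i

def spline_velocity(ctrl, t, t0, dt):
    """Velocity at time t from (N, 3) control points (hypothetical helper)."""
    i, u = _segment(ctrl, t, t0, dt)
    powers = np.array([1.0, u, u * u, u ** 3])
    return powers @ M @ ctrl[i:i + 4]

def spline_acceleration(ctrl, t, t0, dt):
    """Time derivative of the velocity spline at time t."""
    i, u = _segment(ctrl, t, t0, dt)
    dpowers = np.array([0.0, 1.0, 2.0 * u, 3.0 * u * u]) / dt
    return dpowers @ M @ ctrl[i:i + 4]

# Control points that grow linearly along the direction (1, 2, 3).
ctrl = np.outer(np.arange(6.0), np.array([1.0, 2.0, 3.0]))
v = spline_velocity(ctrl, t=0.25, t0=0.0, dt=0.1)
a = spline_acceleration(ctrl, t=0.25, t0=0.0, dt=0.1)
```

Because each query touches only four neighbouring control points, a representation of this kind can be queried at the arbitrary timestamps of asynchronous events and IMU samples, and its analytic derivative gives an acceleration that can be compared against inertial measurements.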
Abstract
Neuromorphic event-based cameras are bio-inspired visual sensors with asynchronous pixels and extremely high temporal resolution. Such favorable properties make them an excellent choice for solving state estimation tasks under aggressive ego motion. However, failures of camera pose tracking are frequently witnessed in state-of-the-art event-based visual odometry systems when the local map cannot be updated in time. One of the biggest roadblocks for this specific field is the absence of efficient and robust methods for data association without imposing any assumption on the environment. This problem seems, however, unlikely to be addressed as in standard vision due to the motion-dependent observability of event data. Therefore, we propose a mapping-free design for event-based visual-inertial state estimation in this paper. Instead of estimating the position of the event camera, we find that recovering the instantaneous linear velocity is more consistent with the differential working principle of event cameras. The proposed event-based visual-inertial velometer leverages a continuous-time formulation that incrementally fuses the heterogeneous measurements from a stereo event camera and an inertial measurement unit. Experiments on the synthetic dataset demonstrate that the proposed method can recover instantaneous linear velocity in metric scale with low latency.
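The idea that linear velocity can be recovered directly from differential measurements can be illustrated with a small sketch. It assumes the classical motion-field model in normalized image coordinates (one common sign convention), a known depth per point (as a stereo rig could provide), and a known angular velocity (as a gyroscope could provide). Under these assumptions each normal-flow observation yields one linear equation in the linear velocity. All names here (`flow_matrices`, `solve_linear_velocity`) are hypothetical, and this is not claimed to be the authors' exact formulation.

```python
import numpy as np

def flow_matrices(x, y):
    """Translational (A) and rotational (B) parts of the motion field."""
    A = np.array([[-1.0, 0.0, x],
                  [0.0, -1.0, y]])
    B = np.array([[x * y, -(1.0 + x * x), y],
                  [1.0 + y * y, -x * y, -x]])
    return A, B

def solve_linear_velocity(pts, normals, nf_mag, depth, omega):
    """Least-squares linear velocity from normal-flow observations.

    Each row encodes n^T (A v / Z + B omega) = m, where n is the unit
    normal-flow direction and m is the signed normal-flow magnitude.
    """
    rows, rhs = [], []
    for (x, y), n, m, Z in zip(pts, normals, nf_mag, depth):
        A, B = flow_matrices(x, y)
        rows.append(n @ A / Z)
        rhs.append(m - n @ B @ omega)
    v, *_ = np.linalg.lstsq(np.array(rows), np.array(rhs), rcond=None)
    return v

# Synthetic check: simulate normal flow from a known velocity, then recover it.
rng = np.random.default_rng(0)
pts = rng.uniform(-0.5, 0.5, size=(200, 2))
angles = rng.uniform(0.0, 2.0 * np.pi, size=200)
normals = np.stack([np.cos(angles), np.sin(angles)], axis=1)
depth = rng.uniform(1.0, 5.0, size=200)
v_true = np.array([0.8, -0.3, 1.5])
omega = np.array([0.1, -0.2, 0.05])

nf_mag = np.empty(200)
for k, ((x, y), n, Z) in enumerate(zip(pts, normals, depth)):
    A, B = flow_matrices(x, y)
    nf_mag[k] = n @ (A @ v_true / Z + B @ omega)

v_est = solve_linear_velocity(pts, normals, nf_mag, depth, omega)
```

No feature tracking or map is involved in this sketch: every observation is used on its own, which is the sense in which a velocity-based formulation relies less on data association. Metric scale comes from the metric depth values supplied to the solver.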
