BIT-VO: Visual Odometry at 300 FPS using Binary Features from the Focal Plane
Riku Murai, Sajad Saeedi, Paul H. J. Kelly
TL;DR
BIT-VO tackles high-speed monocular visual odometry by moving feature extraction to a focal-plane sensor-processor (FPSP) and transmitting only binary features, enabling 6-DoF pose estimation at 300 FPS without intensity data. It introduces a compact 44-bit local binary descriptor for edges, and a robust, on-host frame and map tracking pipeline using Levenberg–Marquardt optimisation with a small, fast BA step. The approach is validated on a 256×256 SCAMP-5 platform, showing robustness to rapid motion and competitive accuracy against conventional VO systems, while delivering substantial speed advantages. This work demonstrates the practicality of on-sensor computation for VO, informs FPSP device design, and suggests directions for noise modeling and benchmarking in FPSP-based SLAM-like systems.
Abstract
Focal-plane Sensor-processor (FPSP) is a next-generation camera technology which enables every pixel on the sensor chip to perform computation in parallel, on the focal plane where the light intensity is captured. SCAMP-5 is a general-purpose FPSP used in this work and it carries out computations in the analog domain before analog to digital conversion. By extracting features from the image on the focal plane, data which is digitized and transferred is reduced. As a consequence, SCAMP-5 offers a high frame rate while maintaining low energy consumption. Here, we present BIT-VO, which is, to the best of our knowledge, the first 6 Degrees of Freedom visual odometry algorithm which utilises the FPSP. Our entire system operates at 300 FPS in a natural scene, using binary edges and corner features detected by the SCAMP-5.
