TCB-VIO: Tightly-Coupled Focal-Plane Binary-Enhanced Visual Inertial Odometry
Matthew Lisondra, Junseo Kim, Glenn Takashi Shimoda, Kourosh Zareinia, Sajad Saeedi
TL;DR
TCB-VIO delivers a tightly-coupled $6$-DoF VIO designed for focal-plane sensor-processor arrays, achieving $250$ FPS visual updates from an IMU stream at $400$ Hz. By performing on-sensor binary edge/corner extraction and a binary-enhanced KLT tracker, then fusing with a MSCKF backbone, it maintains robust trajectory estimates under fast, aggressive motions. The approach outperforms ROVIO, VINS-Mono, and ORB-SLAM3 in indoor and outdoor tests, while offering substantial energy and latency advantages due to on-sensor processing. This work demonstrates FPSPs' potential to enable low-latency, power-efficient VIO for mobile robotics and motivates further hardware-software co-design to migrate more of the pipeline onto the sensor fabric.
Abstract
Vision algorithms can be executed directly on the image sensor when implemented on the next-generation sensors known as focal-plane sensor-processor arrays (FPSP)s, where every pixel has a processor. FPSPs greatly improve latency, reducing the problems associated with the bottleneck of data transfer from a vision sensor to a processor. FPSPs accelerate vision-based algorithms such as visual-inertial odometry (VIO). However, VIO frameworks suffer from spatial drift due to the vision-based pose estimation, whilst temporal drift arises from the inertial measurements. FPSPs circumvent the spatial drift by operating at a high frame rate to match the high-frequency output of the inertial measurements. In this paper, we present TCB-VIO, a tightly-coupled 6 degrees-of-freedom VIO by a Multi-State Constraint Kalman Filter (MSCKF), operating at a high frame-rate of 250 FPS and from IMU measurements obtained at 400 Hz. TCB-VIO outperforms state-of-the-art methods: ROVIO, VINS-Mono, and ORB-SLAM3.
