ADUGS-VINS: Generalized Visual-Inertial Odometry for Robust Navigation in Highly Dynamic and Complex Environments
Rui Zhou, Jingbin Liu, Junbin Xie, Jianyu Zhang, Yingze Hu, Jiele Zhao
TL;DR
ADUGS-VINS tackles the challenge of accurate visual-inertial odometry in highly dynamic and partially occluded environments. It fuses an enhanced SORT tracker with a promptable segmentation model (Mobile SAM) to robustly identify and exclude dynamic regions, while employing an adaptive Kalman-filter-based update and multi-category segmentation to maintain reliable pose estimates. The method further uses ORB features with ANMS and a compensation strategy to preserve sufficient static features for stable optimization, yielding strong results across VIODERN45, OpenLORIS, VIODE, and real-world tests, often outperforming state-of-the-art baselines. This work advances practical navigation in dynamic scenes, with implications for robotics, drones, and autonomous systems requiring robust VIO under complex object motion and occlusions.
Abstract
Visual-inertial odometry (VIO) is widely used in various fields, such as robots, drones, and autonomous vehicles. However, real-world scenes often feature dynamic objects, compromising the accuracy of VIO. The diversity and partial occlusion of these objects present a tough challenge for existing dynamic VIO methods. To tackle this challenge, we introduce ADUGS-VINS, which integrates an enhanced SORT algorithm along with a promptable foundation model into VIO, thereby improving pose estimation accuracy in environments with diverse dynamic objects and frequent occlusions. We evaluated our proposed method using multiple public datasets representing various scenes, as well as in a real-world scenario involving diverse dynamic objects. The experimental results demonstrate that our proposed method performs impressively in multiple scenarios, outperforming other state-of-the-art methods. This highlights its remarkable generalization and adaptability in diverse dynamic environments, showcasing its potential to handle various dynamic objects in practical applications.
