Amirkabir campus dataset: Real-world challenges and scenarios of Visual Inertial Odometry (VIO) for visually impaired people
Ali Samadzadeh, Mohammad Hassan Mojab, Heydar Soudani, Seyed Hesamoddin Mireshghollah, Ahmad Nickabadi
TL;DR
This paper introduces AUT-VI, a highly challenging Visual Inertial Odometry (VIO) dataset intended to advance navigation tools for visually impaired users. The dataset comprises 126 sequences across 17 campus locations, featuring dynamic objects, diverse lighting, reflections, and abrupt camera motions, and is complemented by an Android data-capture app (VIRec) to enable researcher-driven dataset customization with GPS-ground-truth. The authors provide detailed data formats, calibration procedures, and sequence statistics, and they evaluate leading VO/VIO/SLAM methods (Basalt, VINS-Mono, ORB-SLAM3, SLAMANTIC) to benchmark performance under real-world challenges, including dynamic occlusions and day/night loop-closure scenarios. The work argues that none of the current methods fully address all the challenges captured by AUT-VI, highlighting gaps and guiding future improvements such as dynamic-object segmentation, inertial-only estimation in extreme scenarios, and improved feature matching (e.g., Superglue) for robust loop-closure.
Abstract
Visual Inertial Odometry (VIO) algorithms estimate the accurate camera trajectory by using camera and Inertial Measurement Unit (IMU) sensors. The applications of VIO span a diverse range, including augmented reality and indoor navigation. VIO algorithms hold the potential to facilitate navigation for visually impaired individuals in both indoor and outdoor settings. Nevertheless, state-of-the-art VIO algorithms encounter substantial challenges in dynamic environments, particularly in densely populated corridors. Existing VIO datasets, e.g., ADVIO, typically fail to effectively exploit these challenges. In this paper, we introduce the Amirkabir campus dataset (AUT-VI) to address the mentioned problem and improve the navigation systems. AUT-VI is a novel and super-challenging dataset with 126 diverse sequences in 17 different locations. This dataset contains dynamic objects, challenging loop-closure/map-reuse, different lighting conditions, reflections, and sudden camera movements to cover all extreme navigation scenarios. Moreover, in support of ongoing development efforts, we have released the Android application for data capture to the public. This allows fellow researchers to easily capture their customized VIO dataset variations. In addition, we evaluate state-of-the-art Visual Inertial Odometry (VIO) and Visual Odometry (VO) methods on our dataset, emphasizing the essential need for this challenging dataset.
