A real-time, robust and versatile visual-SLAM framework based on deep learning networks
Zhang Xiao, Shuaixin Li
TL;DR
The paper addresses robust real-time visual-SLAM in challenging environments and presents Rover-SLAM, a versatile, hybrid vSLAM framework that integrates learning-based feature extraction (SuperPoint) and learning-based matching (LightGlue) across tracking, mapping, and loop closure. It supports monocular, monocular-inertial, stereo, and stereo-inertial configurations and uses adaptive feature filtering, a learning-based local mapping, and a deep BoW loop-closure descriptor, all deployed via ONNX Runtime. Extensive experiments on EuRoC, TUM-VI, and self-collected data demonstrate that Rover-SLAM achieves state-of-the-art localization accuracy and tracking robustness across configurations, while maintaining real-time performance. The work provides a practical, scalable platform with public code release that can benefit robotics, autonomous driving, and 3D reconstruction applications by improving SLAM reliability in challenging lighting, texture, and motion conditions.
Abstract
This paper explores how deep learning techniques can improve visual-based SLAM performance in challenging environments. By combining deep feature extraction and deep matching methods, we introduce a versatile hybrid visual SLAM system designed to enhance adaptability in challenging scenarios, such as low-light conditions, dynamic lighting, weak-texture areas, and severe jitter. Our system supports multiple modes, including monocular, stereo, monocular-inertial, and stereo-inertial configurations. We also perform analysis how to combine visual SLAM with deep learning methods to enlighten other researches. Through extensive experiments on both public datasets and self-sampled data, we demonstrate the superiority of the SL-SLAM system over traditional approaches. The experimental results show that SL-SLAM outperforms state-of-the-art SLAM algorithms in terms of localization accuracy and tracking robustness. For the benefit of community, we make public the source code at https://github.com/zzzzxxxx111/SLslam.
