Voxel-SLAM: A Complete, Accurate, and Versatile LiDAR-Inertial SLAM System
Zheng Liu, Haotian Li, Chongjian Yuan, Xiyuan Liu, Jiarong Lin, Rundong Li, Chunran Zheng, Bingyang Zhou, Wenyi Liu, Fu Zhang
TL;DR
Voxel-SLAM tackles the challenge of robust, real-time LiDAR-Inertial SLAM across single and multi-session environments by unifying all modules under an adaptive voxel map and exploiting four data associations: short-term, mid-term, long-term, and multi-map. The system couples a BALM2-based LiDAR-inertial BA with an efficient three-level data pyramid to support initialization, odometry, local mapping, loop closure, and global mapping in real time. Key contributions include robust initialization from highly dynamic starts, a sliding-window LiDAR-Inertial BA for local refinement, loop closure across multiple sessions with a unified map, and a hierarchical global BA to ensure global consistency. The approach demonstrates state-of-the-art accuracy on diverse datasets (Hilti, MARS-LVIG, UrbanNav) and maintains real-time performance on resource-constrained hardware, with explicit support for multi-session relocalization and online global map refinement.
Abstract
In this work, we present Voxel-SLAM: a complete, accurate, and versatile LiDAR-inertial SLAM system that fully utilizes short-term, mid-term, long-term, and multi-map data associations to achieve real-time estimation and high precision mapping. The system consists of five modules: initialization, odometry, local mapping, loop closure, and global mapping, all employing the same map representation, an adaptive voxel map. The initialization provides an accurate initial state estimation and a consistent local map for subsequent modules, enabling the system to start with a highly dynamic initial state. The odometry, exploiting the short-term data association, rapidly estimates current states and detects potential system divergence. The local mapping, exploiting the mid-term data association, employs a local LiDAR-inertial bundle adjustment (BA) to refine the states (and the local map) within a sliding window of recent LiDAR scans. The loop closure detects previously visited places in the current and all previous sessions. The global mapping refines the global map with an efficient hierarchical global BA. The loop closure and global mapping both exploit long-term and multi-map data associations. We conducted a comprehensive benchmark comparison with other state-of-the-art methods across 30 sequences from three representative scenes, including narrow indoor environments using hand-held equipment, large-scale wilderness environments with aerial robots, and urban environments on vehicle platforms. Other experiments demonstrate the robustness and efficiency of the initialization, the capacity to work in multiple sessions, and relocalization in degenerated environments.
