Real-Time Metric-Semantic Mapping for Autonomous Navigation in Outdoor Environments
Jianhao Jiao, Ruoyu Geng, Yuanhang Li, Ren Xin, Bowen Yang, Jin Wu, Lujia Wang, Ming Liu, Rui Fan, Dimitrios Kanoulas
TL;DR
The paper tackles real-time outdoor autonomous navigation by introducing an online metric-semantic mapping system that fuses LiDAR, vision, and IMU data into a GPU-accelerated TSDF framework with semantic labeling. It combines a LiDAR-Visual-Inertial state estimator, CNN-based pixel-wise segmentation, and a Bayesian fusion mechanism to produce a global 3D mesh annotated with semantic classes, from which traversable regions are extracted for map-based localization and planning. The approach is validated on 24 sequences across public and campus datasets, showing strong geometric and semantic accuracy with rapid per-frame updates (millisecond-scale), and demonstrates real-world point-to-point navigation using the generated maps. The work advances outdoor semantic mapping by enabling real-time, large-scale, semantically informed navigation and provides public code and datasets to foster reproducibility and further research.
Abstract
The creation of a metric-semantic map, which encodes human-prior knowledge, represents a high-level abstraction of environments. However, constructing such a map poses challenges related to the fusion of multi-modal sensor data, the attainment of real-time mapping performance, and the preservation of structural and semantic information consistency. In this paper, we introduce an online metric-semantic mapping system that utilizes LiDAR-Visual-Inertial sensing to generate a global metric-semantic mesh map of large-scale outdoor environments. Leveraging GPU acceleration, our mapping process achieves exceptional speed, with frame processing taking less than 7ms, regardless of scenario scale. Furthermore, we seamlessly integrate the resultant map into a real-world navigation system, enabling metric-semantic-based terrain assessment and autonomous point-to-point navigation within a campus environment. Through extensive experiments conducted on both publicly available and self-collected datasets comprising 24 sequences, we demonstrate the effectiveness of our mapping and navigation methodologies. Code has been publicly released: https://github.com/gogojjh/cobra
