Active Semantic Mapping and Pose Graph Spectral Analysis for Robot Exploration
Rongge Zhang, Haechan Mark Bong, Giovanni Beltrame
TL;DR
This work integrates semantic mutual information with pose-graph spectral analysis to form an active SLAM framework for robot exploration. By weighting trajectory decisions with both semantic information gain and graph-connectivity-based uncertainty, it achieves improvements in metric-semantic SLAM performance without sacrificing exploration efficiency. The approach leverages a Shannon-Rényi entropy formulation and Laplacian spectral properties to avoid costly full-covariance computations, enabling real-time decision-making. Experimental results on photorealistic Habitat/Matterport data demonstrate up to $38\%$ localization error reduction, $21\%$ map error reduction, and notable semantic IoU gains, highlighting the practical impact for robust autonomous exploration and scene understanding.
Abstract
Exploration in unknown and unstructured environments is a pivotal requirement for robotic applications. A robot's exploration behavior can be inherently affected by the performance of its Simultaneous Localization and Mapping (SLAM) subsystem, although SLAM and exploration are generally studied separately. In this paper, we formulate exploration as an active mapping problem and extend it with semantic information. We introduce a novel active metric-semantic SLAM approach, leveraging recent research advances in information theory and spectral graph theory: we combine semantic mutual information and the connectivity metrics of the underlying pose graph of the SLAM subsystem. We use the resulting utility function to evaluate different trajectories to select the most favorable strategy during exploration. Exploration and SLAM metrics are analyzed in experiments. Running our algorithm on the Habitat dataset, we show that, while maintaining efficiency close to the state-of-the-art exploration methods, our approach effectively increases the performance of metric-semantic SLAM with a 21% reduction in average map error and a 9% improvement in average semantic classification accuracy.
