Open-Set Semantic Uncertainty Aware Metric-Semantic Graph Matching
Kurran Singh, John J. Leonard
TL;DR
This work tackles open-set object-based SLAM in challenging underwater environments by introducing a semantic-uncertainty-aware graph matching framework. It represents detected objects as 384-dimensional semantic embeddings with per-object uncertainty and casts open-set place recognition as a Quadratic Assignment Problem with node and edge affinities, including WeightedCosineSim, Mahalanobis, and Bhattacharyya-based measures. The method combines uncertainty quantification, graph matching solvers (spectral, RRWM, A*, neural), and a local-map construction pipeline with Kalman-filter-based uncertainty tracking, demonstrated on underwater data and KITTI to show real-time feasibility and cross-domain generalization. Key findings show that weighted cosine affinity often yields best tradeoffs, A* achieves top accuracy on small graphs, RRWM scales well to larger graphs, and the approach robustly handles unseen object classes for loop closure and map merging. The work thus enables robust, open-set, multi-object, semantic-uncertainty-aware loop closure in marine environments and beyond.
Abstract
Underwater object-level mapping requires incorporating visual foundation models to handle the uncommon and often previously unseen object classes encountered in marine scenarios. In this work, a metric of semantic uncertainty for open-set object detections produced by visual foundation models is calculated and then incorporated into an object-level uncertainty tracking framework. Object-level uncertainties and geometric relationships between objects are used to enable robust object-level loop closure detection for unknown object classes. The above loop closure detection problem is formulated as a graph-matching problem. While graph matching, in general, is NP-Complete, a solver for an equivalent formulation of the proposed graph matching problem as a graph editing problem is tested on multiple challenging underwater scenes. Results for this solver as well as three other solvers demonstrate that the proposed methods are feasible for real-time use in marine environments for the robust, open-set, multi-object, semantic-uncertainty-aware loop closure detection. Further experimental results on the KITTI dataset demonstrate that the method generalizes to large-scale terrestrial scenes.
