LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks
Liudi Yang, Ruben Mascaro, Ignacio Alzugaray, Sai Manoj Prakhya, Marco Karrer, Ziyuan Liu, Margarita Chli
TL;DR
This work tackles LiDAR SLAM drift by introducing a semantic-graph loop-closure framework powered by Graph Attention Networks (GATs). It encodes semantic graphs from segmented point clouds into distinctive graph vectors via a three-branch GAT and a self-attention–based graph encoder, then compares graphs with a difference-driven neural module to detect loop closures. A semantic registration stage estimates the 6 DoF pose constraint to be added to the pose graph, improving trajectory consistency. On SemanticKITTI and KITTI-360, the method yields up to a 13% improvement in maximum $F_1$ score over the SGPR baseline and demonstrates real-time performance with a compact model, while open-sourcing the implementation to facilitate further research.
Abstract
In this paper, we propose a novel loop closure detection algorithm that uses graph attention neural networks to encode semantic graphs to perform place recognition and then use semantic registration to estimate the 6 DoF relative pose constraint. Our place recognition algorithm has two key modules, namely, a semantic graph encoder module and a graph comparison module. The semantic graph encoder employs graph attention networks to efficiently encode spatial, semantic and geometric information from the semantic graph of the input point cloud. We then use self-attention mechanism in both node-embedding and graph-embedding steps to create distinctive graph vectors. The graph vectors of the current scan and a keyframe scan are then compared in the graph comparison module to identify a possible loop closure. Specifically, employing the difference of the two graph vectors showed a significant improvement in performance, as shown in ablation studies. Lastly, we implemented a semantic registration algorithm that takes in loop closure candidate scans and estimates the relative 6 DoF pose constraint for the LiDAR SLAM system. Extensive evaluation on public datasets shows that our model is more accurate and robust, achieving 13% improvement in maximum F1 score on the SemanticKITTI dataset, when compared to the baseline semantic graph algorithm. For the benefit of the community, we open-source the complete implementation of our proposed algorithm and custom implementation of semantic registration at https://github.com/crepuscularlight/SemanticLoopClosure
