Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration
Xueyang Kang, Zhaoliang Luan, Kourosh Khoshelham, Bing Wang
TL;DR
This work tackles robust sparse point cloud registration by introducing an $SE(3)$-equivariant graph network that jointly learns local geometric descriptors, propagates SE(3) equivariant features via graph neural networks, and employs a Low-Rank Feature Transformation to produce compact, robust descriptors for similarity-based pose estimation. Key contributions include the LRFT module, a local $SO(3)$-invariant frame projection for equivariant message passing, and a rank-based regularizer with submatrix checks to suppress outliers without requiring explicit point correspondences. The method achieves state-of-the-art performance on indoor 3DMatch and strong recall on outdoor KITTI while maintaining low latency, demonstrating data efficiency and potential for real-time registration. Overall, the paper advances registration by integrating symmetry-aware representations with efficient matching, enabling robust alignment from sparsely sampled points and opening avenues for permutation-invariant extensions and multi-sensor fusion.
Abstract
Point cloud registration is a foundational task for 3D alignment and reconstruction applications. While both traditional and learning-based registration approaches have succeeded, leveraging the intrinsic symmetry of point cloud data, including rotation equivariance, has received insufficient attention. This prohibits the model from learning effectively, resulting in a requirement for more training data and increased model complexity. To address these challenges, we propose a graph neural network model embedded with a local Spherical Euclidean 3D equivariance property through SE(3) message passing based propagation. Our model is composed mainly of a descriptor module, equivariant graph layers, match similarity, and the final regression layers. Such modular design enables us to utilize sparsely sampled input points and initialize the descriptor by self-trained or pre-trained geometric feature descriptors easily. Experiments conducted on the 3DMatch and KITTI datasets exhibit the compelling and robust performance of our model compared to state-of-the-art approaches, while the model complexity remains relatively low at the same time.
