Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration
Junjie Gao, Chongjian Wang, Zhongjun Ding, Shuangmin Chen, Shiqing Xin, Changhe Tu, Wenping Wang
TL;DR
Deep-PE addresses the challenge of pose evaluation in point cloud registration under low overlap by learning a pose-specific confidence for each candidate transformation. It introduces a lightweight architecture with a Pose-Aware Attention module to simulate alignment status under different poses and a Pose Confidence Prediction module to output per-pose confidence, trained with a weighted cross-entropy loss that emphasizes poses close to the ground truth. By leveraging a pre-trained Geotransformer feature extractor and efficient attention-based learning, Deep-PE achieves state-of-the-art registration recall on benchmarks like 3DLoMatch across handcrafted FPFH and learning-based FC GF descriptors, while remaining robust to low inlier ratios and capable of identifying registration failures. The approach reduces dependence on input correspondences, improves robustness in challenging scenarios, and can be integrated into existing estimator-based pipelines to enhance pose selection without regressing transformations. Overall, Deep-PE demonstrates that learning-based pose evaluation can surpass traditional statistics-based evaluators in difficult registration settings, with practical implications for robotics and 3D perception systems.
Abstract
In the realm of point cloud registration, the most prevalent pose evaluation approaches are statistics-based, identifying the optimal transformation by maximizing the number of consistent correspondences. However, registration recall decreases significantly when point clouds exhibit a low overlap rate, despite efforts in designing feature descriptors and establishing correspondences. In this paper, we introduce Deep-PE, a lightweight, learning-based pose evaluator designed to enhance the accuracy of pose selection, especially in challenging point cloud scenarios with low overlap. Our network incorporates a Pose-Aware Attention (PAA) module to simulate and learn the alignment status of point clouds under various candidate poses, alongside a Pose Confidence Prediction (PCP) module that predicts the likelihood of successful registration. These two modules facilitate the learning of both local and global alignment priors. Extensive tests across multiple benchmarks confirm the effectiveness of Deep-PE. Notably, on 3DLoMatch with a low overlap rate, Deep-PE significantly outperforms state-of-the-art methods by at least 8% and 11% in registration recall under handcrafted FPFH and learning-based FCGF descriptors, respectively. To the best of our knowledge, this is the first study to utilize deep learning to select the optimal pose without the explicit need for input correspondences.
