RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

Congjia Chen; Xiaoyu Jia; Yanhong Zheng; Yufu Qu

RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

Congjia Chen, Xiaoyu Jia, Yanhong Zheng, Yufu Qu

TL;DR

RGBD-Glue addresses RGB-D point-cloud registration by decoupling visual and geometric features, then combining them through an explicit transformation-consistency filter and an adaptive threshold. Visual correspondences provide a rough prior to estimate a transformation and its error distribution, from which a distribution-informed threshold $\epsilon$ selects credible geometric matches; the final registration is obtained by a weighted Procrustes fit over the fused set. The approach is flexible, working with hand-crafted or learning-based descriptors, and demonstrates state-of-the-art performance on ScanNet and 3DMatch, while maintaining robustness under large frame spacing and across multiple visual features. This yields a more robust and practical RGB-D registration pipeline that leverages complementary cues without brittle, tightly fused representations.

Abstract

Point cloud registration is a fundamental task for estimating rigid transformations between point clouds. Previous studies have used geometric information for extracting features, matching and estimating transformation. Recently, owing to the advancement of RGB-D sensors, researchers have attempted to combine visual and geometric information to improve registration performance. However, these studies focused on extracting distinctive features by deep feature fusion, which cannot effectively solve the negative effects of each feature's weakness, and cannot sufficiently leverage the valid information. In this paper, we propose a new feature combination framework, which applies a looser but more effective combination. An explicit filter based on transformation consistency is designed for the combination framework, which can overcome each feature's weakness. And an adaptive threshold determined by the error distribution is proposed to extract more valid information from the two types of features. Owing to the distinctive design, our proposed framework can estimate more accurate correspondences and is applicable to both hand-crafted and learning-based feature descriptors. Experiments on ScanNet and 3DMatch show that our method achieves a state-of-the-art performance.

RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

TL;DR

selects credible geometric matches; the final registration is obtained by a weighted Procrustes fit over the fused set. The approach is flexible, working with hand-crafted or learning-based descriptors, and demonstrates state-of-the-art performance on ScanNet and 3DMatch, while maintaining robustness under large frame spacing and across multiple visual features. This yields a more robust and practical RGB-D registration pipeline that leverages complementary cues without brittle, tightly fused representations.

Abstract

Paper Structure (18 sections, 10 equations, 4 figures, 9 tables)

This paper contains 18 sections, 10 equations, 4 figures, 9 tables.

Introduction
Related Work
Image Feature Matching
Point Cloud Registration
RGB-D Combination
Method
Feature Matching
Adaptive Filter Based on Transformation Consistency
Geometric Fitting
Experiment
Experimental Settings
Registration on ScanNet Dataset
Registration on 3DMatch Dataset
Ablations
Conclusion
...and 3 more sections

Figures (4)

Figure 1: RGBD-Glue combines visual and geometric features to estimate credible correspondences for geometric fitting, which can achieve low rotation errors (REs) and translation errors (TEs) in registration.
Figure 2: Architecture of the proposed RGBD-Glue framework. First, we extract both visual and geometric features from RGB-D data. Second, we match them to obtain correspondences. Third, we leverage the high-quality visual correspondences to find credible geometric correspondences by testing the transformation consistency based on an adaptive threshold. Finally, we estimate the transformation via the correspondences.
Figure 3: Correspondence estimation result yielded by our proposed method. The green lines denote the correct correspondences, and the red lines denote the incorrect correspondences. For better visualization, we randomly sample parts of matches to draw the lines. The result shows that the processed correspondences have high inlier ratio.
Figure S1: An example of different frame spacing. We reduce the opacity of non-overlap region for better visualization. Large frame spacing causes low overlap, which brings great challenge to both visual and geometric feature matching.

RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

TL;DR

Abstract

RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration

Authors

TL;DR

Abstract

Table of Contents

Figures (4)