Geometry Depth Consistency in RGBD Relative Pose Estimation
Sourav Kumar, Chiang-Heng Chien, Benjamin Kimia
TL;DR
This work tackles RGBD relative pose estimation by identifying that a single veridical RGBD correspondence imposes strong depth-consistency constraints on all other potential matches. It introduces Geometric Depth Consistency (GDC), which constrains correspondences to lie on nested curves in the image domains, and pairs it with Filtered RANSAC to prune outliers efficiently. The authors further enhance robustness and speed with Nested RANSAC, biasing selections toward high-confidence top-ranked matches. Comprehensive experiments on TUM-RGBD, ICL-NUIM, and RGBD Scene v2 show substantial runtime savings and competitive pose accuracy, especially under high outlier ratios. The combination of GDC and nested strategies yields scalable, robust RGBD pose estimation suitable for VO/SLAM pipelines and 3D reconstruction tasks.
Abstract
Relative pose estimation for RGBD cameras is crucial in a number of applications. Previous approaches either rely on the RGB aspect of the images to estimate pose thus not fully making use of depth in the estimation process or estimate pose from the 3D cloud of points that each image produces, thus not making full use of RGB information. This paper shows that if one pair of correspondences is hypothesized from the RGB-based ranked-ordered correspondence list, then the space of remaining correspondences is restricted to corresponding pairs of curves nested around the hypothesized correspondence, implicitly capturing depth consistency. This simple Geometric Depth Constraint (GDC) significantly reduces potential matches. In effect this becomes a filter on possible correspondences that helps reduce the number of outliers and thus expedites RANSAC significantly. As such, the same budget of time allows for more RANSAC iterations and therefore additional robustness and a significant speedup. In addition, the paper proposed a Nested RANSAC approach that also speeds up the process, as shown through experiments on TUM, ICL-NUIM, and RGBD Scenes v2 datasets.
