Multiway Point Cloud Mosaicking with Diffusion and Global Optimization
Shengze Jin, Iro Armeni, Marc Pollefeys, Daniel Barath
TL;DR
The paper tackles robust multiway point cloud mosaicking for unordered, partially overlapping scans by fusing learning-based matching with classical geometry. The core is ODIN, a diffusion-enhanced, overlap-aware pairwise registration method, followed by a decoupled, diffusion-guided global optimization pipeline: global rotation averaging, optimal robust translation re-estimation, translation optimization, and diffusion-based joint pose optimization. Key contributions include the ODIN matcher with dual attention and diffusion denoising, a globally optimal translation re-estimation via maximal sphere overlaps, and a diffusion-based pose-graph optimizer conditioned on input point clouds, achieving state-of-the-art results across four large datasets. The approach substantially improves both pairwise and multiway registration accuracy, enabling reliable large-scale 3D mosaicking with practical runtimes suitable for robotics and mapping applications, as evidenced by significant reductions in rotation and translation errors on challenging benchmarks.
Abstract
We introduce a novel framework for multiway point cloud mosaicking (named Wednesday), designed to co-align sets of partially overlapping point clouds -- typically obtained from 3D scanners or moving RGB-D cameras -- into a unified coordinate system. At the core of our approach is ODIN, a learned pairwise registration algorithm that iteratively identifies overlaps and refines attention scores, employing a diffusion-based process for denoising pairwise correlation matrices to enhance matching accuracy. Further steps include constructing a pose graph from all point clouds, performing rotation averaging, a novel robust algorithm for re-estimating translations optimally in terms of consensus maximization and translation optimization. Finally, the point cloud rotations and positions are optimized jointly by a diffusion-based approach. Tested on four diverse, large-scale datasets, our method achieves state-of-the-art pairwise and multiway registration results by a large margin on all benchmarks. Our code and models are available at https://github.com/jinsz/Multiway-Point-Cloud-Mosaicking-with-Diffusion-and-Global-Optimization.
