P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising
Mathias Vogel, Keisuke Tateno, Marc Pollefeys, Federico Tombari, Marie-Julie Rakotosaona, Francis Engelmann
TL;DR
The paper addresses 3D point cloud denoising under real-world scanner noise by framing denoising as a diffusion Schrödinger bridge that learns an optimal transport plan between paired clean and noisy clouds. It introduces a data-to-data diffusion process, uses shortest-path interpolation to align point sets, and reduces the stochastic process to an OT-ODE when stochasticity vanishes, enabling efficient training and inference. The approach, called P2P-Bridge, leverages PVCNN-based architectures and can incorporate RGB or high-level DINOV2 features to boost performance, achieving state-of-the-art results on object-level datasets (PU-Net, PC-Net) and indoor scenes (ScanNet++, ARKitScenes) with as few as 5–10 inference steps. The work emphasizes the importance of data alignment, tractable diffusion bridges, and feature integration for robust denoising, offering a practical, scalable solution with open-source code and pretrained models.
Abstract
In this work, we tackle the task of point cloud denoising through a novel framework that adapts Diffusion Schrödinger bridges to points clouds. Unlike previous approaches that predict point-wise displacements from point features or learned noise distributions, our method learns an optimal transport plan between paired point clouds. Experiments on object datasets like PU-Net and real-world datasets such as ScanNet++ and ARKitScenes show that P2P-Bridge achieves significant improvements over existing methods. While our approach demonstrates strong results using only point coordinates, we also show that incorporating additional features, such as color information or point-wise DINOv2 features, further enhances the performance. Code and pretrained models are available at https://p2p-bridge.github.io.
