ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

Tianxin Huang; Zhiwen Yan; Yuyang Zhao; Gim Hee Lee

ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

Tianxin Huang, Zhiwen Yan, Yuyang Zhao, Gim Hee Lee

TL;DR

This paper tackles the challenge of completing 3D point clouds captured from partial views, especially across unseen categories. It introduces ComPC, a training-free test-time framework that uses pre-trained 2D diffusion priors guided through 3D Gaussian Splatting to fill missing regions, starting from a reference viewpoint. The method comprises Partial Gaussian Initialization (PGI) to generate a guiding reference image, Zero-shot Fractal Completion (ZFC) to optimize 3D Gaussians under diffusion guidance with a Preservation Constraint, and Point Cloud Extraction (PCE) to produce uniform surface points, enabling efficient, generalizable completion. Experiments on synthetic data, Redwood real scans, and ShapeNet/Kitti variants show robust performance, improved generalization to unseen categories, and faster optimization relative to prior prompt-based methods, with a practical, training-free workflow disclosed on the project page.

Abstract

3D point clouds directly collected from objects through sensors are often incomplete due to self-occlusion. Conventional methods for completing these partial point clouds rely on manually organized training sets and are usually limited to object categories seen during training. In this work, we propose a test-time framework for completing partial point clouds across unseen categories without any requirement for training. Leveraging point rendering via Gaussian Splatting, we develop techniques of Partial Gaussian Initialization, Zero-shot Fractal Completion, and Point Cloud Extraction that utilize priors from pre-trained 2D diffusion models to infer missing regions and extract uniform completed point clouds. Experimental results on both synthetic and real-world scanned point clouds demonstrate that our approach outperforms existing methods in completing a variety of objects. Our project page is at \url{https://tianxinhuang.github.io/projects/ComPC/}.

ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

TL;DR

Abstract

Paper Structure (26 sections, 9 equations, 18 figures, 12 tables, 1 algorithm)

This paper contains 26 sections, 9 equations, 18 figures, 12 tables, 1 algorithm.

Introduction
Related Works
3D Generation via 2D priors
Point Cloud Completion
Methodology
Partial Gaussian Initialization
Zero-shot Fractal Completion
Point Cloud Extraction
Experiments
Comparison on Synthetic Point Clouds
Comparison on Real Scans
Ablation Study for Colorization Strategies in PGI
Ablation Study for ZFC and PCE
Limitation
Conclusion
...and 11 more sections

Figures (18)

Figure 1: Different point cloud completion methods. (a) Existing network-based completion methods; (b) Test-time SDS-complete kasten2024point with text prompts to guide Neural surface for completion; (c) Our method based on 3D Gaussian Splatting (GS) guided by the diffusion model from Zero 1-to-3 liu2023zero conditioned on the reference image rendered from partial points.
Figure 2: Illustration of our framework. ①In Partial Gaussian Initialization (PGI), Reference Viewpoint Estimation estimates a camera pose $V_p$ where $P_{in}$ can be most completely observed. We initialize 3D Gaussians $G_{in}$ from $P_{in}$ and render the reference image $I_{in}$ under $V_p$. ②In Zero-shot Fractal Completion (ZFC), 3D Gaussians $G_m$ begins with an initialization using noisy $P_{N}$ and undergoes optimization guided by view-dependent guidance from the diffusion model $f_Z$ in Zero 1-to-3 liu2023zero based on a randomly chosen camera pose $V_i$. Additionally, it incorporates a Preservation Constraint computed with respect to $V_p$. $G_{in}$ is mixed with $G_m$ to form $G_{all}$, introducing the partial geometry from $P_{in}$. ③After ZFC, we use Point Cloud Extraction (PCE) to extract surface points $P_{surf}$ from centers of $G_{all}$, and convert $P_{surf}$ into uniform $P_{out}$ with Grid Pulling.
Figure 3: Differences between our binarized opacity and original continuous opacity. $\prec$ denotes smaller but not approaching.
Figure 4: Illustration of Grid Pulling module. $g(\cdot)$ is a MLP-based SDF learned from the completed point cloud $P_{surf}$. Merge denotes merge layer from huang2021rfnet. Given the 3D grids $\mathcal{G}$, $r$ is the diagonal length of a unit grid. Sampled points would be $P_s = \{p \mid g(p)<0.5r, p \in \mathcal{G}\}$.
Figure 5: Qualitative comparison on synthetic data.
...and 13 more figures

ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

TL;DR

Abstract

ComPC: Completing a 3D Point Cloud with 2D Diffusion Priors

Authors

TL;DR

Abstract

Table of Contents

Figures (18)