LaS-Comp: Zero-shot 3D Completion with Latent-Spatial Consistency

Weilong Yan; Haipeng Li; Hao Xu; Nianjin Ye; Yihao Ai; Shuaicheng Liu; Jingyu Hu

LaS-Comp: Zero-shot 3D Completion with Latent-Spatial Consistency

Weilong Yan, Haipeng Li, Hao Xu, Nianjin Ye, Yihao Ai, Shuaicheng Liu, Jingyu Hu

TL;DR

This paper introduces LaS-Comp, a zero-shot and category-agnostic approach that leverages the rich geometric priors of 3D foundation models to enable 3D shape completion across diverse types of partial observations and introduces Omni-Comp, a comprehensive benchmark combining real-world and synthetic data with diverse and challenging partial patterns.

Abstract

This paper introduces LaS-Comp, a zero-shot and category-agnostic approach that leverages the rich geometric priors of 3D foundation models to enable 3D shape completion across diverse types of partial observations. Our contributions are threefold: First, \ourname{} harnesses these powerful generative priors for completion through a complementary two-stage design: (i) an explicit replacement stage that preserves the partial observation geometry to ensure faithful completion; and (ii) an implicit refinement stage ensures seamless boundaries between the observed and synthesized regions. Second, our framework is training-free and compatible with different 3D foundation models. Third, we introduce Omni-Comp, a comprehensive benchmark combining real-world and synthetic data with diverse and challenging partial patterns, enabling a more thorough and realistic evaluation. Both quantitative and qualitative experiments demonstrate that our approach outperforms previous state-of-the-art approaches. Our code and data will be available at \href{https://github.com/DavidYan2001/LaS-Comp}{LaS-Comp}.

LaS-Comp: Zero-shot 3D Completion with Latent-Spatial Consistency

TL;DR

This paper introduces LaS-Comp, a zero-shot and category-agnostic approach that leverages the rich geometric priors of 3D foundation models to enable 3D shape completion across diverse types of partial observations and introduces Omni-Comp, a comprehensive benchmark combining real-world and synthetic data with diverse and challenging partial patterns.

Abstract

This paper introduces LaS-Comp, a zero-shot and category-agnostic approach that leverages the rich geometric priors of 3D foundation models to enable 3D shape completion across diverse types of partial observations. Our contributions are threefold: First, \ourname{} harnesses these powerful generative priors for completion through a complementary two-stage design: (i) an explicit replacement stage that preserves the partial observation geometry to ensure faithful completion; and (ii) an implicit refinement stage ensures seamless boundaries between the observed and synthesized regions. Second, our framework is training-free and compatible with different 3D foundation models. Third, we introduce Omni-Comp, a comprehensive benchmark combining real-world and synthetic data with diverse and challenging partial patterns, enabling a more thorough and realistic evaluation. Both quantitative and qualitative experiments demonstrate that our approach outperforms previous state-of-the-art approaches. Our code and data will be available at \href{https://github.com/DavidYan2001/LaS-Comp}{LaS-Comp}.

Paper Structure (26 sections, 12 equations, 8 figures, 6 tables)

This paper contains 26 sections, 12 equations, 8 figures, 6 tables.

Introduction
Related Work
Supervised Shape Completion.
Unsupervised Shape Completion.
Shape Completion with Generative Priors.
Method
Overview
Explicit Replacement Stage
Clean Branch.
Noisy Branch.
Implicit Alignment Stage
Experiments
Datasets and Implementation details
Evaluation Datasets.
Proposed Omni-Comp Benchmark.
...and 11 more sections

Figures (8)

Figure 1: Our new framework supports category-agnostic shape completion across diverse partial patterns, including (a) random crops, (b) single-view scans, and (c) missing semantic parts. It further supports both unconditional and text-guided completion, offering flexible control for real-world applications, see (d).
Figure 2: Overview of the LaS-Comp framework. Starting from Gaussian noise, the process iteratively refines a latent feature $\boldsymbol{x}_t$ under the guidance of the partial input $\boldsymbol{S}_p$. At each iteration $t$, this refinement is performed in two stages: the Explicit Replacement Stage (ERS) and the Implicit Alignment Stage (IAS). The ERS explicitly injects the known geometry of $\boldsymbol{S}_p$ into $\boldsymbol{x}_t$ to produce an updated latent $\boldsymbol{x}^{*}_t$. The IAS then refines $\boldsymbol{x}^{*}_t$ using a gradient-based optimization, yielding a spatially aligned latent $\boldsymbol{x}_{t-dt}$ for the next step. After the final iteration, the completed shape $\boldsymbol{S_{\text{c}}}$ is obtained by decoding the refined latent.
Figure 3: Overview of the Explicit Replacement Stage (ERS). At each timestep $t$, ERS decomposes the latent generation into two parallel branches. The clean branch (top) enforces spatial consistency, yielding $\boldsymbol{x}^{*}_{0|t}$. Concurrently, the noisy branch (bottom) enhances fidelity, producing $\boldsymbol{x}^{*}_{1|t}$. These two branch outputs are then interpolated to compute the final aligned latent $\boldsymbol{x}^{*}_t$.
Figure 4: Qualitative comparison on Redwood dataset redwood. We compare with various supervised and unsupervised methods zhu2023svdformeradapointrsdscompletepcdreamercompc, and visualize the output as meshes utilizing the commonly-used mesh reconstruction method Peng2021SAP.
Figure 5: Visual examples on ScanNet dataset_scannet and KITTI dataset_kitti real-world datasets, which only contain real scans of table, chair, and car, with very sparse points.
...and 3 more figures

LaS-Comp: Zero-shot 3D Completion with Latent-Spatial Consistency

TL;DR

Abstract

LaS-Comp: Zero-shot 3D Completion with Latent-Spatial Consistency

Authors

TL;DR

Abstract

Table of Contents

Figures (8)