DANCE: Density-agnostic and Class-aware Network for Point Cloud Completion
Da-Yeong Kim, Yeong-Jun Cho
TL;DR
Point cloud completion must cope with incomplete scans that vary in sparsity. DANCE addresses this by a density-agnostic, class-aware pipeline that generates candidate points $P^S$ via ray-based sampling with $M = V R^2$, refines them through a transformer decoder predicting offsets $o_m$ and opacities $\,\\sigma_m$, and yields a flexible output $P^{out}$ combined with the observed input $P^I$ to form $P^{pred}$. A lightweight 3D classification head provides semantic priors from the incomplete geometry, guiding completion without image supervision, while a fusion network translates features and priors into per-point predictions. Experiments on PCN and MVP show state-of-the-art accuracy, stronger structural consistency, and robustness to density and noise, with the added benefit that output density can be controlled at inference by adjusting the sampling parameter $R$.
Abstract
Point cloud completion aims to recover missing geometric structures from incomplete 3D scans, which often suffer from occlusions or limited sensor viewpoints. Existing methods typically assume fixed input/output densities or rely on image-based representations, making them less suitable for real-world scenarios with variable sparsity and limited supervision. In this paper, we introduce Density-agnostic and Class-aware Network (DANCE), a novel framework that completes only the missing regions while preserving the observed geometry. DANCE generates candidate points via ray-based sampling from multiple viewpoints. A transformer decoder then refines their positions and predicts opacity scores, which determine the validity of each point for inclusion in the final surface. To incorporate semantic guidance, a lightweight classification head is trained directly on geometric features, enabling category-consistent completion without external image supervision. Extensive experiments on the PCN and MVP benchmarks show that DANCE outperforms state-of-the-art methods in accuracy and structural consistency, while remaining robust to varying input densities and noise levels.
