Improving Geometry in Sparse-View 3DGS via Reprojection-based DoF Separation
Yongsung Kim, Minjun Park, Jooyoung Choi, Sungroh Yoon
TL;DR
This work targets geometry artifacts that arise when refining sparse-view 3D geometry with 3D Gaussian Splatting (3DGS). It introduces a reprojection-based DoF separation that partitions positional DoFs into image-plane-parallel and ray-aligned types, applying a bounded offset for the former and a visibility loss for the latter to leverage per-view depth priors from learning-based MVS. The method preserves depth information while suppressing texture-driven geometric distortions, and it demonstrates improvements in geometry plausibility (via $PDC$) with competitive rendering quality across Mip-NeRF 360, MVImgNet, and Tanks and Temples. Ablation studies confirm the importance of the bounded offset and visibility loss, and limitations are discussed in the context of the underlying MVS pose-estimation and specular-surface handling. Overall, the approach advances sparse-view 3DGS by prioritizing geometric plausibility alongside photometric fidelity, suggesting a shift toward geometry-focused metrics beyond $PSNR$ in future work.
Abstract
Recent learning-based Multi-View Stereo models have demonstrated state-of-the-art performance in sparse-view 3D reconstruction. However, directly applying 3D Gaussian Splatting (3DGS) as a refinement step following these models presents challenges. We hypothesize that the excessive positional degrees of freedom (DoFs) in Gaussians induce geometry distortion, fitting color patterns at the cost of structural fidelity. To address this, we propose reprojection-based DoF separation, a method distinguishing positional DoFs in terms of uncertainty: image-plane-parallel DoFs and ray-aligned DoF. To independently manage each DoF, we introduce a reprojection process along with tailored constraints for each DoF. Through experiments across various datasets, we confirm that separating the positional DoFs of Gaussians and applying targeted constraints effectively suppresses geometric artifacts, producing reconstruction results that are both visually and geometrically plausible.
