CDGS: Confidence-Aware Depth Regularization for 3D Gaussian Splatting
Qilin Zhang, Olaf Wysocki, Steffen Urban, Boris Jutzi
TL;DR
This work addresses the limited geometric accuracy of 3D Gaussian Splatting (3DGS) by introducing CDGS, a confidence-aware depth regularization framework. CDGS comprises two components: depth refinement and alignment to produce geometry-consistent depths from monocular estimates and sparse SfM data, and confidence-aware depth regularization that adaptively weights depth supervision using multi-cue confidence maps and an alignment loss $l_a$. On Tanks and Temples, CDGS achieves stable convergence, improved early-stage geometric details, and competitive novel-view synthesis quality, with M3C2 geometry errors consistently lowered and PSNR gains up to $2.31$ dB reported. The approach enables more accurate and efficient 3D reconstruction suitable for real-world applications like digital twins, heritage preservation, and forestry, while acknowledging challenges in indoor, complex lighting scenarios and suggesting avenues for enhanced confidence estimation and additional geometric priors.
Abstract
3D Gaussian Splatting (3DGS) has shown significant advantages in novel view synthesis (NVS), particularly in achieving high rendering speeds and high-quality results. However, its geometric accuracy in 3D reconstruction remains limited due to the lack of explicit geometric constraints during optimization. This paper introduces CDGS, a confidence-aware depth regularization approach developed to enhance 3DGS. We leverage multi-cue confidence maps of monocular depth estimation and sparse Structure-from-Motion depth to adaptively adjust depth supervision during the optimization process. Our method demonstrates improved geometric detail preservation in early training stages and achieves competitive performance in both NVS quality and geometric accuracy. Experiments on the publicly available Tanks and Temples benchmark dataset show that our method achieves more stable convergence behavior and more accurate geometric reconstruction results, with improvements of up to 2.31 dB in PSNR for NVS and consistently lower geometric errors in M3C2 distance metrics. Notably, our method reaches comparable F-scores to the original 3DGS with only 50% of the training iterations. We expect this work will facilitate the development of efficient and accurate 3D reconstruction systems for real-world applications such as digital twin creation, heritage preservation, or forestry applications.
