DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing
Matias Turkulainen, Xuqian Ren, Iaroslav Melekhov, Otto Seiskari, Esa Rahtu, Juho Kannala
TL;DR
This work tackles the challenge of high-fidelity indoor 3D reconstruction with Gaussian splatting by introducing depth and normal priors. It couples an edge-aware depth loss and monocular normal cues with Gaussian-based priors, enabling stable optimization and direct mesh extraction via Poisson reconstruction. Experiments on MuSHRoom and ScanNet++ show improved novel-view synthesis, depth accuracy, and mesh quality compared to NeRF-, SDF-, and prior Gaussian methods, with ablations highlighting the key role of depth supervision. The approach leverages readily available sensor depth and monocular normals to make indoor 3D reconstruction more accurate and practically deployable for VR/AR pipelines.
Abstract
High-fidelity 3D reconstruction of common indoor scenes is crucial for VR and AR applications. 3D Gaussian splatting, a novel differentiable rendering technique, has achieved state-of-the-art novel view synthesis results with high rendering speeds and relatively low training times. However, its performance on scenes commonly seen in indoor datasets is poor due to the lack of geometric constraints during optimization. In this work, we explore the use of readily accessible geometric cues to enhance Gaussian splatting optimization in challenging, ill-posed, and textureless scenes. We extend 3D Gaussian splatting with depth and normal cues to tackle challenging indoor datasets and showcase techniques for efficient mesh extraction. Specifically, we regularize the optimization procedure with depth information, enforce local smoothness of nearby Gaussians, and use off-the-shelf monocular networks to achieve better alignment with the true scene geometry. We propose an adaptive depth loss based on the gradient of color images, improving depth estimation and novel view synthesis results over various baselines. Our simple yet effective regularization technique enables direct mesh extraction from the Gaussian representation, yielding more physically accurate reconstructions of indoor scenes.
