SplatSDF: Boosting Neural Implicit SDF via Gaussian Splatting Fusion
Runfa Blark Li, Keito Suzuki, Bang Du, Ki Myung Brian Lee, Nikolay Atanasov, Truong Nguyen
TL;DR
SplatSDF introduces architecture-level fusion of Gaussian Splatting (3DGS) with neural implicit SDF (SDF-NeRF) to recover continuous 3D geometry from multi-view images. By training with 3DGS input and employing a dedicated 3DGS Aggregator and a Surface 3DGS Fusion strategy, it achieves faster convergence and higher geometric and photometric accuracy than state-of-the-art SDF-NeRF methods, while retaining the same inference cost as prior approaches. Key findings include improved Chamfer Distance and PSNR on DTU and NeRF Synthetic datasets, and ablations showing the superiority of anchor-point surface fusion over dense fusion. The work suggests a practical path to more accurate and efficient neural implicit representations for robotics and graphics tasks, with explicit 3D priors guiding the SDF learning during training only.
Abstract
A signed distance function (SDF) is a useful representation for continuous-space geometry and many related operations, including rendering, collision checking, and mesh generation. Hence, reconstructing SDF from image observations accurately and efficiently is a fundamental problem. Recently, neural implicit SDF (SDF-NeRF) techniques, trained using volumetric rendering, have gained a lot of attention. Compared to earlier truncated SDF (TSDF) fusion algorithms that rely on depth maps and voxelize continuous space, SDF-NeRF enables continuous-space SDF reconstruction with better geometric and photometric accuracy. However, the accuracy and convergence speed of scene-level SDF reconstruction require further improvements for many applications. With the advent of 3D Gaussian Splatting (3DGS) as an explicit representation with excellent rendering quality and speed, several works have focused on improving SDF-NeRF by introducing consistency losses on depth and surface normals between 3DGS and SDF-NeRF. However, loss-level connections alone lead to incremental improvements. We propose a novel neural implicit SDF called "SplatSDF" to fuse 3DGSandSDF-NeRF at an architecture level with significant boosts to geometric and photometric accuracy and convergence speed. Our SplatSDF relies on 3DGS as input only during training, and keeps the same complexity and efficiency as the original SDF-NeRF during inference. Our method outperforms state-of-the-art SDF-NeRF models on geometric and photometric evaluation by the time of submission.
