GVKF: Gaussian Voxel Kernel Functions for Highly Efficient Surface Reconstruction in Open Scenes
Gaochao Song, Chong Cheng, Hao Wang
TL;DR
GVKF tackles open-scene 3D surface reconstruction by marrying fast Gaussian splatting with an implicit, kernel-regressed opacity field to form a continuous scene representation. It uses a sparse voxel grid to manage Gaussian primitives and models along-ray opacity as ρ(t) and transmission T(t) = exp(-∫_0^t ρ(w) dw), aligning with volume rendering while enabling kernel regression over discrete Gaussians. A logistic surrogate maps the opacity field to a surface distance D(t) to enable direct mesh extraction, linking density to an explicit surface. Experiments on Waymo Open Scene, Tanks & Temples, and Mip-NeRF 360 demonstrate high reconstruction fidelity, real-time rendering, and substantial storage/memory savings compared with both explicit and implicit baselines, illustrating the practical impact for autonomous driving and virtual reality.
Abstract
In this paper we present a novel method for efficient and effective 3D surface reconstruction in open scenes. Existing Neural Radiance Fields (NeRF) based works typically require extensive training and rendering time due to the adopted implicit representations. In contrast, 3D Gaussian splatting (3DGS) uses an explicit and discrete representation, hence the reconstructed surface is built by the huge number of Gaussian primitives, which leads to excessive memory consumption and rough surface details in sparse Gaussian areas. To address these issues, we propose Gaussian Voxel Kernel Functions (GVKF), which establish a continuous scene representation based on discrete 3DGS through kernel regression. The GVKF integrates fast 3DGS rasterization and highly effective scene implicit representations, achieving high-fidelity open scene surface reconstruction. Experiments on challenging scene datasets demonstrate the efficiency and effectiveness of our proposed GVKF, featuring with high reconstruction quality, real-time rendering speed, significant savings in storage and training memory consumption.
