VoxNeuS: Enhancing Voxel-Based Neural Surface Reconstruction via Gradient Interpolation
Sidun Liu, Peng Qiao, Zongxin Ye, Wenyu Li, Yong Dou
TL;DR
VoxNeuS tackles gradient discontinuities in voxel-based SDF representations by substituting the analytical gradient from trilinear interpolation with an interpolated gradient, thereby stabilizing optimization and improving surface quality. It additionally enforces SDF regularization directly on grid vertices via explicit gradient computations, and employs a geometry-radiance disentangled architecture to prevent radiance optimization from deforming geometry. The method uses progressive SDF grid super-resolution and CUDA-accelerated primitives to achieve fast training (around 15 minutes) and low memory usage (~2–3 GB) on a single consumer GPU, while delivering superior reconstruction fidelity on DTU and BlendedMVS datasets compared to prior voxel-based and NeRF-like approaches. These contributions yield a practical, scalable pipeline for high-frequency, texture-aware 3D reconstruction with reduced computational overhead. The work demonstrates that gradient continuity and explicit regularization are crucial for efficient and accurate neural surface reconstruction in grid-based representations.
Abstract
Neural Surface Reconstruction learns a Signed Distance Field~(SDF) to reconstruct the 3D model from multi-view images. Previous works adopt voxel-based explicit representation to improve efficiency. However, they ignored the gradient instability of interpolation in the voxel grid, leading to degradation on convergence and smoothness. Besides, previous works entangled the optimization of geometry and radiance, which leads to the deformation of geometry to explain radiance, causing artifacts when reconstructing textured planes. In this work, we reveal that the instability of gradient comes from its discontinuity during trilinear interpolation, and propose to use the interpolated gradient instead of the original analytical gradient to eliminate the discontinuity. Based on gradient interpolation, we propose VoxNeuS, a lightweight surface reconstruction method for computational and memory efficient neural surface reconstruction. Thanks to the explicit representation, the gradient of regularization terms, i.e. Eikonal and curvature loss, are directly solved, avoiding computation and memory-access overhead. Further, VoxNeuS adopts a geometry-radiance disentangled architecture to handle the geometry deformation from radiance optimization. The experimental results show that VoxNeuS achieves better reconstruction quality than previous works. The entire training process takes 15 minutes and less than 3 GB of memory on a single 2080ti GPU.
