Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction
Li Fang, Hao Zhu, Longlong Chen, Fei Hu, Long Ye, Zhan Ma
TL;DR
This work tackles the high computational cost of rendering high-resolution views in generalizable NeRFs by introducing depth-guided bundle sampling, which groups adjacent rays into cones and samples them jointly with depth-aware adaptation. It leverages plenoptic sampling principles to concentrate sampling near depth surfaces and reduces overall ray counts without sacrificing quality, applying the method to ENeRF and MVSGaussian. Experimental results on DTU, Real Forward-facing, and NeRF Synthetic datasets show state-of-the-art or competitive quality with substantial speedups, including up to $2\times$ faster rendering and notable PSNR/SSIM/LPIPS improvements. The approach offers a flexible speed-accuracy trade-off via bundle size and depth-guided sampling, with robust ablations confirming the value of sphere-based bundle sampling and joint bundle plus ray-specific representations, while acknowledging limitations in depth estimation-heavy or object-centric scenes.
Abstract
Recent advancements in generalizable novel view synthesis have achieved impressive quality through interpolation between nearby views. However, rendering high-resolution images remains computationally intensive due to the need for dense sampling of all rays. Recognizing that natural scenes are typically piecewise smooth and sampling all rays is often redundant, we propose a novel depth-guided bundle sampling strategy to accelerate rendering. By grouping adjacent rays into a bundle and sampling them collectively, a shared representation is generated for decoding all rays within the bundle. To further optimize efficiency, our adaptive sampling strategy dynamically allocates samples based on depth confidence, concentrating more samples in complex regions while reducing them in smoother areas. When applied to ENeRF, our method achieves up to a 1.27 dB PSNR improvement and a 47% increase in FPS on the DTU dataset. Extensive experiments on synthetic and real-world datasets demonstrate state-of-the-art rendering quality and up to 2x faster rendering compared to existing generalizable methods. Code is available at https://github.com/KLMAV-CUC/GDB-NeRF.
