NeLF-Pro: Neural Light Field Probes for Multi-Scale Novel View Synthesis
Zinuo You, Andreas Geiger, Anpei Chen
TL;DR
NeLF-Pro tackles scalable, high-fidelity novel view synthesis across scenes of varying scale by representing a 3D scene as local light field probes and applying a Vector-Matrix-Matrix (VMM) factorization that shares a core representation while keeping probe-specific bases. The method queries a small set of camera-adjacent probes via soft blending and mipmap-like hierarchical sampling to render density and radiance with occlusion awareness, using a differentiable decoder to map fused features to $\sigma$ and $\mathcal{L}(\mathbf{x},\mathbf{d})$. It combines volumetric light field rendering with continuous factorization, local coordinate transforms, and permutation-invariant blending to achieve fast optimization and high fidelity across small to large-scale datasets, outperforming many grid-based baselines while maintaining compact models. The approach demonstrates strong performance on mip-NeRF360, Free, KITTI-360, and large-scale Google Earth–level scenes, with notably faster training times for large reconstructions and robust handling of multi-scale geometry.
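To make the idea of soft, permutation-invariant blending over camera-adjacent probes concrete, the following is a minimal sketch, not the authors' implementation. The function names, the softmax-over-negative-distance weighting, the `temperature` parameter, and the toy feature vectors are all assumptions made for illustration; the summary above does not specify the actual weighting function.

```python
import math

def blend_weights(camera, probe_positions, k=2, temperature=1.0):
    """Pick the k probes nearest to the camera and return {probe_index: weight}.

    Assumption: weights are a softmax over negative Euclidean distance.
    """
    dists = sorted((math.dist(camera, p), i) for i, p in enumerate(probe_positions))
    nearest = dists[:k]
    logits = [-d / temperature for d, _ in nearest]
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(l - peak) for l in logits]
    total = sum(exps)
    return {i: e / total for (_, i), e in zip(nearest, exps)}

def blend_features(weights, probe_features):
    """Weighted sum of per-probe feature vectors.

    A weighted sum does not depend on the order in which probes are visited,
    which is what makes the blend permutation-invariant.
    """
    dim = len(probe_features[0])
    fused = [0.0] * dim
    for idx, w in weights.items():
        for c in range(dim):
            fused[c] += w * probe_features[idx][c]
    return fused

# Toy scene: three probes, two of them equally close to the camera.
probes = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (5.0, 5.0, 5.0)]
features = [[1.0, 0.0], [0.0, 1.0], [9.0, 9.0]]  # hypothetical 2-channel features
weights = blend_weights((0.0, 0.0, 0.0), probes, k=2)
fused = blend_features(weights, features)
```

In this toy setup the distant third probe is never queried, and the two equidistant probes contribute equally to the fused feature. In the method described above, a decoder would then map such a fused feature to density and radiance.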
Abstract
We present NeLF-Pro, a novel representation to model and reconstruct light fields in diverse natural scenes that vary in extent and spatial granularity. In contrast to previous fast reconstruction methods that represent the 3D scene globally, we model the light field of a scene as a set of local light field feature probes, parameterized with position and multi-channel 2D feature maps. Our central idea is to bake the scene's light field into spatially varying learnable representations and to query point features by weighted blending of probes close to the camera, which allows for mipmap representation and rendering. We introduce a novel vector-matrix-matrix (VMM) factorization technique that effectively represents the light field feature probes as products of core factors (i.e., VM) shared among local feature probes and a basis factor (i.e., M), efficiently encoding internal relationships and patterns within the scene. Experimentally, we demonstrate that NeLF-Pro significantly boosts the performance of feature grid-based representations and achieves fast reconstruction with better rendering quality while maintaining compact modeling. Project webpage: https://sinoyou.github.io/nelf-pro/.
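The VMM factorization can be illustrated with a small sketch of how shared core factors and a per-probe basis factor might combine into one probe's feature map. This is a toy reading of the abstract, not the paper's formulation: the shapes (pixels x rank for the core matrix, rank x channels for the basis), the helper names, and all numbers are hypothetical.

```python
def matmul(a, b):
    """Multiply two matrices given as nested lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def probe_feature_map(core_vector, core_matrix, basis_matrix):
    """Reconstruct one probe's (pixels x channels) feature map.

    Assumed layout: core_vector has length rank, core_matrix is pixels x rank
    (both shared across probes), basis_matrix is rank x channels (per probe).
    """
    # Shared "VM" part: scale each rank component of the core matrix.
    scaled = [[m * v for m, v in zip(row, core_vector)] for row in core_matrix]
    # Probe-specific "M" part: project rank components onto feature channels.
    return matmul(scaled, basis_matrix)

def param_counts(num_probes, pixels, channels, rank):
    """Compare dense per-probe feature maps with the factored storage above."""
    dense = num_probes * pixels * channels
    factored = rank + pixels * rank + num_probes * rank * channels
    return dense, factored

core_vector = [2.0, 1.0]                             # shared, rank 2
core_matrix = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]   # shared, 3 pixels x rank 2
basis = [[1.0, 2.0], [3.0, 4.0]]                     # one probe, rank 2 x 2 channels

feature_map = probe_feature_map(core_vector, core_matrix, basis)
dense, factored = param_counts(num_probes=100, pixels=1024, channels=16, rank=8)
```

Under these hypothetical sizes, storing 100 dense probe maps would take 1,638,400 values, while the factored form takes 21,000, which gives a sense of why sharing core factors can keep the model compact.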
