InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity
Jiabin Liang, Lanqing Zhang, Zhuoran Zhao, Xiangyu Xu
TL;DR
InfNeRF introduces an octree-based Level-of-Detail framework for NeRFs to render large-scale scenes with $O(\log n)$ memory during rendering while keeping NeRF fidelity. Each octree node hosts a NeRF and sampling points are routed to the level that matches their spatial scale, enabling efficient memory access and reduced aliasing through hierarchical smoothing. It adds a SfM-driven pruning strategy to obtain a compact, adaptive tree and a distributed training approach that preserves $O(n)$ training complexity, allowing scalable, multi-device training. Empirical results on four large urban drone datasets and the MipNeRF 360 garden scene show strong memory efficiency and improved multi-resolution rendering, including up to $2.4$ dB PSNR gains over baselines. The approach is versatile, compatible with various NeRF backbones, and sets a foundation for scalable large-scale neural scene representations with LoD octrees.
Abstract
The conventional mesh-based Level of Detail (LoD) technique, exemplified by applications such as Google Earth and many game engines, exhibits the capability to holistically represent a large scene even the Earth, and achieves rendering with a space complexity of O(log n). This constrained data requirement not only enhances rendering efficiency but also facilitates dynamic data fetching, thereby enabling a seamless 3D navigation experience for users. In this work, we extend this proven LoD technique to Neural Radiance Fields (NeRF) by introducing an octree structure to represent the scenes in different scales. This innovative approach provides a mathematically simple and elegant representation with a rendering space complexity of O(log n), aligned with the efficiency of mesh-based LoD techniques. We also present a novel training strategy that maintains a complexity of O(n). This strategy allows for parallel training with minimal overhead, ensuring the scalability and efficiency of our proposed method. Our contribution is not only in extending the capabilities of existing techniques but also in establishing a foundation for scalable and efficient large-scale scene representation using NeRF and octree structures.
