Expansive Supervision for Neural Radiance Field
Weixiang Zhang, Shuzhao Xie, Shijia Ge, Wei Yao, Chen Tang, Zhi Wang
TL;DR
This work tackles the computational bottleneck of Neural Radiance Field (NeRF) training with Expansive Supervision, which renders only a small subset of pixels per iteration and expands their errors to estimate the full loss. The approach builds on the observation that training errors follow a long-tail distribution correlated with image content. An anchor area extractor and a source area sampling strategy together form a supervision set $R'$, and an expansive loss $\hat{L}$ guides training while reducing compute and memory. Integrated with existing explicit caching acceleration frameworks, the method achieves 52% memory savings and 16% time savings with comparable rendering quality. It also extends to Implicit Neural Representations for images, with strong performance gains in early and mid training. Overall, Expansive Supervision offers a practical, scalable path to faster and more memory-efficient NeRF training for high-fidelity novel view synthesis.
Abstract
Neural Radiance Field (NeRF) has achieved remarkable success in creating immersive media representations through its exceptional reconstruction capabilities. However, the computational demands of dense forward passes and volume rendering during training continue to challenge its real-world applications. In this paper, we introduce Expansive Supervision to reduce time and memory costs during NeRF training from the perspective of partial ray selection for supervision. Specifically, we observe that training errors exhibit a long-tail distribution correlated with image content. Based on this observation, our method selectively renders a small but crucial subset of pixels and expands their values to estimate errors across the entire area for each iteration. Compared to conventional supervision, our approach effectively bypasses redundant rendering processes, resulting in substantial reductions in both time and memory consumption. Experimental results demonstrate that integrating Expansive Supervision within existing state-of-the-art acceleration frameworks achieves 52% memory savings and 16% time savings while maintaining comparable visual quality.
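The idea of rendering a small subset of pixels and expanding their errors to the whole image can be illustrated with a minimal NumPy sketch. This is not the paper's extractor or its expansive loss. It assumes a gradient-magnitude cue as a stand-in for content-aware anchor selection, uniform random sampling for the remaining pixels, and a standard stratified-sampling estimator for the full-image error. All function names and the parameters `anchor_frac` and `source_frac` are hypothetical.

```python
import numpy as np


def anchor_mask(image, anchor_frac):
    """Flag the top `anchor_frac` of pixels by gradient magnitude.

    Assumption: gradient magnitude is only a stand-in content cue,
    not the anchor area extractor described in the paper.
    """
    gray = image.mean(axis=-1)
    gy, gx = np.gradient(gray)
    mag = np.hypot(gx, gy).ravel()
    k = max(1, int(round(anchor_frac * mag.size)))
    top = np.argpartition(mag, -k)[-k:]  # indices of the k largest values
    mask = np.zeros(mag.size, dtype=bool)
    mask[top] = True
    return mask


def select_supervision_set(image, anchor_frac, source_frac, rng):
    """Return flat pixel indices for the anchor and source subsets."""
    mask = anchor_mask(image, anchor_frac)
    anchor_idx = np.flatnonzero(mask)
    rest_idx = np.flatnonzero(~mask)
    m = min(max(1, int(round(source_frac * mask.size))), rest_idx.size)
    # Assumption: uniform sampling without replacement from non-anchor pixels.
    source_idx = rng.choice(rest_idx, size=m, replace=False)
    return anchor_idx, source_idx


def estimated_full_loss(pred_anchor, pred_source, target_flat,
                        anchor_idx, source_idx):
    """Stratified estimate of the mean squared error over all pixels.

    Each subset's mean error is weighted by the share of the image
    that subset stands for (a swapped-in estimator, not the paper's loss).
    """
    n = target_flat.shape[0]
    err_a = ((pred_anchor - target_flat[anchor_idx]) ** 2).mean(axis=-1)
    if source_idx.size == 0:
        return err_a.mean()
    err_s = ((pred_source - target_flat[source_idx]) ** 2).mean(axis=-1)
    w_a = anchor_idx.size / n
    return w_a * err_a.mean() + (1.0 - w_a) * err_s.mean()


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    target = rng.random((32, 32, 3))
    flat = target.reshape(-1, 3)
    a_idx, s_idx = select_supervision_set(target, 0.1, 0.2, rng)
    # Only the selected pixels are "rendered"; here rendering is faked with noise.
    pred_a = flat[a_idx] + 0.05 * rng.standard_normal((a_idx.size, 3))
    pred_s = flat[s_idx] + 0.05 * rng.standard_normal((s_idx.size, 3))
    print(estimated_full_loss(pred_a, pred_s, flat, a_idx, s_idx))
```

In an actual training loop, `pred_anchor` and `pred_source` would come from rendering only the rays of the selected pixels, which is where the time and memory savings would arise. The weights here simply follow pixel counts and make no claim about how the paper balances the two areas.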
