MCBlock: Boosting Neural Radiance Field Training Speed by MCTS-based Dynamic-Resolution Ray Sampling
Yunpeng Tan, Junlin Hao, Jiangkai Wu, Liming Liu, Qingyang Li, Xinggong Zhang
TL;DR
MCBlock introduces a Monte-Carlo Tree Search-based dynamic-resolution ray sampling method for NeRF training, enabling blocks of varying pixel sizes to be trained in parallel according to image texture. It initializes a texture-informed block hierarchy, then dynamically expands and prunes blocks while using a redefined UCT to guide selection, achieving up to 2.33x training acceleration with minimal overhead. The approach blends active sampling with multi-resolution block training and demonstrates clear speedups across real-world (Mipnerf360) and synthetic (Blender) datasets, while maintaining competitive rendering quality. Its block-wise, cone-tracing compatible design suggests broad applicability to cone-tracing NeRF variants and real-time multimedia scenarios.
Abstract
Neural Radiance Field (NeRF) is widely known for high-fidelity novel view synthesis. However, even the state-of-the-art NeRF model, Gaussian Splatting, requires minutes for training, far from the real-time performance required by multimedia scenarios like telemedicine. One of the obstacles is its inefficient sampling, which is only partially addressed by existing works. Existing point-sampling algorithms uniformly sample simple-texture regions (easy to fit) and complex-texture regions (hard to fit), while existing ray-sampling algorithms sample these regions all in the finest granularity (i.e. the pixel level), both wasting GPU training resources. Actually, regions with different texture intensities require different sampling granularities. To this end, we propose a novel dynamic-resolution ray-sampling algorithm, MCBlock, which employs Monte Carlo Tree Search (MCTS) to partition each training image into pixel blocks with different sizes for active block-wise training. Specifically, the trees are initialized according to the texture of training images to boost the initialization speed, and an expansion/pruning module dynamically optimizes the block partition. MCBlock is implemented in Nerfstudio, an open-source toolset, and achieves a training acceleration of up to 2.33x, surpassing other ray-sampling algorithms. We believe MCBlock can apply to any cone-tracing NeRF model and contribute to the multimedia community.
