Data Augmentation for NeRFs in the Low Data Limit
Ayush Gaggar, Todd D. Murphey
TL;DR
This work tackles the challenge of training Neural Radiance Fields (NeRFs) with very limited and partially observed data in robotic settings. It introduces a data augmentation framework that samples new training views from a posterior uncertainty distribution combining in-distribution entropy and out-of-distribution spatial coverage, implemented via rejection sampling on a hemispherical candidate set. The uncertainty distribution is defined as $U(r(s)) = H(r(s))_{ent} + D(r(s))_{dist}$, with $H$ capturing ID uncertainty and $D$ measuring SE(3) pose-distance to training views; sampling is performed by accepting candidate rays with probability proportional to this uncertainty. Empirical results on Blender scenes show the method outperforms state-of-the-art baselines in PSNR, LPIPS, and SSIM while exhibiting markedly lower variability, demonstrating strong data-efficiency and robustness for NeRFs in resource-constrained, partially observed environments. The approach is end-to-end and transformable to existing NeRF architectures without requiring pre-training, making it practical for real-world robotic applications where informative data is expensive or scarce.
Abstract
Current methods based on Neural Radiance Fields fail in the low data limit, particularly when training on incomplete scene data. Prior works augment training data only in next-best-view applications, which lead to hallucinations and model collapse with sparse data. In contrast, we propose adding a set of views during training by rejection sampling from a posterior uncertainty distribution, generated by combining a volumetric uncertainty estimator with spatial coverage. We validate our results on partially observed scenes; on average, our method performs 39.9% better with 87.5% less variability across established scene reconstruction benchmarks, as compared to state of the art baselines. We further demonstrate that augmenting the training set by sampling from any distribution leads to better, more consistent scene reconstruction in sparse environments. This work is foundational for robotic tasks where augmenting a dataset with informative data is critical in resource-constrained, a priori unknown environments. Videos and source code are available at https://murpheylab.github.io/low-data-nerf/.
