FrameNeRF: A Simple and Efficient Framework for Few-shot Novel View Synthesis
Yan Xing, Pan Wang, Ligang Liu, Daolun Li, Li Zhang
TL;DR
FrameNeRF addresses the challenge of few-shot novel view synthesis by combining a regularization-based data generator with a fast high-fidelity NeRF. The method uses a three-stage pipeline: generate pseudo-dense views from sparse inputs via a regularization model, train a fast NeRF on these views, and then fine-tune on the original sparse views to refine details. It achieves state-of-the-art results across Blender, LLFF, and DTU benchmarks, demonstrating strong rendering quality and robust multi-view consistency while maintaining fast training. The framework is modular and flexible, enabling the substitution of different regularization and fast-NeRF components to adapt to new data domains and performance targets.
Abstract
We present a novel framework, called FrameNeRF, designed to apply off-the-shelf fast high-fidelity NeRF models with fast training speed and high rendering quality for few-shot novel view synthesis tasks. The training stability of fast high-fidelity models is typically constrained to dense views, making them unsuitable for few-shot novel view synthesis tasks. To address this limitation, we utilize a regularization model as a data generator to produce dense views from sparse inputs, facilitating subsequent training of fast high-fidelity models. Since these dense views are pseudo ground truth generated by the regularization model, original sparse images are then used to fine-tune the fast high-fidelity model. This process helps the model learn realistic details and correct artifacts introduced in earlier stages. By leveraging an off-the-shelf regularization model and a fast high-fidelity model, our approach achieves state-of-the-art performance across various benchmark datasets.
