FewShotNeRF: Meta-Learning-based Novel View Synthesis for Rapid Scene-Specific Adaptation
Piraveen Sivakumar, Paul Janson, Jathushan Rajasegaran, Thanuja Ambegoda
TL;DR
FewShotNeRF tackles the challenge of synthesizing novel views from limited multi-view images by meta-learning a NeRF initialization that can rapidly adapt to a new scene. It leverages gradient-based meta-learning (Reptile) together with hash-encoded representations to distill a robust 3D prior from many scenes, enabling efficient inner-loop NeRF optimization with as few as 2–6 views. Evaluated on the CO3D dataset with real-world objects, it demonstrates competitive performance against strong baselines and affirms the viability of 3D priors learned via meta-learning without external priors. The approach reduces data and compute demands for per-scene NeRF fitting, enabling scalable, rapid scene-specific view synthesis in practical settings.
Abstract
In this paper, we address the challenge of generating novel views of real-world objects with limited multi-view images through our proposed approach, FewShotNeRF. Our method utilizes meta-learning to acquire optimal initialization, facilitating rapid adaptation of a Neural Radiance Field (NeRF) to specific scenes. The focus of our meta-learning process is on capturing shared geometry and textures within a category, embedded in the weight initialization. This approach expedites the learning process of NeRFs and leverages recent advancements in positional encodings to reduce the time required for fitting a NeRF to a scene, thereby accelerating the inner loop optimization of meta-learning. Notably, our method enables meta-learning on a large number of 3D scenes to establish a robust 3D prior for various categories. Through extensive evaluations on the Common Objects in 3D open source dataset, we empirically demonstrate the efficacy and potential of meta-learning in generating high-quality novel views of objects.
