CAD-NeRF: Learning NeRFs from Uncalibrated Few-view Images by CAD Model Retrieval
Xin Wen, Xuening Zhu, Renjiao Yi, Zhifeng Wang, Chenyang Zhu, Kai Xu
TL;DR
CAD-NeRF addresses reconstructing neural radiance fields from very few uncalibrated views by leveraging a ShapeNet-based CAD mini-library to bootstrap initial geometry and poses. It introduces a multi-view pose retrieval strategy that respects input order, a deformation-based density refinement, and joint optimization of density, pose, and texture in a self-supervised framework. Evaluations on synthetic and real data show CAD-NeRF achieving robust, high-quality novel-view synthesis and outperforming several state-of-the-art few-shot NeRF methods under extreme data scarcity. The approach broadens NeRF applicability to casual, uncalibrated photo collections by exploiting priors from CAD models.
Abstract
Reconstructing from multi-view images is a longstanding problem in 3D vision, where neural radiance fields (NeRFs) have shown great potential and get realistic rendered images of novel views. Currently, most NeRF methods either require accurate camera poses or a large number of input images, or even both. Reconstructing NeRF from few-view images without poses is challenging and highly ill-posed. To address this problem, we propose CAD-NeRF, a method reconstructed from less than 10 images without any known poses. Specifically, we build a mini library of several CAD models from ShapeNet and render them from many random views. Given sparse-view input images, we run a model and pose retrieval from the library, to get a model with similar shapes, serving as the density supervision and pose initializations. Here we propose a multi-view pose retrieval method to avoid pose conflicts among views, which is a new and unseen problem in uncalibrated NeRF methods. Then, the geometry of the object is trained by the CAD guidance. The deformation of the density field and camera poses are optimized jointly. Then texture and density are trained and fine-tuned as well. All training phases are in self-supervised manners. Comprehensive evaluations of synthetic and real images show that CAD-NeRF successfully learns accurate densities with a large deformation from retrieved CAD models, showing the generalization abilities.
