NeRFactor: Neural Factorization of Shape and Reflectance Under an Unknown Illumination
Xiuming Zhang, Pratul P. Srinivasan, Boyang Deng, Paul Debevec, William T. Freeman, Jonathan T. Barron
TL;DR
NeRFactor addresses the challenge of recovering shape and spatially varying reflectance from multi-view images captured under one unknown illumination by combining NeRF-based geometry initialization with a joint optimization over surface normals, light visibility, albedo, BRDF latent codes, and environment lighting. It advances inverse rendering by explicitly modeling visibility and shadows and by introducing a data-driven BRDF prior learned from real measurements, enabling realistic relighting with arbitrary lighting and material editing. The approach yields high-quality geometry suitable for free-viewpoint relighting, supports both synthetic and real scenes, and demonstrates robust performance even when starting from MVS geometry. Together, these innovations move toward robust 3D asset recovery from casual captures, with practical impact on relighting, material editing, and view synthesis in uncontrolled settings.
Abstract
We address the problem of recovering the shape and spatially-varying reflectance of an object from multi-view images (and their camera poses) of an object illuminated by one unknown lighting condition. This enables the rendering of novel views of the object under arbitrary environment lighting and editing of the object's material properties. The key to our approach, which we call Neural Radiance Factorization (NeRFactor), is to distill the volumetric geometry of a Neural Radiance Field (NeRF) [Mildenhall et al. 2020] representation of the object into a surface representation and then jointly refine the geometry while solving for the spatially-varying reflectance and environment lighting. Specifically, NeRFactor recovers 3D neural fields of surface normals, light visibility, albedo, and Bidirectional Reflectance Distribution Functions (BRDFs) without any supervision, using only a re-rendering loss, simple smoothness priors, and a data-driven BRDF prior learned from real-world BRDF measurements. By explicitly modeling light visibility, NeRFactor is able to separate shadows from albedo and synthesize realistic soft or hard shadows under arbitrary lighting conditions. NeRFactor is able to recover convincing 3D models for free-viewpoint relighting in this challenging and underconstrained capture setup for both synthetic and real scenes. Qualitative and quantitative experiments show that NeRFactor outperforms classic and deep learning-based state of the art across various tasks. Our videos, code, and data are available at people.csail.mit.edu/xiuming/projects/nerfactor/.
