Object-Centric Neural Scene Rendering
Michelle Guo, Alireza Fathi, Jiajun Wu, Thomas Funkhouser
TL;DR
This work tackles photorealistic rendering of dynamic scenes where objects and lighting move, a scenario where NeRF-like radiance fields struggle due to static illumination assumptions. It introduces object-centric neural scattering functions (OSFs), a per-object neural network $F_\Theta$ that maps $(\mathbf{x}, \boldsymbol{\omega_l}, \boldsymbol{\omega_o})$ to $(\sigma, \boldsymbol{\rho})$, enabling reuse of object assets across different scene configurations. OSFs are integrated with volumetric path tracing to model inter-object light transport, including shadows and indirect illumination, without retraining when scene arrangements change. Experiments on Furniture datasets show that OSFs better disentangle lighting from viewpoint, reproduce shadows and specularities, and render complex lighting scenarios more accurately than NeRF-based baselines, highlighting the practical potential of combining implicit object models with classical rendering techniques.
Abstract
We present a method for composing photorealistic scenes from captured images of objects. Our work builds upon neural radiance fields (NeRFs), which implicitly model the volumetric density and directionally-emitted radiance of a scene. While NeRFs synthesize realistic pictures, they only model static scenes and are closely tied to specific imaging conditions. This property makes NeRFs hard to generalize to new scenarios, including new lighting or new arrangements of objects. Instead of learning a scene radiance field as a NeRF does, we propose to learn object-centric neural scattering functions (OSFs), a representation that models per-object light transport implicitly using a lighting- and view-dependent neural network. This enables rendering scenes even when objects or lights move, without retraining. Combined with a volumetric path tracing procedure, our framework is capable of rendering both intra- and inter-object light transport effects including occlusions, specularities, shadows, and indirect illumination. We evaluate our approach on scene composition and show that it generalizes to novel illumination conditions, producing photorealistic, physically accurate renderings of multi-object scenes.
