Deep Reflectance Volumes: Relightable Reconstructions from Multi-View Photometric Images
Sai Bi, Zexiang Xu, Kalyan Sunkavalli, Miloš Hašan, Yannick Hold-Geoffroy, David Kriegman, Ravi Ramamoorthi
TL;DR
This paper addresses relighting and view synthesis of real scenes from unstructured multi-view photographs captured with collocated camera and flash. It introduces Deep Reflectance Volumes, a neural volumetric representation consisting of $\alpha$ (opacity), $\mathbf{n}$ (normals), and $R$ (BRDF-based reflectance) volumes, learned through a physically based differentiable volume ray marching renderer. A decoder-like network with a learnable warping function $W$ maps a 512-channel scene encoding to voxel volumes, enabling joint optimization of geometry and spatially varying materials. The framework supports relighting under novel lighting and arbitrary viewpoints, performs view synthesis and material editing, and outperforms state-of-the-art mesh-based methods on challenging scenes with occlusions and specularities. Together, the approach offers practical, off-the-shelf data capture and photorealistic rendering suitable for VR/AR visualization and immersive applications.
Abstract
We present a deep learning approach to reconstruct scene appearance from unstructured images captured under collocated point lighting. At the heart of Deep Reflectance Volumes is a novel volumetric scene representation consisting of opacity, surface normal and reflectance voxel grids. We present a novel physically-based differentiable volume ray marching framework to render these scene volumes under arbitrary viewpoint and lighting. This allows us to optimize the scene volumes to minimize the error between their rendered images and the captured images. Our method is able to reconstruct real scenes with challenging non-Lambertian reflectance and complex geometry with occlusions and shadowing. Moreover, it accurately generalizes to novel viewpoints and lighting, including non-collocated lighting, rendering photorealistic images that are significantly better than state-of-the-art mesh-based methods. We also show that our learned reflectance volumes are editable, allowing for modifying the materials of the captured scenes.
