Differentiable Inverse Rendering with Interpretable Basis BRDFs
Hoon-Gyu Chung, Seokjun Choi, Seung-Hwan Baek
TL;DR
This work targets the ill-posed problem of recovering geometry and spatially varying BRDFs from images. It introduces a differentiable inverse rendering pipeline that represents geometry with 2D Gaussians and reflectance with a set of basis BRDFs blended per Gaussian, while dynamically adjusting the basis count and enforcing sparsity to yield interpretable, spatially separated BRDFs. A specular-weighted rendering loss further emphasizes challenging highlights, and basis BRDF merging/removal plus scheduling ensure the basis set adapts to scene complexity. The approach yields accurate geometry, scalable interpretable BRDFs, and supports novel-view relighting and intuitive scene editing, with faster training and superior normal reconstruction compared to state-of-the-art methods. Overall, this method provides a practical, interpretable, and adaptable framework for differentiable inverse rendering of complex scenes.
Abstract
Inverse rendering seeks to reconstruct both geometry and spatially varying BRDFs (SVBRDFs) from captured images. To address the inherent ill-posedness of inverse rendering, basis BRDF representations are commonly used, modeling SVBRDFs as spatially varying blends of a set of basis BRDFs. However, existing methods often yield basis BRDFs that lack intuitive separation and have limited scalability to scenes of varying complexity. In this paper, we introduce a differentiable inverse rendering method that produces interpretable basis BRDFs. Our approach models a scene using 2D Gaussians, where the reflectance of each Gaussian is defined by a weighted blend of basis BRDFs. We efficiently render an image from the 2D Gaussians and basis BRDFs using differentiable rasterization and impose a rendering loss with the input images. During this analysis-by-synthesis optimization process of differentiable inverse rendering, we dynamically adjust the number of basis BRDFs to fit the target scene while encouraging sparsity in the basis weights. This ensures that the reflectance of each Gaussian is represented by only a few basis BRDFs. This approach enables the reconstruction of accurate geometry and interpretable basis BRDFs that are spatially separated. Consequently, the resulting scene representation, comprising basis BRDFs and 2D Gaussians, supports physically-based novel-view relighting and intuitive scene editing.
