SpecNeRF: Gaussian Directional Encoding for Specular Reflections
Li Ma, Vasu Agrawal, Haithem Turki, Changil Kim, Chen Gao, Pedro Sander, Michael Zollhöfer, Christian Richardt
TL;DR
This work targets accurate modeling of view-dependent specular reflections in NeRFs under near-field indoor lighting, where prior approaches assuming distant illumination struggle. It introduces Gaussian directional encoding using a learnable set of 3D Gaussians with parameters ${\boldsymbol\mu}_i$, ${\boldsymbol\sigma}_i$, and ${\mathbf{q}}_i$ to map a ray origin and direction to a 5D embedding, enabling preconvolved specular color to be predicted by a lightweight MLP while allowing roughness to modulate high-frequency content via ${\boldsymbol\sigma}_i \leftarrow \rho {\boldsymbol\sigma}_i$. To address shape–radiance ambiguity, a data-driven monocular normal prior is trained early with ${\mathcal L}_{\text{mono}}$ and then gradually removed, improving normals and specular alignment. Experiments on indoor near-field datasets show improved specular reconstruction and more meaningful color decomposition compared to baselines, with strong performance on Eyeful Tower and competitive results on related datasets, validating both the representation and the initialization strategy. Overall, the approach enables practical near-field relighting, reflection removal, and roughness editing while maintaining tractable computation through a moderate number of Gaussians.
Abstract
Neural radiance fields have achieved remarkable performance in modeling the appearance of 3D scenes. However, existing approaches still struggle with the view-dependent appearance of glossy surfaces, especially under complex lighting of indoor environments. Unlike existing methods, which typically assume distant lighting like an environment map, we propose a learnable Gaussian directional encoding to better model the view-dependent effects under near-field lighting conditions. Importantly, our new directional encoding captures the spatially-varying nature of near-field lighting and emulates the behavior of prefiltered environment maps. As a result, it enables the efficient evaluation of preconvolved specular color at any 3D location with varying roughness coefficients. We further introduce a data-driven geometry prior that helps alleviate the shape radiance ambiguity in reflection modeling. We show that our Gaussian directional encoding and geometry prior significantly improve the modeling of challenging specular reflections in neural radiance fields, which helps decompose appearance into more physically meaningful components.
