NeRF-Casting: Improved View-Dependent Appearance with Consistent Reflections
Dor Verbin, Pratul P. Srinivasan, Peter Hedman, Ben Mildenhall, Benjamin Attal, Richard Szeliski, Jonathan T. Barron
TL;DR
NeRF-Casting addresses the challenge of rendering highly specular content with NeRFs by introducing reflection-cone tracing into the rendering pipeline. Instead of evaluating a large view-dependent radiance MLP at every surface point, the method casts a small set of reflected rays through the scene and decodes a compact reflection feature into color, enabling consistent near-field and distant reflections with improved photorealism. Key innovations include conical reflection features, directional unscented sampling, 2D directional downweighting to prevent aliasing, an asymmetric predicted-normal loss to regularize geometry, and a multi-cone strategy that yields accurate and stable reflections across views. The approach achieves state-of-the-art results on shiny real and synthetic scenes while maintaining comparable optimization times to existing view-synthesis models, demonstrating practical impact for realistic rendering of glossy materials in complex environments.
Abstract
Neural Radiance Fields (NeRFs) typically struggle to reconstruct and render highly specular objects, whose appearance varies quickly with changes in viewpoint. Recent works have improved NeRF's ability to render detailed specular appearance of distant environment illumination, but are unable to synthesize consistent reflections of closer content. Moreover, these techniques rely on large computationally-expensive neural networks to model outgoing radiance, which severely limits optimization and rendering speed. We address these issues with an approach based on ray tracing: instead of querying an expensive neural network for the outgoing view-dependent radiance at points along each camera ray, our model casts reflection rays from these points and traces them through the NeRF representation to render feature vectors which are decoded into color using a small inexpensive network. We demonstrate that our model outperforms prior methods for view synthesis of scenes containing shiny objects, and that it is the only existing NeRF method that can synthesize photorealistic specular appearance and reflections in real-world scenes, while requiring comparable optimization time to current state-of-the-art view synthesis models.
