GAS-NeRF: Geometry-Aware Stylization of Dynamic Radiance Fields
Nhat Phuong Anh Vu, Abhishek Saroha, Or Litany, Daniel Cremers
TL;DR
GAS-NeRF tackles the gap in 3D scene stylization by jointly optimizing geometry and appearance for dynamic radiance fields. It first transfers geometry from a style image using depth maps, then applies appearance stylization, leveraging a two-stage optimization, NNFM-based losses, and a linear gradient-scaling strategy to suppress near-camera artifacts. The approach builds on a Hexplane dynamic RF and uses depth maps from style images (via ZoeDepth) along with a VGG16-based feature extractor for NNFM losses, achieving improved depth and RGB fidelity and temporal coherence. Experimental results on real and synthetic datasets, plus a user study, demonstrate superior stylization quality and coherence, highlighting the practical potential for dynamic scene editing in applications like games and AR/VR.
Abstract
Current 3D stylization techniques primarily focus on static scenes, while our world is inherently dynamic, filled with moving objects and changing environments. Existing style transfer methods primarily target appearance -- such as color and texture transformation -- but often neglect the geometric characteristics of the style image, which are crucial for achieving a complete and coherent stylization effect. To overcome these shortcomings, we propose GAS-NeRF, a novel approach for joint appearance and geometry stylization in dynamic Radiance Fields. Our method leverages depth maps to extract and transfer geometric details into the radiance field, followed by appearance transfer. Experimental results on synthetic and real-world datasets demonstrate that our approach significantly enhances the stylization quality while maintaining temporal coherence in dynamic scenes.
