WildGaussians: 3D Gaussian Splatting in the Wild
Jonas Kulhanek, Songyou Peng, Zuzana Kukelova, Marc Pollefeys, Torsten Sattler
TL;DR
WildGaussians extends 3D Gaussian Splatting to uncontrolled in-the-wild data by integrating per-image and per-Gaussian appearance embeddings and a DINO-based uncertainty predictor. An appearance MLP outputs affine color transforms that condition each Gaussian, while a robust uncertainty loss suppresses occluders during training, enabling accurate rendering under varying illumination and occlusion. The approach preserves real-time rendering and allows appearance to be baked back into the base 3DGS representation, achieving state-of-the-art results on challenging datasets like NeRF On-the-go and Photo Tourism. Sky handling and test-time appearance optimization further enhance robustness, though limitations remain in capturing highlights and extremely occluded regions.
Abstract
While the field of 3D scene reconstruction is dominated by NeRFs due to their photorealistic quality, 3D Gaussian Splatting (3DGS) has recently emerged, offering similar quality with real-time rendering speeds. However, both methods primarily excel with well-controlled 3D scenes, while in-the-wild data - characterized by occlusions, dynamic objects, and varying illumination - remains challenging. NeRFs can adapt to such conditions easily through per-image embedding vectors, but 3DGS struggles due to its explicit representation and lack of shared parameters. To address this, we introduce WildGaussians, a novel approach to handle occlusions and appearance changes with 3DGS. By leveraging robust DINO features and integrating an appearance modeling module within 3DGS, our method achieves state-of-the-art results. We demonstrate that WildGaussians matches the real-time rendering speed of 3DGS while surpassing both 3DGS and NeRF baselines in handling in-the-wild data, all within a simple architectural framework.
