SWAG: Splatting in the Wild images with Appearance-conditioned Gaussians
Hiba Dahmani, Moussab Bennehar, Nathan Piasco, Luis Roldao, Dzmitry Tsishkou
TL;DR
SWAG tackles in-the-wild 3D scene reconstruction by extending 3D Gaussian Splatting with appearance-conditioned colors and image-dependent opacity variations to handle photometric changes and transient occluders. It introduces an image-conditioned color network that uses per-image embeddings and a center-aware hash encoding, and a Binary Concrete-based opacity mechanism to identify and remove transient objects in an unsupervised manner. Across Phototourism and NeRF-OSR benchmarks, SWAG delivers state-of-the-art rendering quality with significantly faster training and real-time rendering compared to prior in-the-wild methods, while enabling appearance transfer and smooth interpolation in the learned appearance space. Ablation studies validate the respective contributions of appearance modeling and transient handling, and analyses quantify the distribution of transient Gaussians, supporting robust static scene reconstruction with occluder removal in unconstrained photo collections.
Abstract
Implicit neural representation methods have shown impressive advancements in learning 3D scenes from unstructured in-the-wild photo collections but are still limited by the large computational cost of volumetric rendering. More recently, 3D Gaussian Splatting emerged as a much faster alternative with superior rendering quality and training efficiency, especially for small-scale and object-centric scenarios. Nevertheless, this technique suffers from poor performance on unstructured in-the-wild data. To tackle this, we extend over 3D Gaussian Splatting to handle unstructured image collections. We achieve this by modeling appearance to seize photometric variations in the rendered images. Additionally, we introduce a new mechanism to train transient Gaussians to handle the presence of scene occluders in an unsupervised manner. Experiments on diverse photo collection scenes and multi-pass acquisition of outdoor landmarks show the effectiveness of our method over prior works achieving state-of-the-art results with improved efficiency.
