StyleSplat: 3D Object Style Transfer with Gaussian Splatting
Sahil Jain, Avik Kuthiala, Prabhdeep Singh Sethi, Prakanshul Saxena
TL;DR
StyleSplat addresses the challenge of per-object, fast style transfer in 3D scenes represented by 3D Gaussian splats. It introduces a three-stage pipeline—2D mask generation and tracking, 3D Gaussian training/segmentation, and 3D style transfer—that localizes stylistic changes to user-selected objects by finetuning only the spherical harmonic color coefficients via a nearest-neighbor feature matching loss. The method enables multiple objects to receive different styles with high fidelity and efficiency (often under a minute on a single A100 GPU), outperforming prior approaches in localization accuracy and speed. By leveraging view-consistent 3D segmentation and SH-based color encoding, StyleSplat minimizes leakage and preserves geometric priors while delivering diverse artistic stylizations across varied scenes.
Abstract
Recent advancements in radiance fields have opened new avenues for creating high-quality 3D assets and scenes. Style transfer can enhance these 3D assets with diverse artistic styles, transforming creative expression. However, existing techniques are often slow or unable to localize style transfer to specific objects. We introduce StyleSplat, a lightweight method for stylizing 3D objects in scenes represented by 3D Gaussians from reference style images. Our approach first learns a photorealistic representation of the scene using 3D Gaussian splatting while jointly segmenting individual 3D objects. We then use a nearest-neighbor feature matching loss to finetune the Gaussians of the selected objects, aligning their spherical harmonic coefficients with the style image to ensure consistency and visual appeal. StyleSplat allows for quick, customizable style transfer and localized stylization of multiple objects within a scene, each with a different style. We demonstrate its effectiveness across various 3D scenes and styles, showcasing enhanced control and customization in 3D creation.
