AnyStyle: Single-Pass Multimodal Stylization for 3D Gaussian Splatting
Joanna Kaleta, Bartosz Świrta, Kacper Kania, Przemysław Spurek, Marek Kowalski
TL;DR
AnyStyle tackles fast, pose-free stylization of 3D scenes represented by 3D Gaussian Splatting by introducing a modular, architecture-agnostic Style Branch that couples with a frozen AnySplat backbone. Style control is multimodal and zero-shot, driven by Long-CLIP embeddings that support both text and image inputs and allow smooth interpolation between styles. A zero-initialized Style Injection mechanism enables additive conditioning without retraining the backbone, preserving geometry while achieving expressive appearance changes; CLIP-based and perceptual losses guide stylization across views. Empirical evaluation shows state-of-the-art stylization quality among feed-forward methods, strong multi-view consistency, and clear benefits from text-conditioned control and style interpolation, enabling practical, flexible 3D stylization for rapid asset creation.
Abstract
The growing demand for rapid and scalable 3D asset creation has driven interest in feed-forward 3D reconstruction methods, with 3D Gaussian Splatting (3DGS) emerging as an effective scene representation. While recent approaches have demonstrated pose-free reconstruction from unposed image collections, integrating stylization or appearance control into such pipelines remains underexplored. Existing attempts largely rely on image-based conditioning, which limits both controllability and flexibility. In this work, we introduce AnyStyle, a feed-forward 3D reconstruction and stylization framework that enables pose-free, zero-shot stylization through multimodal conditioning. Our method supports both textual and visual style inputs, allowing users to control the scene appearance using natural language descriptions or reference images. We propose a modular stylization architecture that requires only minimal architectural modifications and can be integrated into existing feed-forward 3D reconstruction backbones. Experiments demonstrate that AnyStyle improves style controllability over prior feed-forward stylization methods while preserving high-quality geometric reconstruction. A user study further confirms that AnyStyle achieves superior stylization quality compared to an existing state-of-the-art approach. Repository: https://github.com/joaxkal/AnyStyle.
