DiffStyle360: Diffusion-Based 360° Head Stylization via Style Fusion Attention
Furkan Guzelant, Arda Goktogan, Tarık Kaya, Aysegul Dundar
TL;DR
DiffStyle360 presents a diffusion-based solution for 360° head stylization that preserves identity and ensures cross-view consistency from a single style reference without retraining for each style. It introduces a Style Appearance Module for disentangled style transfer and a Style Fusion Attention mechanism to balance structure and style in the latent space, combined with GAN-derived multiview fine-tuning and a temperature-based key scaling for controllable stylization. The method achieves superior style fidelity, view coherence, and depth-consistency on FFHQ and RenderMe360 benchmarks, with strong user preference in live studies. This work advances 3D head stylization by leveraging diffusion priors and adaptive style integration, enabling arbitrary style transfer with efficient adaptation.
Abstract
3D head stylization has emerged as a key technique for reimagining realistic human heads in various artistic forms, enabling expressive character design and creative visual experiences in digital media. Despite the progress in 3D-aware generation, existing 3D head stylization methods often rely on computationally expensive optimization or domain-specific fine-tuning to adapt to new styles. To address these limitations, we propose DiffStyle360, a diffusion-based framework capable of producing multi-view consistent, identity-preserving 3D head stylizations across diverse artistic domains given a single style reference image, without requiring per-style training. Building upon the 3D-aware DiffPortrait360 architecture, our approach introduces two key components: the Style Appearance Module, which disentangles style from content, and the Style Fusion Attention mechanism, which adaptively balances structure preservation and stylization fidelity in the latent space. Furthermore, we employ a 3D GAN-generated multi-view dataset for robust fine-tuning and introduce a temperaturebased key scaling strategy to control stylization intensity during inference. Extensive experiments on FFHQ and RenderMe360 demonstrate that DiffStyle360 achieves superior style quality, outperforming state-of-the-art GAN- and diffusion-based stylization methods across challenging style domains.
