StyleBrush: Style Extraction and Transfer from a Single Image
Wancheng Feng, Wanquan Feng, Dawei Huang, Jiaming Pei, Guangliang Cheng, Lukun Wang
TL;DR
StyleBrush tackles reference-image guided stylization with a diffusion-based, two-branch framework that separates style from content structure. A ReferenceNet extracts style while a Structure Guider preserves geometry, using grayscale+blur inputs and cross-attention within a Stable Diffusion backbone, with video extension via Animatediff and a controllable style-strength parameter. The authors construct a 100K high-quality style image dataset through LLM-generated prompts and Kolors, enabling training from single-image crops and without per-style optimization. Empirical results on qualitative, quantitative, and user-based metrics show state-of-the-art performance and strong video consistency, highlighting the method's practicality for image and video stylization with flexible strength control.
Abstract
Stylization for visual content aims to add specific style patterns at the pixel level while preserving the original structural features. Compared with using predefined styles, stylization guided by reference style images is more challenging, where the main difficulty is to effectively separate style from structural elements. In this paper, we propose StyleBrush, a method that accurately captures styles from a reference image and ``brushes'' the extracted style onto other input visual content. Specifically, our architecture consists of two branches: ReferenceNet, which extracts style from the reference image, and Structure Guider, which extracts structural features from the input image, thus enabling image-guided stylization. We utilize LLM and T2I models to create a dataset comprising 100K high-quality style images, encompassing a diverse range of styles and contents with high aesthetic score. To construct training pairs, we crop different regions of the same training image. Experiments show that our approach achieves state-of-the-art results through both qualitative and quantitative analyses. We will release our code and dataset upon acceptance of the paper.
