MRStyle: A Unified Framework for Color Style Transfer with Multi-Modality Reference
Jiancheng Huang, Yu Gao, Zequn Jie, Yujie Zhong, Xintong Han, Lin Ma
TL;DR
MRStyle tackles the problem of color style transfer with guidance from either image or text prompts by unifying their style representations. It introduces IRStyle for image references and TRStyle for text references, connected through a shared style space and an efficient priors mapper that leverages Stable Diffusion priors. The approach combines an interaction dual-mapping network and a combined supervised learning pipeline to achieve artifact-free, content-preserving transfers at high resolutions, and uses synthetic data to train the text pathway in an end-to-end manner. Extensive experiments show state-of-the-art performance in both image- and text-guided color transfer, with impressive efficiency, memory economy, and open-set capabilities for text prompts.
Abstract
In this paper, we introduce MRStyle, a comprehensive framework that enables color style transfer using multi-modality reference, including image and text. To achieve a unified style feature space for both modalities, we first develop a neural network called IRStyle, which generates stylized 3D lookup tables for image reference. This is accomplished by integrating an interaction dual-mapping network with a combined supervised learning pipeline, resulting in three key benefits: elimination of visual artifacts, efficient handling of high-resolution images with low memory usage, and maintenance of style consistency even in situations with significant color style variations. For text reference, we align the text feature of stable diffusion priors with the style feature of our IRStyle to perform text-guided color style transfer (TRStyle). Our TRStyle method is highly efficient in both training and inference, producing notable open-set text-guided transfer results. Extensive experiments in both image and text settings demonstrate that our proposed method outperforms the state-of-the-art in both qualitative and quantitative evaluations.
