ColorFlow: Retrieval-Augmented Image Sequence Colorization
Junhao Zhuang, Xuan Ju, Zhaoyang Zhang, Yong Liu, Shiyi Zhang, Chun Yuan, Ying Shan
TL;DR
ColorFlow tackles the challenge of automatic colorization for black-and-white image sequences while preserving fine-grained color IDs across frames. It introduces a three-stage retrieval-augmented pipeline, an in-context diffusion colorization module with a Colorization Guider, and a guided super-resolution stage to maintain identity consistency and high fidelity. Through ColorFlow-Bench, the method achieves state-of-the-art results across perceptual and pixel-based metrics and garners strong user-study support, demonstrating industrial viability for manga and animation production. The work highlights the importance of robust multi-reference retrieval, context-aware color transfer, and artifact-reducing upscaling for scalable, high-quality sequential colorization, while acknowledging ethical considerations around data bias and content authenticity.
Abstract
Automatic black-and-white image sequence colorization while preserving character and object identity (ID) is a complex task with significant market demand, such as in cartoon or comic series colorization. Despite advancements in visual colorization using large-scale generative models like diffusion models, challenges with controllability and identity consistency persist, making current solutions unsuitable for industrial application.To address this, we propose ColorFlow, a three-stage diffusion-based framework tailored for image sequence colorization in industrial applications. Unlike existing methods that require per-ID finetuning or explicit ID embedding extraction, we propose a novel robust and generalizable Retrieval Augmented Colorization pipeline for colorizing images with relevant color references. Our pipeline also features a dual-branch design: one branch for color identity extraction and the other for colorization, leveraging the strengths of diffusion models. We utilize the self-attention mechanism in diffusion models for strong in-context learning and color identity matching. To evaluate our model, we introduce ColorFlow-Bench, a comprehensive benchmark for reference-based colorization. Results show that ColorFlow outperforms existing models across multiple metrics, setting a new standard in sequential image colorization and potentially benefiting the art industry. We release our codes and models on our project page: https://zhuang2002.github.io/ColorFlow/.
