AniDoc: Animation Creation Made Easier
Yihao Meng, Hao Ouyang, Hanlin Wang, Qiuyu Wang, Wen Wang, Ka Leong Cheng, Zhiheng Liu, Yujun Shen, Huamin Qu
TL;DR
This work tackles automating colorization and in-betweening in 2D anime production by introducing AniDoc, a diffusion-based all-in-one model that uses explicit reference–sketch correspondence to colorize line-art sequences with temporal coherence. It combines correspondence-guided colorization, binarized sketch conditioning, background augmentation, and sparse-sketch training within a Stable Video Diffusion framework to enable high-fidelity colorization from a single character design. Across extensive experiments on Sakuga-42M, AniDoc achieves superior quantitative and qualitative results over state-of-the-art baselines, including robust identity preservation and effective interpolation with sparse inputs. The approach promises practical impact by fitting into existing production pipelines and reducing manual coloring and in-betweening labor for anime and digital art.
Abstract
The production of 2D animation follows an industry-standard workflow, encompassing four essential stages: character design, keyframe animation, in-betweening, and coloring. Our research focuses on reducing the labor costs in the above process by harnessing the potential of increasingly powerful generative AI. Using video diffusion models as the foundation, AniDoc emerges as a video line art colorization tool, which automatically converts sketch sequences into colored animations following the reference character specification. Our model exploits correspondence matching as an explicit guidance, yielding strong robustness to the variations (e.g., posture) between the reference character and each line art frame. In addition, our model could even automate the in-betweening process, such that users can easily create a temporally consistent animation by simply providing a character image as well as the start and end sketches. Our code is available at: https://yihao-meng.github.io/AniDoc_demo.
