APISR: Anime Production Inspired Real-World Anime Super-Resolution
Boyang Wang, Fengyu Yang, Xihang Yu, Chao Zhang, Hanbin Zhao
TL;DR
Problem: Real-world anime SR suffers when photorealistic SR methods are naively applied to frame-based animation. Approach: APISR uses an image-centric pipeline with API dataset, a prediction-oriented degradation model, hand-drawn line enhancement, and a balanced twin perceptual loss. Contributions: API dataset creation (3,740 images from I-Frames with ICA-based selection and 720P back-original), a degradation model simulating video artifacts from single images, pseudo-GT line enhancement via XDoG, and a loss that blends ResNet50 anime features with photorealistic VGG features. Findings: APISR outperforms SOTA on AVC-RealLQ with a compact model and improved restoration quality, reducing training data needs.
Abstract
While real-world anime super-resolution (SR) has gained increasing attention in the SR community, existing methods still adopt techniques from the photorealistic domain. In this paper, we analyze the anime production workflow and rethink how to use characteristics of it for the sake of the real-world anime SR. First, we argue that video networks and datasets are not necessary for anime SR due to the repetition use of hand-drawing frames. Instead, we propose an anime image collection pipeline by choosing the least compressed and the most informative frames from the video sources. Based on this pipeline, we introduce the Anime Production-oriented Image (API) dataset. In addition, we identify two anime-specific challenges of distorted and faint hand-drawn lines and unwanted color artifacts. We address the first issue by introducing a prediction-oriented compression module in the image degradation model and a pseudo-ground truth preparation with enhanced hand-drawn lines. In addition, we introduce the balanced twin perceptual loss combining both anime and photorealistic high-level features to mitigate unwanted color artifacts and increase visual clarity. We evaluate our method through extensive experiments on the public benchmark, showing our method outperforms state-of-the-art anime dataset-trained approaches.
