NOVA-3D: Non-overlapped Views for 3D Anime Character Reconstruction
Hongsheng Wang, Nanjie Yao, Xinrui Zhou, Shengyu Zhang, Huahao Xu, Fei Wu, Feng Lin
TL;DR
This work tackles the problem of reconstructing full-body 3D anime characters from non-overlapped front and back views, addressing data scarcity and the inapplicability of conventional NeRF-based methods. It introduces NOVA-3D, a GAN-based pipeline that uses a dual-viewpoint encoder and direction-aware attention to synthesize full-body 3D characters via tri-plane representations, trained with the NOVA-Human dataset of 10.2k models and 163.2k images with calibrated camera parameters. A composite loss comprising reconstruction, adversarial, and regularization terms enforces high fidelity and geometric consistency. Experiments show that NOVA-3D outperforms single-view and multi-view baselines in both head and full-body reconstruction, delivering richer details and fewer artifacts, and demonstrates strong generalization on the NOVA-Human data. The NOVA-Human dataset and accompanying code release aim to accelerate research and practical adoption for automated 3D anime character production.
Abstract
In the animation industry, 3D modelers typically rely on front and back non-overlapped concept designs to guide the 3D modeling of anime characters. However, there is currently a lack of automated approaches for generating anime characters directly from these 2D designs. In light of this, we explore a novel task of reconstructing anime characters from non-overlapped views. This presents two main challenges: existing multi-view approaches cannot be directly applied due to the absence of overlapping regions, and there is a scarcity of full-body anime character data and standard benchmarks. To bridge the gap, we present Non-Overlapped Views for 3D \textbf{A}nime Character Reconstruction (NOVA-3D), a new framework that implements a method for view-aware feature fusion to learn 3D-consistent features effectively and synthesizes full-body anime characters from non-overlapped front and back views directly. To facilitate this line of research, we collected the NOVA-Human dataset, which comprises multi-view images and accurate camera parameters for 3D anime characters. Extensive experiments demonstrate that the proposed method outperforms baseline approaches, achieving superior reconstruction of anime characters with exceptional detail fidelity. In addition, to further verify the effectiveness of our method, we applied it to the animation head reconstruction task and improved the state-of-the-art baseline to 94.453 in SSIM, 7.726 in LPIPS, and 19.575 in PSNR on average. Codes and datasets are available at https://wanghongsheng01.github.io/NOVA-3D/.
