Spiritus: An AI-Assisted Tool for Creating 2D Characters and Animations
Qirui Sun, Yunyi Ni, Teli Yuan, Jingjing Zhang, Fan Yang, Zhihao Yao, Haipeng Mi
TL;DR
Spiritus addresses the high technical barrier to producing personalized 2D character animation by combining NLP-driven text-to-character generation with skeleton-based animation. The system pipelines text prompts through image generation, SAM-based segmentation, and a non-uniform dynamic mesh with unified rigging to support costume variation, then maps motion data via BVH to 2D space using PCA and diffusion-based motion synthesis. Key contributions include a hierarchical character generation framework, a web-based interface with scene orchestration, and cross-platform interoperability via Spine Runtime, enabling reusable animation assets. The work demonstrates reduced setup complexity, enhanced creative flexibility, and asset universality for rapid animated skit production.
Abstract
This research presents Spiritus, an AI-assisted creation tool designed to streamline 2D character animation creation while enhancing creative flexibility. By integrating natural language processing and diffusion models, users can efficiently transform natural language descriptions into personalized 2D characters and animations. The system employs automated segmentation, layered costume techniques, and dynamic mesh-skeleton binding solutions to support flexible adaptation of complex costumes and additional components. Spiritus further achieves real-time animation generation and efficient animation resource reuse between characters through the integration of BVH data and motion diffusion models. Experimental results demonstrate Spiritus's effectiveness in reducing technical barriers, enhancing creative freedom, and supporting resource universality. Future work will focus on optimizing user experience and further exploring the system's human-computer collaboration potential.
