Table of Contents
Fetching ...

AniME: Adaptive Multi-Agent Planning for Long Animation Generation

Lisai Zhang, Baohan Xu, Siqian Yang, Mingyu Yin, Jing Liu, Chao Xu, Siqi Wang, Yidi Wu, Yuxin Hong, Zihao Zhang, Yanzhang Liang, Yudong Jiang

TL;DR

AniME addresses the challenge of automating long-form anime production with cross-stage consistency. It introduces a Director Agent orchestrating Specialized Agents via a Model Context Protocol to adaptively select control conditions and coordinate planning, memory, and quality control. Key contributions include a centralized Asset Memory Bank, explicit MCP-based tool selection for each sub-task, and structured inter-agent JSON communication with revision loops. The approach enables scalable, coherent generation from script to final video with character identity preservation and synchronized audio-visuals.

Abstract

We present AniME, a director-oriented multi-agent system for automated long-form anime production, covering the full workflow from a story to the final video. The director agent keeps a global memory for the whole workflow, and coordinates several downstream specialized agents. By integrating customized Model Context Protocol (MCP) with downstream model instruction, the specialized agent adaptively selects control conditions for diverse sub-tasks. AniME produces cinematic animation with consistent characters and synchronized audio visual elements, offering a scalable solution for AI-driven anime creation.

AniME: Adaptive Multi-Agent Planning for Long Animation Generation

TL;DR

AniME addresses the challenge of automating long-form anime production with cross-stage consistency. It introduces a Director Agent orchestrating Specialized Agents via a Model Context Protocol to adaptively select control conditions and coordinate planning, memory, and quality control. Key contributions include a centralized Asset Memory Bank, explicit MCP-based tool selection for each sub-task, and structured inter-agent JSON communication with revision loops. The approach enables scalable, coherent generation from script to final video with character identity preservation and synchronized audio-visuals.

Abstract

We present AniME, a director-oriented multi-agent system for automated long-form anime production, covering the full workflow from a story to the final video. The director agent keeps a global memory for the whole workflow, and coordinates several downstream specialized agents. By integrating customized Model Context Protocol (MCP) with downstream model instruction, the specialized agent adaptively selects control conditions for diverse sub-tasks. AniME produces cinematic animation with consistent characters and synchronized audio visual elements, offering a scalable solution for AI-driven anime creation.

Paper Structure

This paper contains 21 sections, 2 figures, 3 tables, 1 algorithm.

Figures (2)

  • Figure 1: Brief Architecture of AniME
  • Figure 2: Storyboard keyframes for scene sceneYX01. Each shot shows the corresponding keyframe generated by the Storyboard Agent using selected tools (text-to-image, reference image, layout-guided generation).