NECromancer: Breathing Life into Skeletons via BVH Animation
Mingxi Xu, Qi Wang, Zhengyu Wen, Phong Dao Thien, Zhengyu Li, Ning Zhang, Xiaoyu He, Wei Zhao, Kehong Gong, Mingyuan Zhang
TL;DR
This work addresses the challenge of universal motion understanding across diverse morphologies by introducing NECromancer, a topology-invariant BVH motion tokenizer. It combines OwO, a graph-based, ontology-aware skeletal encoder, with TAT, a topology-agnostic tokenizer that relies on a virtual joint and RVQ to generate discrete tokens independent of skeleton topology, all trained and evaluated on the Unified BVH Universe (UvU) dataset of 47,807 sequences. The approach enables high-fidelity reconstruction under compression, cross-skeleton motion transfer, and text–motion retrieval, demonstrating robust generalization across humans, animals, and fantasy morphologies. The combined framework promises a scalable, cross-species foundation for 4D animation and motion synthesis, with potential applications in cross-domain content creation and robotics, while acknowledging computational and data richness challenges.
Abstract
Motion tokenization is a key component of generalizable motion models, yet most existing approaches are restricted to species-specific skeletons, limiting their applicability across diverse morphologies. We propose NECromancer (NEC), a universal motion tokenizer that operates directly on arbitrary BVH skeletons. NEC consists of three components: (1) an Ontology-aware Skeletal Graph Encoder (OwO) that encodes structural priors from BVH files, including joint semantics, rest-pose offsets, and skeletal topology, into skeletal embeddings; (2) a Topology-Agnostic Tokenizer (TAT) that compresses motion sequences into a universal, topology-invariant discrete representation; and (3) the Unified BVH Universe (UvU), a large-scale dataset aggregating BVH motions across heterogeneous skeletons. Experiments show that NEC achieves high-fidelity reconstruction under substantial compression and effectively disentangles motion from skeletal structure. The resulting token space supports cross-species motion transfer, composition, denoising, generation with token-based models, and text-motion retrieval, establishing a unified framework for motion analysis and synthesis across diverse morphologies. Demo page: https://animotionlab.github.io/NECromancer/
