UniMoGen: Universal Motion Generation
Aliasghar Khani, Arianna Rampini, Evan Atherton, Bruno Roy
TL;DR
UniMoGen tackles the challenge of skeleton-agnostic motion generation by introducing a UNet-based diffusion model that processes variable joint counts without padding. It supports auto-regressive generation conditioned on style, trajectory, and past frames, using temporal downsampling, joint-wise attention with topology-aware masking, and FiLM conditioning, enabling real-time synthesis across diverse skeletons. Across 100style and the combined 100style+LAFAN1 datasets, UniMoGen outperforms state-of-the-art diffusion and skeleton-agnostic baselines in realism, diversity, and physical plausibility, while significantly reducing computational overhead. The work demonstrates accurate trajectory following, smooth long-horizon motion, and effective style blending, highlighting its potential for versatile animation, gaming, and robotics applications.
Abstract
Motion generation is a cornerstone of computer graphics, animation, gaming, and robotics, enabling the creation of realistic and varied character movements. A significant limitation of existing methods is their reliance on specific skeletal structures, which restricts their versatility across different characters. To overcome this, we introduce UniMoGen, a novel UNet-based diffusion model designed for skeleton-agnostic motion generation. UniMoGen can be trained on motion data from diverse characters, such as humans and animals, without the need for a predefined maximum number of joints. By dynamically processing only the necessary joints for each character, our model achieves both skeleton agnosticism and computational efficiency. Key features of UniMoGen include controllability via style and trajectory inputs, and the ability to continue motions from past frames. We demonstrate UniMoGen's effectiveness on the 100style dataset, where it outperforms state-of-the-art methods in diverse character motion generation. Furthermore, when trained on both the 100style and LAFAN1 datasets, which use different skeletons, UniMoGen achieves high performance and improved efficiency across both skeletons. These results highlight UniMoGen's potential to advance motion generation by providing a flexible, efficient, and controllable solution for a wide range of character animations.
