Taming Diffusion Probabilistic Models for Character Control
Rui Chen, Mingyi Shi, Shaoli Huang, Ping Tan, Taku Komura, Xuelin Chen
TL;DR
The paper tackles real-time, high-quality, and diverse character control using diffusion-based motion models. It introduces CAMDM, a transformer-based Conditional Autoregressive Motion Diffusion Model that conditions on past motion and coarse user controls to generate future motions in real time, employing design innovations like Separate Condition Tokenization, CFG-PM, and HFTE, and achieving high performance with only 8 denoising steps. Extensive experiments on the 100STYLE mocap dataset demonstrate superior motion quality, diversity, and seamless style transitions compared to baselines, validated by ablations and a comprehensive set of metrics. The work enables a single, unified model to animate characters in multiple styles in real time, with practical implications for games and interactive media, while outlining avenues for speedups and multimodal control extensions.
Abstract
We present a novel character control framework that effectively utilizes motion diffusion probabilistic models to generate high-quality and diverse character animations, responding in real-time to a variety of dynamic user-supplied control signals. At the heart of our method lies a transformer-based Conditional Autoregressive Motion Diffusion Model (CAMDM), which takes as input the character's historical motion and can generate a range of diverse potential future motions conditioned on high-level, coarse user control. To meet the demands for diversity, controllability, and computational efficiency required by a real-time controller, we incorporate several key algorithmic designs. These include separate condition tokenization, classifier-free guidance on past motion, and heuristic future trajectory extension, all designed to address the challenges associated with taming motion diffusion probabilistic models for character control. As a result, our work represents the first model that enables real-time generation of high-quality, diverse character animations based on user interactive control, supporting animating the character in multiple styles with a single unified model. We evaluate our method on a diverse set of locomotion skills, demonstrating the merits of our method over existing character controllers. Project page and source codes: https://aiganimation.github.io/CAMDM/
