WANDR: Intention-guided Human Motion Generation
Markos Diomataris, Nikos Athanasiou, Omid Taheri, Xi Wang, Otmar Hilliges, Michael J. Black
TL;DR
WANDR tackles the challenge of synthesizing natural 3D human motion that reaches arbitrary goals from a given initial pose by introducing intention features that guide a frame-by-frame autoregressive c-VAE. The model is trained on two complementary datasets, AMASS and CIRCLE, with pseudo-goals derived from future wrist positions to enable learning from unlabeled data and goal-directed data alike. Key contributions include the intention mechanism, a data-fusion training approach, and a motion generator capable of long-horizon, goal-directed motion with zero-shot generalization to unseen goals. Quantitative and qualitative results demonstrate improved goal-reaching accuracy and motion realism, and the authors release code to support further research in goal-conditioned motion synthesis.
Abstract
Synthesizing natural human motions that enable a 3D human avatar to walk and reach for arbitrary goals in 3D space remains an unsolved problem with many applications. Existing methods (data-driven or using reinforcement learning) are limited in terms of generalization and motion naturalness. A primary obstacle is the scarcity of training data that combines locomotion with goal reaching. To address this, we introduce WANDR, a data-driven model that takes an avatar's initial pose and a goal's 3D position and generates natural human motions that place the end effector (wrist) on the goal location. To solve this, we introduce novel intention features that drive rich goal-oriented movement. Intention guides the agent to the goal, and interactively adapts the generation to novel situations without needing to define sub-goals or the entire motion path. Crucially, intention allows training on datasets that have goal-oriented motions as well as those that do not. WANDR is a conditional Variational Auto-Encoder (c-VAE), which we train using the AMASS and CIRCLE datasets. We evaluate our method extensively and demonstrate its ability to generate natural and long-term motions that reach 3D goals and generalize to unseen goal locations. Our models and code are available for research purposes at wandr.is.tue.mpg.de.
