Human Motion Prediction, Reconstruction, and Generation
Canxuan Gang, Yiran Wang
TL;DR
This survey consolidates advances across three intertwined strands of human motion: prediction, reconstruction, and generation, highlighting how transformers, diffusion models, and physics-informed priors address challenges like occlusion, long-term consistency, and fine-grained hand-object interactions. It surveys predictive methods that handle perturbations and scene constraints, reconstruction approaches that achieve real-time 3D mesh tracking under noise and occlusion, and generation techniques that synthesize diverse, controllable motions from actions, text, or language prompts. Key contributions include diffusion-based pipelines for robust motion reconstruction, latent-dis diffusion and discrete-token frameworks for text-to-motion, and long-sequence generation strategies that enable coherent action transitions. Collectively, the work underlines the importance of physics, scene context, and modal conditioning for realistic and applicable motion synthesis in robotics, AR/VR, gaming, and animation.
Abstract
This report reviews recent advancements in human motion prediction, reconstruction, and generation. Human motion prediction focuses on forecasting future poses and movements from historical data, addressing challenges like nonlinear dynamics, occlusions, and motion style variations. Reconstruction aims to recover accurate 3D human body movements from visual inputs, often leveraging transformer-based architectures, diffusion models, and physical consistency losses to handle noise and complex poses. Motion generation synthesizes realistic and diverse motions from action labels, textual descriptions, or environmental constraints, with applications in robotics, gaming, and virtual avatars. Additionally, text-to-motion generation and human-object interaction modeling have gained attention, enabling fine-grained and context-aware motion synthesis for augmented reality and robotics. This review highlights key methodologies, datasets, challenges, and future research directions driving progress in these fields.
