A Survey on Human Interaction Motion Generation

Kewei Sui; Anindita Ghosh; Inwoo Hwang; Bing Zhou; Jian Wang; Chuan Guo

A Survey on Human Interaction Motion Generation

Kewei Sui, Anindita Ghosh, Inwoo Hwang, Bing Zhou, Jian Wang, Chuan Guo

TL;DR

This survey addresses the problem of generating realistic human interaction motions across four interaction settings: human-human, human-object, human-scene, and human-mix. It surveys foundational concepts, conditioning modalities, and a spectrum of generation methods from motion graphs and regression to diffusion models, transformers, RL with physics, and LLM-assisted planning. It catalogs datasets and evaluation metrics, emphasizing fidelity, naturalness, diversity, and condition coherence, and discusses data, physics, representation, and controllability as core challenges. The work underscores the importance of physics-informed, multi-modal, and context-aware modeling to advance practical applications in robotics, VR, and animation, and outlines four promising directions for future research.

Abstract

Humans inhabit a world defined by interactions -- with other humans, objects, and environments. These interactive movements not only convey our relationships with our surroundings but also demonstrate how we perceive and communicate with the real world. Therefore, replicating these interaction behaviors in digital systems has emerged as an important topic for applications in robotics, virtual reality, and animation. While recent advances in deep generative models and new datasets have accelerated progress in this field, significant challenges remain in modeling the intricate human dynamics and their interactions with entities in the external world. In this survey, we present, for the first time, a comprehensive overview of the literature in human interaction motion generation. We begin by establishing foundational concepts essential for understanding the research background. We then systematically review existing solutions and datasets across three primary interaction tasks -- human-human, human-object, and human-scene interactions -- followed by evaluation metrics. Finally, we discuss open research directions and future opportunities.

A Survey on Human Interaction Motion Generation

TL;DR

Abstract

A Survey on Human Interaction Motion Generation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)