Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes

Jianqi Chen; Panwen Hu; Xiaojun Chang; Zhenwei Shi; Michael Kampffmeyer; Xiaodan Liang

TL;DR

Sitcom-Crafter addresses the lack of a unified system for diverse human motion generation in 3D scenes by integrating locomotion, human-scene interaction, and human-human interaction (HH-I) under long-form plot guidance. It introduces a self-supervised, scene-aware human-human interaction module that injects synthetic scene information via implicit SDF conditioning, and it unifies motion representation through marker points aided by a body regressor, all within a plot-driven, eight-module pipeline. Experiments on open 3D scenes and established HH-I datasets show improved physics-constraint metrics and motion realism compared with strong baselines. The work aims to streamline creative workflows in animation and game design by enabling cohesive, plot-driven, multi-type motion generation in complex environments.

Abstract

Recent advancements in human motion synthesis have focused on specific types of motion, such as human-scene interaction, locomotion, or human-human interaction; however, there is no unified system capable of generating a diverse combination of motion types. In response, we introduce Sitcom-Crafter, a comprehensive and extendable system for human motion generation in 3D space, which can be guided by extensive plot contexts to enhance workflow efficiency for anime and game designers. The system comprises eight modules: three are dedicated to motion generation, while the remaining five are augmentation modules that ensure consistent fusion of motion sequences and overall system functionality. Central to the generation modules is our novel 3D scene-aware human-human interaction module, which addresses collision issues by synthesizing implicit 3D Signed Distance Function (SDF) points around motion spaces, thereby minimizing human-scene collisions without additional data collection costs. Complementing this, our locomotion and human-scene interaction modules leverage existing methods to enrich the system's motion generation capabilities. The augmentation modules cover plot comprehension for command generation, motion synchronization for seamless integration of different motion types, hand pose retrieval to enhance motion realism, motion collision revision to prevent collisions between humans, and 3D retargeting to ensure visual fidelity. Experimental evaluations validate the system's ability to generate high-quality, diverse, and physically realistic motions, underscoring its potential for advancing creative workflows. Project page: https://windvchen.github.io/Sitcom-Crafter.
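To make the SDF idea in the abstract concrete, the following is a minimal sketch of how a signed distance function can be used to measure human-scene penetration for a set of body marker points. It is not the paper's implementation: the spherical obstacle, the function names `sphere_sdf` and `penetration_penalty`, and the marker coordinates are all assumptions chosen for illustration. The convention assumed here is that the SDF is negative inside an obstacle and positive outside.

```python
import math


def sphere_sdf(point, center, radius):
    """Signed distance from a 3D point to a sphere (negative inside, positive outside)."""
    return math.dist(point, center) - radius


def penetration_penalty(markers, sdf):
    """Sum of penetration depths over all marker points.

    A marker contributes only when its signed distance is negative,
    i.e. when it lies inside the obstacle.
    """
    return sum(max(0.0, -sdf(p)) for p in markers)


# Hypothetical scene: one spherical obstacle of radius 0.5 at the origin.
def obstacle(p):
    return sphere_sdf(p, (0.0, 0.0, 0.0), 0.5)


# Hypothetical body markers: only the second one lies inside the obstacle.
markers = [(1.0, 0.0, 0.0), (0.0, 0.25, 0.0), (0.0, 0.0, 2.0)]
penalty = penetration_penalty(markers, obstacle)
print(penalty)  # 0.25
```

A penalty of this kind is zero for collision-free poses and grows with penetration depth, which is why signed distances are a common way to express collision constraints on generated motion.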
