PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction
Yinhuai Wang, Jing Lin, Ailing Zeng, Zhengyi Luo, Jian Zhang, Lei Zhang
TL;DR
This work tackles dynamic whole-body human-object interaction imitation in physics-based simulation by introducing PhysHOI, a framework that uses a general-purpose contact graph and a task-agnostic reward to guide imitation without task-specific rewards. It adopts a contact-aware HOI representation and a task-agnostic imitation reward, optimized via PPO in a simulation loop with a 52-part SMPL-X humanoid and objects, aided by an aggregated contact graph. The BallPlay dataset of eight basketball skills provides dynamic HOI data to support learning. Experiments on GRAB and BallPlay demonstrate improved success rates and reduced tracking errors, with the contact graph reward (CGR) shown to be crucial for accurate contact and robust imitation. This work advances general HOI learning for robotics and animation by reducing reliance on hand-crafted rewards and explicitly modeling contact dynamics.
Abstract
Humans interact with objects all the time. Enabling a humanoid to learn human-object interaction (HOI) is a key step for future smart animation and intelligent robotics systems. However, recent progress in physics-based HOI requires carefully designed task-specific rewards, making the system unscalable and labor-intensive. This work focuses on dynamic HOI imitation: teaching humanoid dynamic interaction skills through imitating kinematic HOI demonstrations. It is quite challenging because of the complexity of the interaction between body parts and objects and the lack of dynamic HOI data. To handle the above issues, we present PhysHOI, the first physics-based whole-body HOI imitation approach without task-specific reward designs. Except for the kinematic HOI representations of humans and objects, we introduce the contact graph to model the contact relations between body parts and objects explicitly. A contact graph reward is also designed, which proved to be critical for precise HOI imitation. Based on the key designs, PhysHOI can imitate diverse HOI tasks simply yet effectively without prior knowledge. To make up for the lack of dynamic HOI scenarios in this area, we introduce the BallPlay dataset that contains eight whole-body basketball skills. We validate PhysHOI on diverse HOI tasks, including whole-body grasping and basketball skills.
