PHUMA: Physically-Grounded Humanoid Locomotion Dataset

Kyungmin Lee; Sibeen Kim; Minho Park; Hyunseung Kim; Dongyoon Hwang; Hojoon Lee; Jaegul Choo

PHUMA: Physically-Grounded Humanoid Locomotion Dataset

Kyungmin Lee, Sibeen Kim, Minho Park, Hyunseung Kim, Dongyoon Hwang, Hojoon Lee, Jaegul Choo

TL;DR

PHUMA addresses the scarcity and physical artifacts in humanoid motion data by combining large-scale human video with physics-aware curation and a physics-constrained retargeting method, PhySINK, to produce physically plausible humanoid motions. The two-stage pipeline yields 73 hours of data across 76 thousand clips, enabling policies that outperform AMASS and Humanoid-X in both full-motion imitation and pelvis-only path following on Unitree G1 and H1-2. Through MaskedMimic-based PPO training, PHUMA-trained policies show improved success rates on unseen motions and in precise path-following, demonstrating the value of physically grounded data for scalable humanoid control.

Abstract

Motion imitation is a promising approach for humanoid locomotion, enabling agents to acquire humanlike behaviors. Existing methods typically rely on high-quality motion capture datasets such as AMASS, but these are scarce and expensive, limiting scalability and diversity. Recent studies attempt to scale data collection by converting large-scale internet videos, exemplified by Humanoid-X. However, they often introduce physical artifacts such as floating, penetration, and foot skating, which hinder stable imitation. In response, we introduce PHUMA, a Physically-grounded HUMAnoid locomotion dataset that leverages human video at scale, while addressing physical artifacts through careful data curation and physics-constrained retargeting. PHUMA enforces joint limits, ensures ground contact, and eliminates foot skating, producing motions that are both large-scale and physically reliable. We evaluated PHUMA in two sets of conditions: (i) imitation of unseen motion from self-recorded test videos and (ii) path following with pelvis-only guidance. In both cases, PHUMA-trained policies outperform Humanoid-X and AMASS, achieving significant gains in imitating diverse motions. The code is available at https://davian-robotics.github.io/PHUMA.

PHUMA: Physically-Grounded Humanoid Locomotion Dataset

TL;DR

Abstract

PHUMA: Physically-Grounded Humanoid Locomotion Dataset

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)