HumanoidExo: Scalable Whole-Body Humanoid Manipulation via Wearable Exoskeleton
Rui Zhong, Yizhe Sun, Junjie Wen, Jinming Li, Chuang Cheng, Wei Dai, Zhiwen Zeng, Huimin Lu, Yichen Zhu, Yi Xu
TL;DR
This work tackles the data bottleneck in scalable humanoid policy learning by introducing HumanoidExo, a wearable exoskeleton system that captures human whole-body motion and translates it into robot-ready data. The authors pair this with HE-VLA, a hybrid Vision-Language-Action and reinforcement learning framework, to bootstrap learning from exoskeleton demonstrations and ensure robust balance for real-world execution on a Unitree G1 humanoid. Across three tasks, HumanoidExo data significantly improves generalization and data efficiency, enabling complex manipulation and even novel skills such as walking to be learned with as little as a few real-robot demonstrations. The results suggest that exoskeleton-driven data can substitute or complement teleoperation, offering a scalable path toward generalist humanoid policies with strong real-world applicability.
Abstract
A significant bottleneck in humanoid policy learning is the acquisition of large-scale, diverse datasets, as collecting reliable real-world data remains both difficult and cost-prohibitive. To address this limitation, we introduce HumanoidExo, a novel system that transfers human motion to whole-body humanoid data. HumanoidExo offers a high-efficiency solution that minimizes the embodiment gap between the human demonstrator and the robot, thereby tackling the scarcity of whole-body humanoid data. By facilitating the collection of more voluminous and diverse datasets, our approach significantly enhances the performance of humanoid robots in dynamic, real-world scenarios. We evaluated our method across three challenging real-world tasks: table-top manipulation, manipulation integrated with stand-squat motions, and whole-body manipulation. Our results empirically demonstrate that HumanoidExo is a crucial addition to real-robot data, as it enables the humanoid policy to generalize to novel environments, learn complex whole-body control from only five real-robot demonstrations, and even acquire new skills (i.e., walking) solely from HumanoidExo data.
