Scalable and General Whole-Body Control for Cross-Humanoid Locomotion
Yufei Xue, YunFeng Lin, Wentao Dong, Yang Tang, Jingbo Wang, Jiangmiao Pang, Ming Zhou, Minghuan Liu, Weinan Zhang
TL;DR
XHugWBC tackles cross-humanoid whole-body control by learning a single generalist policy trained with physics-consistent morphological randomization and a universal, semantically aligned joint-space representation. The policy uses a GCN or Transformer encoder over an embodiment graph to fuse kinematic topology with proprioceptive history, supported by a state estimator and a node-wise action decoder to map to each robot's joints. In simulation, it generalizes to twelve morphologies and seven real humanoids, achieving approximately 85% of specialist performance on unseen robots and 100% survival in zero-shot real-world tests; fine-tuning further improves results. Real-world demonstrations include teleoperation-driven loco-manipulation across long-horizon tasks, indicating practical, scalable transfer without per-robot retraining.
Abstract
Learning-based whole-body controllers have become a key driver for humanoid robots, yet most existing approaches require robot-specific training. In this paper, we study the problem of cross-embodiment humanoid control and show that a single policy can robustly generalize across a wide range of humanoid robot designs with one-time training. We introduce XHugWBC, a novel cross-embodiment training framework that enables generalist humanoid control through: (1) physics-consistent morphological randomization, (2) semantically aligned observation and action spaces across diverse humanoid robots, and (3) effective policy architectures modeling morphological and dynamical properties. XHugWBC is not tied to any specific robot. Instead, it internalizes a broad distribution of morphological and dynamical characteristics during training. By learning motion priors from diverse randomized embodiments, the policy acquires a strong structural bias that supports zero-shot transfer to previously unseen robots. Experiments on twelve simulated humanoids and seven real-world robots demonstrate the strong generalization and robustness of the resulting universal controller.
