General Humanoid Whole-Body Control via Pretraining and Fast Adaptation
Zepeng Wang, Jiangxing Wang, Shiqing Yao, Yu Zhang, Ziluo Ding, Ming Yang, Yuxuan Wang, Haobin Jiang, Chao Ma, Xiaochuan Shi, Zongqing Lu
TL;DR
FAST addresses the challenge of general humanoid whole-body control under distribution shifts by coupling a pretrained, Center-of-Mass-Aware base controller with a lightweight Parseval-regularized residual policy. The residual is trained with KL constraints to bound drift, and action composition follows $a_t = a_t^b + a_t^r$, enabling rapid adaptation while preserving prior behavior. The framework relies on curated motion datasets and a mixture-of-experts policy, augmented with CoM/CoP signals to improve balance during tracking and teleoperation. Empirical results in simulation and on a real Unitree G1 robot show that FAST outperforms state-of-the-art baselines in robustness, adaptation efficiency, and generalization, including zero-shot high-dynamic tracking, fast adaptation to low-quality motions, and real-time teleoperation.
Abstract
Learning a general whole-body controller for humanoid robots remains challenging due to the diversity of motion distributions, the difficulty of fast adaptation, and the need for robust balance in high-dynamic scenarios. Existing approaches often require task-specific training or suffer from performance degradation when adapting to new motions. In this paper, we present FAST, a general humanoid whole-body control framework that enables Fast Adaptation and Stable Motion Tracking. FAST introduces Parseval-Guided Residual Policy Adaptation, which learns a lightweight delta action policy under orthogonality and KL constraints, enabling efficient adaptation to out-of-distribution motions while mitigating catastrophic forgetting. To further improve physical robustness, we propose Center-of-Mass-Aware Control, which incorporates CoM-related observations and objectives to enhance balance when tracking challenging reference motions. Extensive experiments in simulation and real-world deployment demonstrate that FAST consistently outperforms state-of-the-art baselines in robustness, adaptation efficiency, and generalization.
