Integrating Controllable Motion Skills from Demonstrations
Honghao Liao, Zhiheng Li, Ziyu Meng, Ran Song, Yibin Li, Wei Zhang
TL;DR
CSI tackles the problem of flexibly integrating multiple motion skills into a single legged-robot controller without extensive reward engineering, and extends capability with language-directed control via a high-level NLI module. It presents a GAILfO-based learning framework augmented with Conditional Imitation Learning, a latent conditioning from skill labels, and a Condition-Aware Discriminator to ensure controllability across many skills. The method yields a scalable, demonstration-driven pipeline enabling smooth skill transitions and zero-shot language guidance, demonstrated on both humanoid and quadruped platforms. This approach reduces design complexity, expands the skill repertoire accessible to legged robots, and enables practical, language-guided control for real-world tasks.
Abstract
The expanding applications of legged robots require their mastery of versatile motion skills. Correspondingly, researchers must address the challenge of integrating multiple diverse motion skills into controllers. While existing reinforcement learning (RL)-based approaches have achieved notable success in multi-skill integration for legged robots, these methods often require intricate reward engineering or are restricted to integrating a predefined set of motion skills constrained by specific task objectives, resulting in limited flexibility. In this work, we introduce a flexible multi-skill integration framework named Controllable Skills Integration (CSI). CSI enables the integration of a diverse set of motion skills with varying styles into a single policy without the need for complex reward tuning. Furthermore, in a hierarchical control manner, the trained low-level policy can be coupled with a high-level Natural Language Inference (NLI) module to enable preliminary language-directed skill control. Our experiments demonstrate that CSI can flexibly integrate a diverse array of motion skills more comprehensively and facilitate the transitions between different skills. Additionally, CSI exhibits good scalability as the number of motion skills to be integrated increases significantly.
