Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior
Ruihan Yang, Zhuoqun Chen, Jianhan Ma, Chongyi Zheng, Yiyu Chen, Quan Nguyen, Xiaolong Wang
TL;DR
This work introduces Versatile Instructable Motion prior (VIM), a reinforcement learning framework that learns a single, reusable motion prior from a diverse set of reference motions (mocap, synthetic, and trajectory-optimized) to enable multiple agile locomotion skills in legged robots. A latent command space, learned via a reference motion encoder and autoregressive KL regularization, guides a low-level policy trained with a combination of Functionality and Style rewards, including an adversarial stylization component. The approach yields spatial-temporal skill representations that support smooth transitions and enable high-level policies to solve downstream tasks such as following commands and jumping, demonstrated in both simulation and real hardware with superior performance and sample efficiency over baselines. VIM achieves realistic, agile behaviors like backflips and jumps without fine-tuning, illustrating strong sim2real transfer and potential for broad robotic applications. The work also discusses limitations and avenues for safety, perception, and dynamics-aware extensions to further enhance robustness and applicability.
Abstract
The agility of animals, particularly in complex activities such as running, turning, jumping, and backflipping, stands as an exemplar for robotic system design. Transferring this suite of behaviors to legged robotic systems introduces essential inquiries: How can a robot learn multiple locomotion behaviors simultaneously? How can the robot execute these tasks with a smooth transition? How to integrate these skills for wide-range applications? This paper introduces the Versatile Instructable Motion prior (VIM) - a Reinforcement Learning framework designed to incorporate a range of agile locomotion tasks suitable for advanced robotic applications. Our framework enables legged robots to learn diverse agile low-level skills by imitating animal motions and manually designed motions. Our Functionality reward guides the robot's ability to adopt varied skills, and our Stylization reward ensures that robot motions align with reference motions. Our evaluations of the VIM framework span both simulation and the real world. Our framework allows a robot to concurrently learn diverse agile locomotion skills using a single learning-based controller in the real world. Videos can be found on our website: https://rchalyang.github.io/VIM/
