Learning Agile Bipedal Motions on a Quadrupedal Robot
Yunfei Li, Jinhan Li, Wei Fu, Yi Wu
TL;DR
Problem: enable agile, human-like bipedal motions on a lightweight quadruped. Approach: formulate the task as an MDP $(\mathcal{S},\mathcal{A},\mathcal{T},r,\gamma,\rho_0)$ and optimize with PPO, training a motion-conditioned policy to balance on hind toes while tracking base and front-limb targets, plus a high-level motion generator that converts videos or language into motion targets; the policy is trained in a calibrated simulator and transferred to real via real-to-sim calibration. Contributions: (i) a two-level framework enabling bipedal maneuvers on a consumer quad platform without external supports; (ii) real-to-sim calibration and domain randomization to bridge sim-to-real gaps; (iii) multiple human-interaction modalities including video mimicry, natural-language instructions, and physical guidance; (iv) demonstrations on a Xiaomi CyberDog2 achieving stand-up, walking at $v_x=\pm0.3$ m/s, and interactive behavior. Significance: provides a cost-effective path to humanoid-like agility on quad platforms and expands human-robot interaction capabilities for assistive and collaborative tasks.
Abstract
Can a quadrupedal robot perform bipedal motions like humans? Although developing human-like behaviors is more often studied on costly bipedal robot platforms, we present a solution over a lightweight quadrupedal robot that unlocks the agility of the quadruped in an upright standing pose and is capable of a variety of human-like motions. Our framework is with a hierarchical structure. At the low level is a motion-conditioned control policy that allows the quadrupedal robot to track desired base and front limb movements while balancing on two hind feet. The policy is commanded by a high-level motion generator that gives trajectories of parameterized human-like motions to the robot from multiple modalities of human input. We for the first time demonstrate various bipedal motions on a quadrupedal robot, and showcase interesting human-robot interaction modes including mimicking human videos, following natural language instructions, and physical interaction. The video is available at https://sites.google.com/view/bipedal-motions-quadruped.
