Learning Visual Quadrupedal Loco-Manipulation from Demonstrations
Zhengmao He, Kun Lei, Yanjie Ze, Koushil Sreenath, Zhongyu Li, Huazhe Xu
TL;DR
The paper tackles legged loco-manipulation by integrating a high-level diffusion-based BC planner with a low-level PPO-based controller, enabling a quadruped to perform manipulation tasks solely with its legs. It introduces a trajectory-parameterization scheme using manipulator flags, rational Bézier curves, and SLERP-orientation, all coordinated across world/body frames to ensure robust planning and tracking. Expert demonstrations collected in parallel simulations train the planner, while a learned low-level controller handles precise end-effector tracking under varied dynamics and randomization, achieving sim-to-real transfer on a Unitree Aliengo. The approach demonstrates superior performance across nine tasks, robustness to unexpected disturbances, and efficient data usage compared to baselines like HRL and end-to-end BC/VRL. This framework advances practical, mobile, leg-based manipulation in real-world environments, reducing reliance on additional robotic arms.
Abstract
Quadruped robots are progressively being integrated into human environments. Despite the growing locomotion capabilities of quadrupedal robots, their interaction with objects in realistic scenes is still limited. While additional robotic arms on quadrupedal robots enable manipulating objects, they are sometimes redundant given that a quadruped robot is essentially a mobile unit equipped with four limbs, each possessing 3 degrees of freedom (DoFs). Hence, we aim to empower a quadruped robot to execute real-world manipulation tasks using only its legs. We decompose the loco-manipulation process into a low-level reinforcement learning (RL)-based controller and a high-level Behavior Cloning (BC)-based planner. By parameterizing the manipulation trajectory, we synchronize the efforts of the upper and lower layers, thereby leveraging the advantages of both RL and BC. Our approach is validated through simulations and real-world experiments, demonstrating the robot's ability to perform tasks that demand mobility and high precision, such as lifting a basket from the ground while moving, closing a dishwasher, pressing a button, and pushing a door. Project website: https://zhengmaohe.github.io/leg-manip
