Long-horizon Locomotion and Manipulation on a Quadrupedal Robot with Large Language Models
Yutao Ouyang, Jinhan Li, Yunfei Li, Zhongyu Li, Chao Yu, Koushil Sreenath, Yi Wu
TL;DR
This work tackles long-horizon loco-manipulation for quadrupedal robots by combining a cascade of large language models for high-level planning with a reinforcement-learning-based pool of low-level skills. The high-level module decomposes tasks into discrete and continuous actions via a semantic planner, parameter calculator, code generator, and a replanner to ground to executable robot code; the low-level layer learns locomotion and manipulation policies through PPO and hierarchical RL, including recovery strategies. The system demonstrates multi-step strategies such as building tools or requesting human help, achieving over 70% success in simulation and successful real-world deployment on a CyberDog2 after domain randomization. These results underscore the importance of branching, replanning, and policy chaining for robust autonomous long-horizon robot autonomy in complex, unstructured environments.
Abstract
We present a large language model (LLM) based system to empower quadrupedal robots with problem-solving abilities for long-horizon tasks beyond short-term motions. Long-horizon tasks for quadrupeds are challenging since they require both a high-level understanding of the semantics of the problem for task planning and a broad range of locomotion and manipulation skills to interact with the environment. Our system builds a high-level reasoning layer with large language models, which generates hybrid discrete-continuous plans as robot code from task descriptions. It comprises multiple LLM agents: a semantic planner that sketches a plan, a parameter calculator that predicts arguments in the plan, a code generator that converts the plan into executable robot code, and a replanner that handles execution failures or human interventions. At the low level, we adopt reinforcement learning to train a set of motion planning and control skills to unleash the flexibility of quadrupeds for rich environment interactions. Our system is tested on long-horizon tasks that are infeasible to complete with one single skill. Simulation and real-world experiments show that it successfully figures out multi-step strategies and demonstrates non-trivial behaviors, including building tools or notifying a human for help. Demos are available on our project page: https://sites.google.com/view/long-horizon-robot.
