CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models
Kanghyun Ryu, Qiayuan Liao, Zhongyu Li, Payam Delgosha, Koushil Sreenath, Negar Mehr
TL;DR
CurricuLLM automates curriculum design for learning complex robot skills. It uses large language models to generate a sequence of subtasks described in natural language, translate them into executable task code with reward and goal-distribution specifications, and evaluate trained policies through trajectory analysis to select the best-performing policy for each subtask. The method comprises three modules (Curriculum Design, Task Code Generation, and Policy Evaluation) and is validated on manipulation, navigation, locomotion, and a high-dimensional humanoid task, including real-world hardware transfer with the Berkeley Humanoid. Key contributions are (1) a task-level curriculum designer that uses LLMs for planning and coding, (2) a demonstration of its efficacy across diverse robotic domains, and (3) validation that policies learned via CurricuLLM can transfer to real hardware. CurricuLLM performs competitively with or better than baselines such as SAC, HER, and LLM-zeroshot, with especially notable gains on complex tasks like AntMaze, and is successfully deployed on real hardware.
Abstract
Curriculum learning is a training mechanism in reinforcement learning (RL) that facilitates the achievement of complex policies by progressively increasing the task difficulty during training. However, designing effective curricula for a specific task often requires extensive domain knowledge and human intervention, which limits its applicability across various domains. Our core idea is that large language models (LLMs), with their extensive training on diverse language data and ability to encapsulate world knowledge, present significant potential for efficiently breaking down tasks and decomposing skills across various robotics environments. Additionally, the demonstrated success of LLMs in translating natural language into executable code for RL agents strengthens their role in generating task curricula. In this work, we propose CurricuLLM, which leverages the high-level planning and programming capabilities of LLMs for curriculum design, thereby enhancing the efficient learning of complex target tasks. CurricuLLM consists of: (Step 1) generating a sequence of subtasks, in natural language form, that aid target task learning, (Step 2) translating the natural language descriptions of subtasks into executable task code, including the reward code and goal distribution code, and (Step 3) evaluating trained policies based on trajectory rollouts and subtask descriptions. We evaluate CurricuLLM in various robotics simulation environments, spanning manipulation, navigation, and locomotion, to show that CurricuLLM can aid in learning complex robot control tasks. In addition, we validate the humanoid locomotion policy learned through CurricuLLM in the real world. The project website is https://iconlab.negarmehr.com/CurricuLLM/
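
The three steps can be read as a loop over subtasks. The following is a minimal sketch under stated assumptions, not the authors' implementation: every function name, the `Candidate` class, the `num_candidates` parameter, and the toy stubs that stand in for LLM calls and RL training are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Any, Callable, List, Optional, Sequence, Tuple


@dataclass
class Candidate:
    """One trained policy together with the task code that produced it."""
    task_code: str           # generated reward / goal-distribution code (as text)
    policy: Any              # policy trained on that task code
    trajectory_summary: str  # rollout statistics shown to the evaluator


def run_curriculum(
    target_task: str,
    design_curriculum: Callable[[str], List[str]],
    generate_task_codes: Callable[[str, int], List[str]],
    train_policy: Callable[[str, Optional[Any]], Any],
    summarize_rollout: Callable[[Any], str],
    select_best: Callable[[str, Sequence[str]], int],
    num_candidates: int = 2,  # assumption: several task codes per subtask
) -> Tuple[Any, List[Tuple[str, str]]]:
    policy = None
    history = []
    # Step 1: a sequence of subtasks in natural language.
    for subtask in design_curriculum(target_task):
        candidates = []
        # Step 2: translate the subtask description into task code candidates.
        for code in generate_task_codes(subtask, num_candidates):
            trained = train_policy(code, policy)  # continue from previous policy
            candidates.append(Candidate(code, trained, summarize_rollout(trained)))
        # Step 3: pick a policy from trajectory summaries and the description.
        best = select_best(subtask, [c.trajectory_summary for c in candidates])
        policy = candidates[best].policy
        history.append((subtask, candidates[best].task_code))
    return policy, history


# Toy stubs standing in for LLM calls and RL training (purely illustrative).
def toy_design(target: str) -> List[str]:
    return ["stand still", "walk forward"]


def toy_codes(subtask: str, n: int) -> List[str]:
    return [f"{subtask}::variant{i}" for i in range(n)]


def toy_train(code: str, previous: Optional[tuple]) -> tuple:
    # The "policy" here is just the tuple of task codes it was trained on.
    return (previous or ()) + (code,)


def toy_summary(policy: tuple) -> str:
    return policy[-1]


def toy_select(subtask: str, summaries: Sequence[str]) -> int:
    # Placeholder rule: index of the lexicographically largest summary.
    return max(range(len(summaries)), key=lambda i: summaries[i])
```

Calling `run_curriculum("walk", toy_design, toy_codes, toy_train, toy_summary, toy_select)` runs the loop end to end with the stubs. In a real setup each callable would wrap an LLM query or an RL training run; passing them in as arguments keeps the control flow separate from those details.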
