Curriculum Learning for Reinforcement Learning Domains: A Framework and Survey
Sanmit Narvekar, Bei Peng, Matteo Leonetti, Jivko Sinapov, Matthew E. Taylor, Peter Stone
TL;DR
This survey formalizes curriculum learning for reinforcement learning, framing curricula as DAGs over samples and tasks to accelerate training. It decomposes CL into task generation, sequencing, and transfer learning, and develops a seven-dimensional taxonomy to classify methods. By surveying sample-reordering, co-learning, and environment-modification approaches, it highlights open problems including fully automated task creation, sim-to-real transfer, and theoretical guarantees. The work provides a consolidated foundation and guidance for researchers pursuing efficient, scalable curricula in RL and related domains.
Abstract
Reinforcement learning (RL) is a popular paradigm for addressing sequential decision tasks in which the agent has only limited environmental feedback. Despite many advances over the past three decades, learning in many domains still requires a large amount of interaction with the environment, which can be prohibitively expensive in realistic scenarios. To address this problem, transfer learning has been applied to reinforcement learning such that experience gained in one task can be leveraged when starting to learn the next, harder task. More recently, several lines of research have explored how tasks, or data samples themselves, can be sequenced into a curriculum for the purpose of learning a problem that may otherwise be too difficult to learn from scratch. In this article, we present a framework for curriculum learning (CL) in reinforcement learning, and use it to survey and classify existing CL methods in terms of their assumptions, capabilities, and goals. Finally, we use our framework to find open problems and suggest directions for future RL curriculum learning research.
