Deep Reinforcement Learning for Robotics: A Survey of Real-World Successes
Chen Tang, Ben Abbatematteo, Jiaheng Hu, Rohan Chandra, Roberto Martín-Martín, Peter Stone
TL;DR
This survey analyzes real-world successes of DRL in robotics, arguing that maturity varies by task and domain. It introduces a four-axis taxonomy (competencies, problem formulation, solution approach, real-world deployment level) to systematically compare papers, with emphasis on real-world validation over simulations. The review highlights mature quadruped locomotion and advancing navigation, while identifying open challenges in biped locomotion, aerial control, manipulation, HRI, and multi-robot systems. It concludes with actionable directions for stable, sample-efficient real-world RL and the potential of foundation models to accelerate deployment in real-world robotics.
Abstract
Reinforcement learning (RL), particularly its combination with deep neural networks referred to as deep RL (DRL), has shown tremendous promise across a wide range of applications, suggesting its potential for enabling the development of sophisticated robotic behaviors. Robotics problems, however, pose fundamental difficulties for the application of RL, stemming from the complexity and cost of interacting with the physical world. This article provides a modern survey of DRL for robotics, with a particular focus on evaluating the real-world successes achieved with DRL in realizing several key robotic competencies. Our analysis aims to identify the key factors underlying those exciting successes, reveal underexplored areas, and provide an overall characterization of the status of DRL in robotics. We highlight several important avenues for future work, emphasizing the need for stable and sample-efficient real-world RL paradigms, holistic approaches for discovering and integrating various competencies to tackle complex long-horizon, open-world tasks, and principled development and evaluation procedures. This survey is designed to offer insights for both RL practitioners and roboticists toward harnessing RL's power to create generally capable real-world robotic systems.
