A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models

Longchao Da; Justin Turnau; Thirulogasankar Pranav Kutralingam; Alvaro Velasquez; Paulo Shakarian; Hua Wei

A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models

Longchao Da, Justin Turnau, Thirulogasankar Pranav Kutralingam, Alvaro Velasquez, Paulo Shakarian, Hua Wei

TL;DR

The paper addresses the sim-to-real challenge in RL by proposing a formal, MDP-based taxonomy that spans observation, action, transition, and reward dimensions, and by surveying both classical techniques and emerging foundation-model–driven methods. It synthesizes domain-specific insights, benchmarks, and evaluation protocols, while highlighting GenAI-based simulation trends and a publicly maintained research repository. Key contributions include a rigorous taxonomy, a comprehensive literature review across domains, and a discussion of evaluation settings and metrics to quantify transfer gaps. The work aims to unify disparate strands of sim-to-real research, guiding future development toward safer, more scalable deployment of RL in real-world systems.

Abstract

Deep Reinforcement Learning (RL) has been explored and verified to be effective in solving decision-making tasks in various domains, such as robotics, transportation, recommender systems, etc. It learns from the interaction with environments and updates the policy using the collected experience. However, due to the limited real-world data and unbearable consequences of taking detrimental actions, the learning of RL policy is mainly restricted within the simulators. This practice guarantees safety in learning but introduces an inevitable sim-to-real gap in terms of deployment, thus causing degraded performance and risks in execution. There are attempts to solve the sim-to-real problems from different domains with various techniques, especially in the era with emerging techniques such as large foundations or language models that have cast light on the sim-to-real. This survey paper, to the best of our knowledge, is the first taxonomy that formally frames the sim-to-real techniques from key elements of the Markov Decision Process (State, Action, Transition, and Reward). Based on the framework, we cover comprehensive literature from the classic to the most advanced methods including the sim-to-real techniques empowered by foundation models, and we also discuss the specialties that are worth attention in different domains of sim-to-real problems. Then we summarize the formal evaluation process of sim-to-real performance with accessible code or benchmarks. The challenges and opportunities are also presented to encourage future exploration of this direction. We are actively maintaining a repository to include the most up-to-date sim-to-real research work to help domain researchers.

A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models

TL;DR

Abstract

A Survey of Sim-to-Real Methods in RL: Progress, Prospects and Challenges with Foundation Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)