DrEureka: Language Model Guided Sim-To-Real Transfer
Yecheng Jason Ma, William Liang, Hung-Ju Wang, Sam Wang, Yuke Zhu, Linxi Fan, Osbert Bastani, Dinesh Jayaraman
TL;DR
This work presents DrEureka, an LLM-guided pipeline that automates reward design and domain randomization for sim-to-real transfer in robotics. By decomposing the problem into reward synthesis, a reward-aware physics prior, and LLM-driven DR generation, DrEureka achieves competitive real-world transfer on quadruped locomotion and dexterous manipulation without manual tuning. It demonstrates robustness through a novel task—walking a quadruped on a yoga ball—and shows superiority over human-designed configurations and prior DR baselines. The results suggest that coupling foundation models with physics simulators can substantially accelerate real-world robot learning with reduced human labor.
Abstract
Transferring policies learned in simulation to the real world is a promising strategy for acquiring robot skills at scale. However, sim-to-real approaches typically rely on manual design and tuning of the task reward function as well as the simulation physics parameters, rendering the process slow and human-labor intensive. In this paper, we investigate using Large Language Models (LLMs) to automate and accelerate sim-to-real design. Our LLM-guided sim-to-real approach, DrEureka, requires only the physics simulation for the target task and automatically constructs suitable reward functions and domain randomization distributions to support real-world transfer. We first demonstrate that our approach can discover sim-to-real configurations that are competitive with existing human-designed ones on quadruped locomotion and dexterous manipulation tasks. Then, we showcase that our approach is capable of solving novel robot tasks, such as quadruped balancing and walking atop a yoga ball, without iterative manual design.
