A Recipe for Efficient Sim-to-Real Transfer in Manipulation with Online Imitation-Pretrained World Models
Yilin Wang, Shangzhe Li, Haoyi Niu, Zhiao Huang, Weitong Zhang, Hao Su
TL;DR
This work tackles imitation learning for robotic manipulation when real-world expert data and rewards are scarce. It introduces a three-stage sim-to-real pipeline that pretrains a latent world model via online imitation in simulation using a CDRED reward to align expert and behavioral data, followed by offline finetuning on a small real-world dataset. Empirically, the approach yields at least $31.7\%$ improvements in sim-to-sim and $23.3\%$ improvements in sim-to-real transfer over offline baselines, with stronger out-of-distribution generalization and better data coverage attributed to online exploration. The method offers a practical path to robust, data-efficient domain transfer for manipulation tasks under reward-free conditions.
Abstract
We are interested in solving the problem of imitation learning with a limited amount of real-world expert data. Existing offline imitation methods often struggle with poor data coverage and severe performance degradation. We propose a solution that leverages robot simulators to achieve online imitation learning. Our sim-to-real framework is based on world models and combines online imitation pretraining with offline finetuning. By leveraging online interactions, our approach alleviates the data coverage limitations of offline methods, leading to improved robustness and reduced performance degradation during finetuning. It also enhances generalization during domain transfer. Our empirical results demonstrate its effectiveness, improving success rates by at least 31.7% in sim-to-sim transfer and 23.3% in sim-to-real transfer over existing offline imitation learning baselines.
