Simulation-based reinforcement learning for real-world autonomous driving
Błażej Osiński, Adam Jakubowski, Piotr Miłoś, Paweł Zięcina, Christopher Galias, Silviu Homoceanu, Henryk Michalewski
TL;DR
The paper addresses the challenge of transferring an end-to-end reinforcement learning-based autonomous driving policy trained in a simulator to a real vehicle. It uses RGB input from a single camera plus a separately trained semantic segmentation module, with dense rewards in simulation and extensive domain randomization to promote transfer. Key findings show that regularization, segmentation as an auxiliary representation, and waypoint-based control improve real-world performance, while certain memory-based and discrete-action approaches can hinder transfer; an offline proxy metric is explored as a potential predictor of real-world autonomy. The work provides practical guidelines for sim-to-real autonomous driving, highlights the importance of perception-control-training design choices, and suggests avenues for improving robustness, such as alternative RL algorithms, BEV representations, and model-based enhancements to increase sample efficiency and transfer reliability.
Abstract
We use reinforcement learning in simulation to obtain a driving system controlling a full-size real-world vehicle. The driving policy takes RGB images from a single camera and their semantic segmentation as input. We use mostly synthetic data, with labelled real-world data appearing only in the training of the segmentation network. Using reinforcement learning in simulation and synthetic data is motivated by lowering costs and engineering effort. In real-world experiments we confirm that we achieved successful sim-to-real policy transfer. Based on the extensive evaluation, we analyze how design decisions about perception, control, and training impact the real-world performance.
