Partial End-to-end Reinforcement Learning for Robustness Against Modelling Error in Autonomous Racing
Andrew Murdoch, Johannes Cornelius Schoeman, Hendrik Willem Jordaan
TL;DR
This work tackles the simulation-to-reality gap in autonomous racing caused by modelling errors in vehicle dynamics. It introduces a partial end-to-end reinforcement learning framework where an RL planner outputs a trajectory (path and velocity) in the Frenet frame, which is then tracked by a pure pursuit controller and a velocity controller, leveraging track geometry for safety. Compared with fully end-to-end baselines, the approach shows fewer crashes and higher success rates, especially on complex tracks, under a range of model mismatches including friction, tire stiffness, and mass variations. The key contributions include explicit Frenet-frame path generation, integration with classical controllers, and a robust evaluation demonstrating improved safety and training efficiency for real-world deployment in autonomous racing settings.
Abstract
In this paper, we address the issue of increasing the performance of reinforcement learning (RL) solutions for autonomous racing cars when navigating under conditions where practical vehicle modelling errors (commonly known as \emph{model mismatches}) are present. To address this challenge, we propose a partial end-to-end algorithm that decouples the planning and control tasks. Within this framework, an RL agent generates a trajectory comprising a path and velocity, which is subsequently tracked using a pure pursuit steering controller and a proportional velocity controller, respectively. In contrast, many current learning-based (i.e., reinforcement and imitation learning) algorithms utilise an end-to-end approach whereby a deep neural network directly maps from sensor data to control commands. By leveraging the robustness of a classical controller, our partial end-to-end driving algorithm exhibits better robustness towards model mismatches than standard end-to-end algorithms.
