Reinforcement Learning-based Dynamic Adaptation for Sampling-Based Motion Planning in Agile Autonomous Driving
Alexander Langmann, Yevhenii Tokarev, Mattia Piccinini, Korbinian Moller, Johannes Betz
TL;DR
This work presents a hybrid motion‑planning approach that layers a high‑level reinforcement learning agent atop a low‑level sampling‑based planner in a Frenet frame to achieve dynamic, interactive maneuvers in autonomous racing. The RL agent selects among predefined cost‑weight sets to balance safety, racing performance, and interaction with opponents, preserving trajectory validity by design. Evaluations in simulation demonstrate that the adaptive planner attains 0% collisions while delivering faster overtakes and better interaction than static parameter configurations, and it generalizes to unseen tracks. The approach offers a path toward adaptive yet interpretable motion planning suitable for safety‑critical autonomous driving beyond racing scenarios.
Abstract
Sampling-based trajectory planners are widely used for agile autonomous driving due to their ability to generate fast, smooth, and kinodynamically feasible trajectories. However, their behavior is often governed by a cost function with manually tuned, static weights, which forces a tactical compromise that is suboptimal across the wide range of scenarios encountered in a race. To address this shortcoming, we propose using a Reinforcement Learning (RL) agent as a high-level behavioral selector that dynamically switches the cost function parameters of an analytical, low-level trajectory planner during runtime. We show the effectiveness of our approach in simulation in an autonomous racing environment where our RL-based planner achieved 0% collision rate while reducing overtaking time by up to 60% compared to state-of-the-art static planners. Our new agent now dynamically switches between aggressive and conservative behaviors, enabling interactive maneuvers unattainable with static configurations. These results demonstrate that integrating reinforcement learning as a high-level selector resolves the inherent trade-off between safety and competitiveness in autonomous racing planners. The proposed methodology offers a pathway toward adaptive yet interpretable motion planning for broader autonomous driving applications.
