Contextual Tuning of Model Predictive Control for Autonomous Racing
Lukas P. Fröhlich, Christian Küttel, Elena Arcari, Lukas Hewing, Melanie N. Zeilinger, Andrea Carron
TL;DR
The paper addresses the challenge of adapting both a predictive dynamics model and controller parameters in learning-based MPC for autonomous racing under changing conditions. It introduces a residual dynamics model that encodes environmental context and employs contextual Bayesian optimization to transfer knowledge across contexts, enabling data-efficient tuning. The approach is validated on a hardware platform with 1:28 RC cars, showing improved prediction accuracy and faster convergence to context-specific optimal controllers than standard Bayesian optimization. The work demonstrates that context-aware learning and tuning can significantly enhance closed-loop performance in autonomous racing with limited data, and provides an open-source implementation for the community.
Abstract
Learning-based model predictive control has been widely applied in autonomous racing to improve the closed-loop behaviour of vehicles in a data-driven manner. When environmental conditions change, e.g., due to rain, often only the predictive model is adapted, but the controller parameters are kept constant. However, this can lead to suboptimal behaviour. In this paper, we address the problem of data-efficient controller tuning, adapting both the model and objective simultaneously. The key novelty of the proposed approach is that we leverage a learned dynamics model to encode the environmental condition as a so-called context. This insight allows us to employ contextual Bayesian optimization to efficiently transfer knowledge across different environmental conditions. Consequently, we require fewer data to find the optimal controller configuration for each context. The proposed framework is extensively evaluated with more than 3'000 laps driven on an experimental platform with 1:28 scale RC race cars. The results show that our approach successfully optimizes the lap time across different contexts requiring fewer data compared to other approaches based on standard Bayesian optimization.
