Performance-driven Constrained Optimal Auto-Tuner for MPC
Albert Gassol Puigjaner, Manish Prajapat, Andrea Carron, Andreas Krause, Melanie N. Zeilinger
TL;DR
The paper tackles tuning MPC cost function parameters under a hard performance constraint by modeling the unknown performance function with a Gaussian process and enforcing safety through Lipschitz-based optimistic/pessimistic sets. It introduces COAt-MPC, a safe, goal-directed auto-tuner that samples only from the pessimistic set while targeting a goal in the optimistic set, yielding finite-time convergence to the constrained optimum with high probability. Theoretical guarantees show the constraint is satisfied with probability at least 1-δ at all iterations and that the optimum under the constraint is reached within a finite budget, independent of discretization granularity. Empirically, COAt-MPC outperforms constrained and unconstrained baselines in autonomous racing, achieving lower constraint violations and competitive or better cumulative regret, with faster convergence on an RC platform. The work advances safe, data-efficient MPC tuning and suggests scalable extensions to higher-dimensional parameter spaces.
Abstract
A key challenge in tuning Model Predictive Control (MPC) cost function parameters is to ensure that the system performance stays consistently above a certain threshold. To address this challenge, we propose a novel method, COAT-MPC, Constrained Optimal Auto-Tuner for MPC. With every tuning iteration, COAT-MPC gathers performance data and learns by updating its posterior belief. It explores the tuning parameters' domain towards optimistic parameters in a goal-directed fashion, which is key to its sample efficiency. We theoretically analyze COAT-MPC, showing that it satisfies performance constraints with arbitrarily high probability at all times and provably converges to the optimum performance within finite time. Through comprehensive simulations and comparative analyses with a hardware platform, we demonstrate the effectiveness of COAT-MPC in comparison to classical Bayesian Optimization (BO) and other state-of-the-art methods. When applied to autonomous racing, our approach outperforms baselines in terms of constraint violations and cumulative regret over time.
