BP-MPC: Optimizing the Closed-Loop Performance of MPC using BackPropagation
Riccardo Zuliani, Efe C. Balta, John Lygeros
TL;DR
The paper addresses tuning MPC policies to maximize closed-loop performance for nonlinear systems with constraints. It introduces BP-MPC, a backpropagation framework that differentiates through the MPC policy by leveraging conservative Jacobians to handle nonsmooth sensitivities and by using linearized dynamics to preserve convex subproblems. Key contributions include a dual-QP based differentiation of the MPC, a modular backpropagation scheme for the whole horizon with convergence guarantees to a critical point, extensions to state-dependent costs/constraints and infeasibility recovery, and demonstration on nonlinear simulation. The approach yields a practical, convergent method to improve MPC performance and provides a path toward robust, differentiable MPC tuning.
Abstract
Model predictive control (MPC) is pervasive in research and industry. However, designing the cost function and the constraints of the MPC to maximize closed-loop performance remains an open problem. To achieve optimal tuning, we propose a backpropagation scheme that solves a policy optimization problem with nonlinear system dynamics and MPC policies. We enforce the system dynamics using linearization and allow the MPC problem to contain elements that depend on the current system state and on past MPC solutions. Moreover, we propose a simple extension that can deal with losses of feasibility. Our approach, unlike other methods in the literature, enjoys convergence guarantees.
