Learning Soft Constrained MPC Value Functions: Efficient MPC Design and Implementation providing Stability and Safety Guarantees
Nicolas Chatzikiriakos, Kim P. Wabersich, Felix Berkel, Patricia Pauli, Andrea Iannelli
TL;DR
The paper addresses stability and safety of nonlinear MPC on embedded hardware by learning a continuous value function for a soft constrained MPC, enabling efficient online control. It extends soft constrained MPC to nonlinear systems with constraint tightening, proving local Lipschitz continuity of the value function and input-to-state stability (ISS) of the closed-loop under approximation error. It further proposes a split-value approach to decompose the value function into a smooth performance term and a nonnegative safety term, facilitating supervised learning and ensuring constraint satisfaction. A nonlinear mass-spring-damper example demonstrates feasibility and fast evaluation while preserving the closed-loop behavior and constraint safety.
Abstract
Model Predictive Control (MPC) can be applied to safety-critical control problems, providing closed-loop safety and performance guarantees. Implementation of MPC controllers requires solving an optimization problem at every sampling instant, which is challenging to execute on embedded hardware. To address this challenge, we propose a framework that combines a tightened soft constrained MPC formulation with supervised learning to approximate the MPC value function. This combination enables us to obtain a corresponding optimal control law, which can be implemented efficiently on embedded platforms. The framework ensures stability and constraint satisfaction for various nonlinear systems. While the design effort is similar to that of nominal MPC, the proposed formulation provides input-to-state stability (ISS) with respect to the approximation error of the value function. Furthermore, we prove that the value function corresponding to the soft constrained MPC problem is Lipschitz continuous for Lipschitz continuous systems, even if the optimal control law may be discontinuous. This serves two purposes: First, it allows to relate approximation errors to a sufficiently large constraint tightening to obtain constraint satisfaction guarantees. Second, it paves the way for an efficient supervised learning procedure to obtain a continuous value function approximation. We demonstrate the effectiveness of the method using a nonlinear numerical example.
