Self-Supervised Learning of Iterative Solvers for Constrained Optimization
Lukas Lüken, Sergio Lucia
TL;DR
The paper tackles the challenge of real-time, high-accuracy solutions to parametric constrained optimization, especially in model predictive control, where traditional solvers struggle under tight time constraints. It introduces LISCO, a two-stage, learning-based solver comprising a predictor for warm-start primal-dual estimates and a solver that iteratively refines these estimates using updates guided by a differentiable KKT residual, all trained in a fully self-supervised fashion through a KKT-based loss. A convexification strategy enables application to nonconvex problems while preserving the theoretical link between the training loss and KKT points, with guarantees that minima of the per-sample loss align with KKT points. Empirical results on NMPC with a nonlinear double integrator and a high-dimensional nonconvex QP show substantial online speedups over IPOPT and higher accuracy than competing learning-based baselines such as DC3 and PDL, highlighting LISCO’s potential for real-time certified optimization in complex control tasks.
Abstract
The real-time solution of parametric optimization problems is critical for applications that demand high accuracy under tight real-time constraints, such as model predictive control. To this end, this work presents a learning-based iterative solver for constrained optimization, comprising a neural network predictor that generates initial primal-dual solution estimates, followed by a learned iterative solver that refines these estimates to reach high accuracy. We introduce a novel loss function based on Karush-Kuhn-Tucker (KKT) optimality conditions, enabling fully self-supervised training without pre-sampled optimizer solutions. Theoretical guarantees ensure that the training loss function attains minima exclusively at KKT points. A convexification procedure enables application to nonconvex problems while preserving these guarantees. Experiments on two nonconvex case studies demonstrate speedups of up to one order of magnitude compared to state-of-the-art solvers such as IPOPT, while achieving orders of magnitude higher accuracy than competing learning-based approaches.
