Deep FlexQP: Accelerated Nonlinear Programming via Deep Unfolding
Alex Oshin, Rahul Vodeb Ghosh, Augustinos D. Saravanos, Evangelos A. Theodorou
TL;DR
This work tackles the challenge of solving NLPs with potentially infeasible linearizations by introducing FlexQP, an always-feasible QP solver built on an exact elastic relaxation and ADMM-style splitting. It then enhances FlexQP with Deep FlexQP, a learning-based extension that jointly learns dimension-agnostic feedback policies for penalty and relaxation parameters via deep unfolding, supported by PAC-Bayes generalization guarantees. The approach is validated across diverse problem classes, including portfolio optimization, SVMs, LASSO, Huber fitting, and nonlinear optimal control, demonstrating substantial speedups and robust performance, including in nonconvex NLP via SQP subproblems. The results show that learning-based parameter control yields notable convergence and runtime benefits, with formal certificates providing a practical safety net for deployment in real-time control and predictive safety applications.
Abstract
We propose an always-feasible quadratic programming (QP) optimizer, FlexQP, which is based on an exact relaxation of the QP constraints. If the original constraints are feasible, then the optimizer finds the optimal solution to the original QP. On the other hand, if the constraints are infeasible, the optimizer identifies a solution that minimizes the constraint violation in a sparse manner. FlexQP scales favorably with respect to the problem dimension, is robust to both feasible and infeasible QPs with minimal assumptions on the problem data, and can be effectively warm-started. We subsequently apply deep unfolding to improve our optimizer through data-driven techniques, leading to an accelerated Deep FlexQP. By learning dimension-agnostic feedback policies for the parameters from a small number of training examples, Deep FlexQP generalizes to problems with larger dimensions and can optimize for many more iterations than it was initially trained for. Our approach outperforms two recently proposed state-of-the-art accelerated QP approaches on a suite of benchmark systems including portfolio optimization, classification, and regression problems. We provide guarantees on the expected performance of our deep QP optimizer through probably approximately correct (PAC) Bayes generalization bounds. These certificates are used to design an accelerated sequential quadratic programming solver that solves nonlinear optimal control and predictive safety filter problems faster than traditional approaches. Overall, our approach is very robust and greatly outperforms existing non-learning and learning-based optimizers in terms of both runtime and convergence to the optimal solution across multiple classes of NLPs.
