Principled Data Augmentation for Learning to Solve Quadratic Programming Problems
Chendi Qian, Christopher Morris
TL;DR
This work tackles data scarcity in learning-to-optimize for linear and quadratic programs by introducing principled data augmentations that preserve optimality. It formulates optimization-aware affine transformations of the KKT system to generate diverse yet solvable LCQP instances and integrates them into supervised and contrastive learning pipelines for MPNNs. Empirical results show consistent improvements in data-scarce regimes, strong transfer to out-of-distribution and larger problems, and competitive or superior performance compared with standard graph augmentation baselines. The approach enables more robust, data-efficient neural solvers for convex optimization with practical impact on resource allocation, logistics, and model training pipelines that rely on LP/QP solutions.
Abstract
Linear and quadratic optimization are crucial in numerous real-world applications, ranging from training machine learning models to solving integer linear programs. Recently, learning-to-optimize methods (L2O) for linear (LPs) or quadratic programs (QPs) using message-passing graph neural networks (MPNNs) have gained traction, promising lightweight, data-driven proxies for solving such optimization problems. For example, they replace the costly computation of strong branching scores in branch-and-bound solvers, thereby reducing the need to solve many such optimization problems. However, robust L2O MPNNs remain challenging in data-scarce settings, especially when addressing complex optimization problems such as QPs. This work introduces a principled approach to data augmentation tailored for QPs via MPNNs. Our method leverages theoretically justified data augmentation techniques to generate diverse yet optimality-preserving instances. Furthermore, we integrate these augmentations into a self-supervised contrastive learning framework, thereby pretraining MPNNs for improved performance on L2O tasks. Extensive experiments demonstrate that our approach improves generalization in supervised scenarios and facilitates effective transfer learning to related optimization problems.
