Solver-Free Decision-Focused Learning for Linear Optimization Problems
Senne Berden, Ali İrfan Mahmutoğulları, Dimos Tsouros, Tias Guns
TL;DR
This paper tackles the computational bottleneck of decision-focused learning (DFL) in linear programs by proposing a solver-free training objective, lava, that avoids solving the predicted LP during training. Lava leverages LP geometry by precomputing adjacent vertices to the ground-truth optimum and enforcing that the predicted cost vector ranks the optimum above its neighbors through a convex, hinge-like loss. The approach dramatically reduces training time while maintaining competitive decision quality, especially on nondegenerate problems, and scales favorably with problem size. It also provides a practical framework for efficient DFL in real-world linear optimization settings, with a clear path to extending to ILPs/MILPs via LP relaxations.
Abstract
Mathematical optimization is a fundamental tool for decision-making in a wide range of applications. However, in many real-world scenarios, the parameters of the optimization problem are not known a priori and must be predicted from contextual features. This gives rise to predict-then-optimize problems, where a machine learning model predicts problem parameters that are then used to make decisions via optimization. A growing body of work on decision-focused learning (DFL) addresses this setting by training models specifically to produce predictions that maximize downstream decision quality, rather than accuracy. While effective, DFL is computationally expensive, because it requires solving the optimization problem with the predicted parameters at each loss evaluation. In this work, we address this computational bottleneck for linear optimization problems, a common class of problems in both DFL literature and real-world applications. We propose a solver-free training method that exploits the geometric structure of linear optimization to enable efficient training with minimal degradation in solution quality. Our method is based on the insight that a solution is optimal if and only if it achieves an objective value that is at least as good as that of its adjacent vertices on the feasible polytope. Building on this, our method compares the estimated quality of the ground-truth optimal solution with that of its precomputed adjacent vertices, and uses this as loss function. Experiments demonstrate that our method significantly reduces computational cost while maintaining high decision quality.
