Learning-Based Hierarchical Approach for Fast Mixed-Integer Optimization
Stefan Clarke, Bartolomeo Stellato
TL;DR
This work introduces a learning-based hierarchical approach to accelerate structured MIPs by decomposing a large problem into an upper-level and a lower-level optimization, with the upper-level decisions serving as parameters for the lower-level constraints. The authors formulate a convex, decision-focused training objective for predicting the upper-level cost vector, and develop multiple differentiable convex surrogates (including GSPO+, ASL, Z, and Fenchel-Young variants) to enable tractable learning of the optimizer. A key robustness component uses conformal prediction to provide probabilistic bounds on suboptimality, calibrated on separate data, offering online guarantees for the quality of predicted solutions. Through experiments on hierarchical knapsack, capacitated facility location, and multi-agent routing problems, the method achieves substantial speedups over state-of-the-art solvers while maintaining feasible and high-quality solutions, with conformal bounds offering reliable risk control. Together, these contributions advance practical, scalable optimization in data-rich settings where rapid reoptimization is essential.
Abstract
We propose a hierarchical architecture for efficiently computing high-quality solutions to structured mixed-integer programs (MIPs). To reduce computational effort, our approach decouples the original problem into a higher level problem and a lower level problem, both of smaller size. We solve both problems sequentially, where decisions of the higher level problem become parameters of the constraints of the lower level problem. We formulate this learning task as a convex optimization problem using decision-focused learning techniques and solve it by differentiating through the higher and the lower level problems in our architecture. To ensure robustness, we derive out-of-sample performance guarantees using conformal prediction. Numerical experiments in facility location, knapsack problems, and vehicle routing problems demonstrate that our approach significantly reduces computation time while maintaining feasibility and high solution quality compared to state-of-the-art solvers.
