PDHG-Unrolled Learning-to-Optimize Method for Large-Scale Linear Programming
Bingheng Li, Linxin Yang, Yupeng Chen, Senmiao Wang, Qian Chen, Haitao Mao, Yao Ma, Akang Wang, Tian Ding, Jiliang Tang, Ruoyu Sun
TL;DR
This work tackles accelerating large-scale linear programming by unrolling the Primal-Dual Hybrid Gradient (PDHG) method into a neural network, PDHG-Net, and pairing it with a two-stage L2O framework that warm-starts a PDLP solver for refinement. PDHG-Net uses channel expansion and ReLU-based proximal substitutes to replicate PDHG updates while remaining size-generalizable, with theoretical results showing exact alignment to the PDHG iterations and an ε-approximation guarantee using O(1/ε) neurons. Empirically, the approach achieves up to a 3x speedup on large LPs (e.g., PageRank-scale problems) and provides robust improvements on challenging LP relaxations (IP/WA), aided by a warm-start PDLP refinement that accelerates convergence. The findings highlight the potential of combining unrolled optimization nets with classical solvers to scale LP solving and suggest avenues for applying similar L2O strategies to broader optimization problems such as MIP.
Abstract
Solving large-scale linear programming (LP) problems is an important task in various areas such as communication networks, power systems, finance and logistics. Recently, two distinct approaches have emerged to expedite LP solving: (i) First-order methods (FOMs); (ii) Learning to optimize (L2O). In this work, we propose an FOM-unrolled neural network (NN) called PDHG-Net, and propose a two-stage L2O method to solve large-scale LP problems. The new architecture PDHG-Net is designed by unrolling the recently emerged PDHG method into a neural network, combined with channel-expansion techniques borrowed from graph neural networks. We prove that the proposed PDHG-Net can recover PDHG algorithm, thus can approximate optimal solutions of LP instances with a polynomial number of neurons. We propose a two-stage inference approach: first use PDHG-Net to generate an approximate solution, and then apply PDHG algorithm to further improve the solution. Experiments show that our approach can significantly accelerate LP solving, achieving up to a 3$\times$ speedup compared to FOMs for large-scale LP problems.
