Towards Foundation Models for Mixed Integer Linear Programming
Sirui Li, Janardhan Kulkarni, Ishai Menache, Cathy Wu, Beibin Li
TL;DR
The paper addresses the lack of generalizable, foundation-model-style learning for MILP by introducing MILP-Evolve, a large-scale data-generation pipeline that creates diverse MILP classes using LLMs. It trains a single model per learning task (integrality-gap prediction, learning-to-branch, and Language-MILP contrastive learning) on this data, with a GNN-based encoder and attention to capture global structure, and demonstrates strong out-of-domain generalization and transfer to MIPLIB. Key findings show that diversity in training data greatly boosts generalization, and pretraining on MILP-Evolve accelerates convergence and performance on heterogeneous MILP benchmarks. The work lays groundwork for broadly applicable MILP foundation models and provides public code and data to spur further development in multi-class MILP learning and NLP-style alignment for optimization problems.
Abstract
Mixed Integer Linear Programming (MILP) is essential for modeling complex decision-making problems but faces challenges in computational tractability and requires expert formulation. Current deep learning approaches for MILP focus on specific problem classes and do not generalize to unseen classes. To address this shortcoming, we take a foundation model training approach, where we train a single deep learning model on a diverse set of MILP problems to generalize across problem classes. As existing datasets for MILP lack diversity and volume, we introduce MILP-Evolve, a novel LLM-based evolutionary framework that is capable of generating a large set of diverse MILP classes with an unlimited amount of instances. We study our methodology on three key learning tasks that capture diverse aspects of MILP: (1) integrality gap prediction, (2) learning to branch, and (3) a new task of aligning MILP instances with natural language descriptions. Our empirical results show that models trained on the data generated by MILP-Evolve achieve significant improvements on unseen problems, including MIPLIB benchmarks. Our work highlights the potential of moving towards a foundation model approach for MILP that can generalize to a broad range of MILP applications. Our code and data are publicly available at https://github.com/microsoft/OptiGuide.
