OptiMUS: Scalable Optimization Modeling with (MI)LP Solvers and Large Language Models
Ali AhmadiTeshnizi, Wenzhi Gao, Madeleine Udell
TL;DR
OptiMUS tackles the automation gap in optimization modeling by deploying a modular, multi-agent LLM system that translates natural-language descriptions into structured MILP/LP formulations, generates and debugs solver code, and validates results with data-driven evaluation. A central connection graph keeps context and dependencies local, enabling scalable processing of long texts and large data files. The approach yields state-of-the-art performance on existing benchmarks and on the new NLP4LP dataset, with substantial gains compared to baselines and detailed analyses of ablations, data handling, and failure modes. By coupling LLM-driven modeling with traditional solvers (e.g., $\text{MILP}$/LP) and releasing NLP4LP, OptiMUS points toward scalable, accessible optimization tooling that can benefit industry sectors with limited access to optimization expertise.
Abstract
Optimization problems are pervasive in sectors from manufacturing and distribution to healthcare. However, most such problems are still solved heuristically by hand rather than optimally by state-of-the-art solvers because the expertise required to formulate and solve these problems limits the widespread adoption of optimization tools and techniques. This paper introduces OptiMUS, a Large Language Model (LLM)-based agent designed to formulate and solve (mixed integer) linear programming problems from their natural language descriptions. OptiMUS can develop mathematical models, write and debug solver code, evaluate the generated solutions, and improve its model and code based on these evaluations. OptiMUS utilizes a modular structure to process problems, allowing it to handle problems with long descriptions and complex data without long prompts. Experiments demonstrate that OptiMUS outperforms existing state-of-the-art methods on easy datasets by more than $20\%$ and on hard datasets (including a new dataset, NLP4LP, released with this paper that features long and complex problems) by more than $30\%$.
