Inverse Optimization Without Inverse Optimization: Direct Solution Prediction with Transformer Models
Macarena Navarro, Willem-Jan van Hoeve, Karan Singh
TL;DR
The paper tackles learning solutions to discrete optimization problems with unknown objectives or constraints by replacing traditional inverse optimization with a structured-prediction approach that uses transformer-based sequence-to-sequence models. A DFA-based constraint reasoning module masks the decoder to ensure feasibility, enabling end-to-end learning of latent objective and constraint structure from data. Across knapsack, bipartite matching, and single-machine scheduling, the transformer framework consistently achieves high-quality, feasible solutions with far faster inference than IO and outperforming LSTM baselines, even under data corruption and varying training sizes. The approach offers a scalable, robust alternative for decision problems where exact objective forms or implicit constraints are unknown, relying on monotone constraint systems and rich historical data to capture complex latent structure.
Abstract
We present an end-to-end framework for generating solutions to combinatorial optimization problems with unknown components using transformer-based sequence-to-sequence neural networks. Our framework learns directly from past solutions and incorporates the known components, such as hard constraints, via a constraint reasoning module, yielding a constrained learning scheme. The trained model generates new solutions that are structurally similar to past solutions and are guaranteed to respect the known constraints. We apply our approach to three combinatorial optimization problems with unknown components: the knapsack problem with an unknown reward function, the bipartite matching problem with an unknown objective function, and the single-machine scheduling problem with release times and unknown precedence constraints, with the objective of minimizing average completion time. We demonstrate that transformer models have remarkably strong performance and often produce near-optimal solutions in a fraction of a second. They can be particularly effective in the presence of more complex underlying objective functions and unknown implicit constraints compared to an LSTM-based alternative and inverse optimization.
