Table of Contents
Fetching ...

Learning to Optimize for Mixed-Integer Non-linear Programming with Feasibility Guarantees

Bo Tang, Elias B. Khalil, Ján Drgoňa

TL;DR

The paper addresses scalable solution of parametric MINLPs by introducing a solver-free L2O framework that directly predicts mixed-integer decisions. It introduces differentiable integer-correction layers and an inference-time integer feasibility projection, with theoretical convergence guarantees for the projection under mild regularity. Empirically, the approach scales to tens of thousands of variables and delivers high-quality, feasible solutions in milliseconds, often outperforming traditional solvers and heuristics in repeated-solve settings. Ablations and analyses demonstrate the necessity of the correction layers and the projection, and show favorable training times, highlighting the method's practicality for real-time or large-scale deployment.

Abstract

Mixed-integer nonlinear programs (MINLPs) arise in domains such as energy systems, process engineering, and transportation, and are notoriously difficult to solve at scale due to the interplay of discrete decisions and nonlinear constraints. In many practical settings, these problems appear in parametric form, where objectives and constraints depend on instance-specific parameters, creating the need for fast and reliable solutions across related instances. While learning-to-optimize (L2O) methods have shown strong performance in continuous optimization, extending them to MINLPs requires enforcing both feasibility and integrality within a data-driven framework. We propose an L2O approach tailored to parametric MINLPs that generates instance-specific solutions using integer correction layers to enforce integrality and a gradient-based projection to ensure feasibility of the inequality constraints. Theoretically, we provide asymptotic and non-asymptotic convergence guarantees of the projection step. Empirically, the framework scales to MINLPs with tens of thousands of variables and produces feasible high-quality solutions within milliseconds, often outperforming traditional solvers and heuristic baselines in repeated-solve settings.

Learning to Optimize for Mixed-Integer Non-linear Programming with Feasibility Guarantees

TL;DR

The paper addresses scalable solution of parametric MINLPs by introducing a solver-free L2O framework that directly predicts mixed-integer decisions. It introduces differentiable integer-correction layers and an inference-time integer feasibility projection, with theoretical convergence guarantees for the projection under mild regularity. Empirically, the approach scales to tens of thousands of variables and delivers high-quality, feasible solutions in milliseconds, often outperforming traditional solvers and heuristics in repeated-solve settings. Ablations and analyses demonstrate the necessity of the correction layers and the projection, and show favorable training times, highlighting the method's practicality for real-time or large-scale deployment.

Abstract

Mixed-integer nonlinear programs (MINLPs) arise in domains such as energy systems, process engineering, and transportation, and are notoriously difficult to solve at scale due to the interplay of discrete decisions and nonlinear constraints. In many practical settings, these problems appear in parametric form, where objectives and constraints depend on instance-specific parameters, creating the need for fast and reliable solutions across related instances. While learning-to-optimize (L2O) methods have shown strong performance in continuous optimization, extending them to MINLPs requires enforcing both feasibility and integrality within a data-driven framework. We propose an L2O approach tailored to parametric MINLPs that generates instance-specific solutions using integer correction layers to enforce integrality and a gradient-based projection to ensure feasibility of the inequality constraints. Theoretically, we provide asymptotic and non-asymptotic convergence guarantees of the projection step. Empirically, the framework scales to MINLPs with tens of thousands of variables and produces feasible high-quality solutions within milliseconds, often outperforming traditional solvers and heuristic baselines in repeated-solve settings.

Paper Structure

This paper contains 81 sections, 6 theorems, 16 equations, 8 figures, 15 tables, 3 algorithms.

Key Result

Theorem 1

Under Assumptions assump:regularity and assump:bounded_constraints, gradient descent applied to $\mathcal{V}({\bm{x}}) = \|{\bm g}(\varphi({\bm{x}}))_+\|_1$ with step size $\eta \in (0, 1/L]$, where $L = \bar{n}_c(G_{\bm g} L_\varphi + G_\varphi L_{\bm g})$, satisfies the following properties:

Figures (8)

  • Figure 1: Overview of the proposed solver-free L2O pipeline for parametric MINLPs. Given parameters $\bm{\xi}$, the model generates a relaxed solution, applies integer correction, and refines feasibility through an iterative projection step.
  • Figure 2: Two integer correction layers: (a) Rounding Classification and (b) Learning Threshold, both using STE for backpropagation.
  • Figure 3: Illustration of objective value evolution for a $100 \times 100$ Integer Quadratic and $60 \times 4$ Mixed-Integer Rosenbrock over 600 seconds.
  • Figure 4: Example of the relaxed solutions $\bar{x}, \bar{y}$ and the mixed-integer solutions $\hat{x}, \hat{y}$ across different epochs of training for the same sample instance.
  • Figure 5: Example of the relaxed solutions $\bar{x}, \bar{y}$ and the mixed-integer solutions $\hat{x}, \hat{y}$ across iterations of feasibility projection to refine an infeasible solution.
  • ...and 3 more figures

Theorems & Definitions (20)

  • Remark 1
  • Remark 2: Feasibility boundary
  • Theorem 1: Asymptotic Convergence of Integer Feasibility Projection
  • proof
  • Theorem 2: Non-Asymptotic Convergence of Integer Feasibility Projection
  • proof
  • Remark 3
  • Corollary 3: Iteration Complexity for Approximate Feasibility
  • Remark 4: On Regularity and Practical Tightness of Lipschitz Assumptions
  • Remark 5: Robustness
  • ...and 10 more