Table of Contents
Fetching ...

Geometric Algorithms for Neural Combinatorial Optimization with Constraints

Nikolaos Karalias, Akbar Rafiey, Yifei Xu, Zhishang Luo, Behrooz Tahmasebi, Connie Jiang, Stefanie Jegelka

TL;DR

This paper tackles self-supervised learning for combinatorial optimization with discrete constraints by introducing an end-to-end differentiable framework that leverages Carathéodory-based polyhedral decomposition. The neural network outputs a point in the convex hull of feasible solutions, which is then decomposed into a sparsely supported distribution over feasible corner sets, enabling an exact, differentiable extension loss $F(\boldsymbol{x}) = \mathbb{E}_{S \sim \mathcal{D}_{\mathcal{C}}(\boldsymbol{x})}[f(S)]$ and a rounding strategy with guarantees. The approach is instantiated via a GLS-type decomposition for several constraint families (cardinality, partition matroids, spanning trees, independent sets) and shown to perform well on large-scale cardinality-constrained problems, with broad applicability to other CO tasks. Empirical results on Maximum Coverage demonstrate competitive or superior performance against neural and traditional baselines, while ablations highlight the importance of the decomposition design and inference strategies. Overall, the work provides a principled integration of convex geometry and neural solvers, offering scalable, differentiable handling of discrete constraints with rigorous rounding guarantees.

Abstract

Self-Supervised Learning (SSL) for Combinatorial Optimization (CO) is an emerging paradigm for solving combinatorial problems using neural networks. In this paper, we address a central challenge of SSL for CO: solving problems with discrete constraints. We design an end-to-end differentiable framework that enables us to solve discrete constrained optimization problems with neural networks. Concretely, we leverage algorithmic techniques from the literature on convex geometry and Carathéodory's theorem to decompose neural network outputs into convex combinations of polytope corners that correspond to feasible sets. This decomposition-based approach enables self-supervised training but also ensures efficient quality-preserving rounding of the neural net output into feasible solutions. Extensive experiments in cardinality-constrained optimization show that our approach can consistently outperform neural baselines. We further provide worked-out examples of how our method can be applied beyond cardinality-constrained problems to a diverse set of combinatorial optimization tasks, including finding independent sets in graphs, and solving matroid-constrained problems.

Geometric Algorithms for Neural Combinatorial Optimization with Constraints

TL;DR

This paper tackles self-supervised learning for combinatorial optimization with discrete constraints by introducing an end-to-end differentiable framework that leverages Carathéodory-based polyhedral decomposition. The neural network outputs a point in the convex hull of feasible solutions, which is then decomposed into a sparsely supported distribution over feasible corner sets, enabling an exact, differentiable extension loss and a rounding strategy with guarantees. The approach is instantiated via a GLS-type decomposition for several constraint families (cardinality, partition matroids, spanning trees, independent sets) and shown to perform well on large-scale cardinality-constrained problems, with broad applicability to other CO tasks. Empirical results on Maximum Coverage demonstrate competitive or superior performance against neural and traditional baselines, while ablations highlight the importance of the decomposition design and inference strategies. Overall, the work provides a principled integration of convex geometry and neural solvers, offering scalable, differentiable handling of discrete constraints with rigorous rounding guarantees.

Abstract

Self-Supervised Learning (SSL) for Combinatorial Optimization (CO) is an emerging paradigm for solving combinatorial problems using neural networks. In this paper, we address a central challenge of SSL for CO: solving problems with discrete constraints. We design an end-to-end differentiable framework that enables us to solve discrete constrained optimization problems with neural networks. Concretely, we leverage algorithmic techniques from the literature on convex geometry and Carathéodory's theorem to decompose neural network outputs into convex combinations of polytope corners that correspond to feasible sets. This decomposition-based approach enables self-supervised training but also ensures efficient quality-preserving rounding of the neural net output into feasible solutions. Extensive experiments in cardinality-constrained optimization show that our approach can consistently outperform neural baselines. We further provide worked-out examples of how our method can be applied beyond cardinality-constrained problems to a diverse set of combinatorial optimization tasks, including finding independent sets in graphs, and solving matroid-constrained problems.

Paper Structure

This paper contains 48 sections, 13 theorems, 77 equations, 2 figures, 14 tables, 3 algorithms.

Key Result

Theorem 4.1

There exists a polynomial-time algorithm that for any well-described polytope $\mathcal{P}$ given by a strong optimization oracle, for any rational vector $\mathbf{x}$, finds vertex-probability pairs $\{p_{\mathbf{x}_t}(S_t), \mathbf{1}_{S_t}\}$ for $t=0,1, \dots, n-1$ such that $\mathbf{x}= \sum_{t

Figures (2)

  • Figure 1: Overview of our framework. During training (top), the model learns to output a point in the convex hull of feasible solutions. A self-supervised extension loss is computed by decomposing this point and evaluating the objective, which is then used to backpropagate. During inference (bottom), the model outputs a relaxed solution, which is decomposed and the best feasible set is selected with a rounding guarantee.
  • Figure : General decomposition algorithm

Theorems & Definitions (29)

  • Theorem 4.1
  • Proposition 4.2
  • Theorem 4.3
  • Proposition 4.4
  • Definition B.1: Defining properties of polyhedra
  • Definition B.2: Strong optimization problem
  • Theorem B.3
  • proof
  • Theorem B.3
  • proof
  • ...and 19 more