GOAL: A Generalist Combinatorial Optimization Agent Learner

Darko Drakulic; Sofia Michel; Jean-Marc Andreoli

GOAL: A Generalist Combinatorial Optimization Agent Learner

Darko Drakulic, Sofia Michel, Jean-Marc Andreoli

TL;DR

GOAL tackles the fragmentation of combinatorial optimization by introducing a single backbone with lightweight task adapters capable of solving diverse COPs via mixed-attention and a multi-type transformer. It frames COP as tail-recursive constructive MDPs and trains on eight problems through imitation, while enabling efficient fine-tuning to new tasks in supervised or unsupervised modes. The results show competitive performance with specialized baselines on training tasks, strong zero-shot generalization to larger instances, and meaningful gains from ablation studies, including codebook sparsification for robust fine-tuning. This work points toward a foundation-model-like direction for COP, enabling rapid adaptation to new problems with reduced data and computation, and highlighting interesting avenues for scalable attention and larger-scale COPs.

Abstract

Machine Learning-based heuristics have recently shown impressive performance in solving a variety of hard combinatorial optimization problems (COPs). However, they generally rely on a separate neural model, specialized and trained for each single problem. Any variation of a problem requires adjustment of its model and re-training from scratch. In this paper, we propose GOAL (for Generalist combinatorial Optimization Agent Learner), a generalist model capable of efficiently solving multiple COPs and which can be fine-tuned to solve new COPs. GOAL consists of a single backbone plus light-weight problem-specific adapters for input and output processing. The backbone is based on a new form of mixed-attention blocks which allows to handle problems defined on graphs with arbitrary combinations of node, edge and instance-level features. Additionally, problems which involve heterogeneous types of nodes or edges are handled through a novel multi-type transformer architecture, where the attention blocks are duplicated to attend the meaningful combinations of types while relying on the same shared parameters. We train GOAL on a set of routing, scheduling and classic graph problems and show that it is only slightly inferior to the specialized baselines while being the first multi-task model that solves a wide range of COPs. Finally we showcase the strong transfer learning capacity of GOAL by fine-tuning it on several new problems. Our code is available at https://github.com/naver/goal-co/.

GOAL: A Generalist Combinatorial Optimization Agent Learner

TL;DR

Abstract

Paper Structure (55 sections, 4 equations, 19 figures, 2 tables)

This paper contains 55 sections, 4 equations, 19 figures, 2 tables.

Introduction
Related Works
End-to-end heuristic learning for CO
Generalization of neural heuristics
Multi-task training for sequential decision making
Multi-Task Learning for Combinatorial Optimization
Constructive CO problem solving
Multi-Task CO Problems
Task Representation
Architecture of the Model
Multi-Type Transformer Architecture
Training Procedure
Experiments
Training tasks.
Training details.
...and 40 more sections

Figures (19)

Figure 1: GOAL - a single neural model to solve a variety of routing, scheduling, packing and graph problems.
Figure 2: Left: Architecture of GOAL for single-type problems; the green components are task-specific, while the others belong to the backbone. The codebook is a shared $\ell{\times}D$ (resp. $\bar{\ell}{\times}\bar{D}$) matrix (with $\ell{\ll}D,\bar{\ell}{\ll}\bar{D}$) applied to the node (resp. edge) representations produced by the input adapters. Right: Architecture of GOAL for multi-type problems (with here 2 node types T1,T2); it uses multiple mixed attention blocks ("MMA"), some in self- and some in cross-attention mode, but they all share the same parameters (per layer); so do the two Feed Forward blocks ("FF"). For the edges, all the type combinations are shown (here T11,T12,T21,T22) but those which have no meaning in a task are simply omitted. The same set of parameters is used for the two architectures.
Figure 3: Unsupervised fine-tuning of GOAL to eight diverse new tasks. We report the best results over 10 runs both for training from scratch and fine-tuning.
Figure 4: (left) Performance with different variants of mixed-attention. (middle) Performance of single- vs multi-type GOAL. (right) Fine-tuning of GOAL with and without the codebook.
Figure 5: Comparison of the (supervised) fine-tuning of the single- versus multi-task GOAL on four new problems.
...and 14 more figures

GOAL: A Generalist Combinatorial Optimization Agent Learner

TL;DR

Abstract

GOAL: A Generalist Combinatorial Optimization Agent Learner

Authors

TL;DR

Abstract

Table of Contents

Figures (19)