Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization

Robbert Reijnen; Yaoxin Wu; Zaharah Bukhsh; Yingqian Zhang

Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization

Robbert Reijnen, Yaoxin Wu, Zaharah Bukhsh, Yingqian Zhang

TL;DR

This work introduces GS-MODAC, a Graph Neural Network-augmented DRL framework for dynamic algorithm configuration in multi-objective combinatorial optimization (MOCO). By representing the search state as a graph of normalized objective values and learning state embeddings with GNNs, GS-MODAC uses a PPO-based policy to adapt evolutionary algorithm parameters during search, aiming to improve convergence and Pareto diversity. The approach is instantiated on NSGA-II and MOPSO and evaluated on Flexible Job Shop Scheduling (FJSP) and Capacitated Vehicle Routing Problem (CVRP) across 2–5 objectives, demonstrating superior hypervolume performance and strong generalization to unseen sizes and more complex problem variants, with instance-agnostic rewards guiding learning. The results suggest GS-MODAC offers robust, scalable dynamic configuration for MOCO and generalizes across problem types, sizes, and objective configurations, potentially extending to a broad class of MOEAs.

Abstract

Deep reinforcement learning (DRL) has been widely used for dynamic algorithm configuration, particularly in evolutionary computation, which benefits from the adaptive update of parameters during the algorithmic execution. However, applying DRL to algorithm configuration for multi-objective combinatorial optimization (MOCO) problems remains relatively unexplored. This paper presents a novel graph neural network (GNN) based DRL to configure multi-objective evolutionary algorithms. We model the dynamic algorithm configuration as a Markov decision process, representing the convergence of solutions in the objective space by a graph, with their embeddings learned by a GNN to enhance the state representation. Experiments on diverse MOCO challenges indicate that our method outperforms traditional and DRL-based algorithm configuration methods in terms of efficacy and adaptability. It also exhibits advantageous generalizability across objective types and problem sizes, and applicability to different evolutionary computation methods.

Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization

TL;DR

Abstract

Graph-Supported Dynamic Algorithm Configuration for Multi-Objective Combinatorial Optimization

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)