Optimizing Job Allocation using Reinforcement Learning with Graph Neural Networks
Lars C. P. M. Quaedvlieg
TL;DR
The paper addresses the Job Allocation Problem (JAP), which seeks to maximize the number of feasible job-to-resource assignments |A| under selection and conflict constraints on a graph G(P ∪ J, S ∪ C). It formulates JAP as a Markov Decision Process and applies a Graph Neural Network with a Context-Aware Embedding (CAE) module to approximate Q-values for edge selections, enabling policy learning without labeled data. Training employs Double Deep Q-Learning with Prioritized Experience Replay to improve stability and sample efficiency. Empirical results on real-world data (Planny) and synthetic graphs (Erdős–Rényi, Barabási–Albert) show the GNN-based RL method outperforms baselines and generalizes to out-of-distribution instances, highlighting the practical potential of RL+GNN for complex scheduling problems. The approach demonstrates how graph-structured representations and edge-level decision modeling can yield scalable, adaptable solutions for resource allocation tasks.
Abstract
Efficient job allocation in complex scheduling problems poses significant challenges in real-world applications. In this report, we propose a novel approach that leverages the power of Reinforcement Learning (RL) and Graph Neural Networks (GNNs) to tackle the Job Allocation Problem (JAP). The JAP involves allocating a maximum set of jobs to available resources while considering several constraints. Our approach enables learning of adaptive policies through trial-and-error interactions with the environment while exploiting the graph-structured data of the problem. By leveraging RL, we eliminate the need for manual annotation, a major bottleneck in supervised learning approaches. Experimental evaluations on synthetic and real-world data demonstrate the effectiveness and generalizability of our proposed approach, outperforming baseline algorithms and showcasing its potential for optimizing job allocation in complex scheduling problems.
