Table of Contents
Fetching ...

Learning to Solve Resource-Constrained Project Scheduling Problems with Duration Uncertainty using Graph Neural Networks

Guillaume Infantes, Stéphanie Roussel, Antoine Jacquet, Emmanuel Benazera

TL;DR

The paper tackles solving the Resource-Constrained Project Scheduling Problem (RCPSP) under duration uncertainty by proposing a proactive offline approach that yields reusable baselines. It models uncertainty as an MDP over AON-flow graphs and learns a scheduling policy with a Graph Neural Network (GNN) driven by Deep Reinforcement Learning, specifically using PPO, within a framework called Wheatley. Key contributions include (i) a formal MDP formulation for RCPSP with uncertainty, (ii) a GNN-based agent with graph rewiring, resource-aware representations, and a virtual pooling node, and (iii) extensive experiments on PSPLib benchmarks showing strong performance and generalization, with Wheatley made publicly available. The work demonstrates that a learned, uncertainty-aware priority rule can outperform traditional Priority Dispatch Rules and scale to unseen datasets, offering a robust, reusable baseline for industrial scheduling under variability.

Abstract

The Resource-Constrained Project Scheduling Problem (RCPSP) is a classical scheduling problem that has received significant attention due to of its numerous applications in industry. However, in practice, task durations are subject to uncertainty that must be considered in order to propose resilient scheduling. In this paper, we address the RCPSP variant with uncertain tasks duration (modeled using known probabilities) and aim to minimize the overall expected project duration. Our objective is to produce a baseline schedule that can be reused multiple times in an industrial setting regardless of the actual duration scenario. We leverage Graph Neural Networks in conjunction with Deep Reinforcement Learning (DRL) to develop an effective policy for task scheduling. This policy operates similarly to a priority dispatch rule and is paired with a Serial Schedule Generation Scheme to produce a schedule. Our empirical evaluation on standard benchmarks demonstrates the approach's superiority in terms of performance and its ability to generalize. The developed framework, Wheatley, is made publicly available online to facilitate further research and reproducibility.

Learning to Solve Resource-Constrained Project Scheduling Problems with Duration Uncertainty using Graph Neural Networks

TL;DR

The paper tackles solving the Resource-Constrained Project Scheduling Problem (RCPSP) under duration uncertainty by proposing a proactive offline approach that yields reusable baselines. It models uncertainty as an MDP over AON-flow graphs and learns a scheduling policy with a Graph Neural Network (GNN) driven by Deep Reinforcement Learning, specifically using PPO, within a framework called Wheatley. Key contributions include (i) a formal MDP formulation for RCPSP with uncertainty, (ii) a GNN-based agent with graph rewiring, resource-aware representations, and a virtual pooling node, and (iii) extensive experiments on PSPLib benchmarks showing strong performance and generalization, with Wheatley made publicly available. The work demonstrates that a learned, uncertainty-aware priority rule can outperform traditional Priority Dispatch Rules and scale to unseen datasets, offering a robust, reusable baseline for industrial scheduling under variability.

Abstract

The Resource-Constrained Project Scheduling Problem (RCPSP) is a classical scheduling problem that has received significant attention due to of its numerous applications in industry. However, in practice, task durations are subject to uncertainty that must be considered in order to propose resilient scheduling. In this paper, we address the RCPSP variant with uncertain tasks duration (modeled using known probabilities) and aim to minimize the overall expected project duration. Our objective is to produce a baseline schedule that can be reused multiple times in an industrial setting regardless of the actual duration scenario. We leverage Graph Neural Networks in conjunction with Deep Reinforcement Learning (DRL) to develop an effective policy for task scheduling. This policy operates similarly to a priority dispatch rule and is paired with a Serial Schedule Generation Scheme to produce a schedule. Our empirical evaluation on standard benchmarks demonstrates the approach's superiority in terms of performance and its ability to generalize. The developed framework, Wheatley, is made publicly available online to facilitate further research and reproducibility.

Paper Structure

This paper contains 33 sections, 4 figures, 2 tables, 1 algorithm.

Figures (4)

  • Figure 1: Example tasks features (precedences are black arrows of the graph, duration and resource consumption in the table) and schedule $\sigma$ in the table. Flow arcs mentioned in Example \ref{['ex:flows']} are depicted in dotted red.
  • Figure 2: Scheduled tasks (light green), flows (dotted red) and open flows (dashed blue) for states $s_i$ and $s_{i+1}$ described in Example \ref{['ex:transition']}
  • Figure 3: General Architecture - PPO's inputs and outputs are respectively surrounded by green and blue dotted ellipses.
  • Figure 4: Rewiring of the graph presented in Figure \ref{['fig:AON-flow_insertion']}. Nodes $r$ and $v$ represent the resource node for $r$ and the pooling node. Back arcs of flows are in dotted light red, arcs to and from resource $r$ are in dashdotted green, and arcs to the pooling node are in dashdotted cyan for activity nodes, dotted purple for resources nodes.

Theorems & Definitions (4)

  • Example 1
  • Example 2
  • Example 3
  • Example 4