Learning to Solve Job Shop Scheduling under Uncertainty
Guillaume Infantes, Stéphanie Roussel, Pierre Pereira, Antoine Jacquet, Emmanuel Benazera
TL;DR
This work addresses the Job-Shop Scheduling Problem under duration uncertainty with the objective of minimizing the average makespan $\mathbb{E}[C_{max}]$. It introduces Wheatley, a Graph Neural Network–driven DRL framework that solves the uncertainty-aware JSSP by modeling it as an MDP, using PPO for policy optimization and a graph rewiring strategy to enable rich information flow. Key contributions include empirical improvements in DRL/JSSP generalization and robustness to duration uncertainty, a scalable GNN architecture with uncertainty handling, and strong performance on Taillard benchmarks, especially in stochastic settings. The method yields a flexible, open-source solution capable of robustly scheduling under uncertainty and generalizing to larger problem sizes, with practical impact for industrial scheduling under real-world variability.
Abstract
Job-Shop Scheduling Problem (JSSP) is a combinatorial optimization problem where tasks need to be scheduled on machines in order to minimize criteria such as makespan or delay. To address more realistic scenarios, we associate a probability distribution with the duration of each task. Our objective is to generate a robust schedule, i.e. that minimizes the average makespan. This paper introduces a new approach that leverages Deep Reinforcement Learning (DRL) techniques to search for robust solutions, emphasizing JSSPs with uncertain durations. Key contributions of this research include: (1) advancements in DRL applications to JSSPs, enhancing generalization and scalability, (2) a novel method for addressing JSSPs with uncertain durations. The Wheatley approach, which integrates Graph Neural Networks (GNNs) and DRL, is made publicly available for further research and applications.
