Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems

Baris Ata; Ebru Kasikaralar

Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems

Baris Ata, Ebru Kasikaralar

TL;DR

This paper develops a diffusion-approximation framework for dynamic, multiclass queueing with nonstationary data in the Halfin-Whitt regime and solves the resulting high-dimensional Brownian control problem using a novel deep splitting neural-network method. By translating the HJB PDE into a sequence of tractable subproblems solved backward in time with neural approximations of the value function and its gradient, the authors produce a data-driven scheduling policy that performs competitively with or better than state-of-the-art benchmarks, even in problems with up to 500 classes. The methodology is validated on13 data-driven test problems derived from US Bank call-center data, showcasing the approach’s scalability, computational feasibility, and practical impact for large-scale call centers. The work contributes a computationally feasible pathway to diffusion-controlled policies in high dimensions, bridging stochastic control theory, deep learning, and real-world operations management.

Abstract

We consider a multi-class queueing model of a telephone call center, in which a system manager dynamically allocates available servers to customer calls. Calls can terminate through either service completion or customer abandonment, and the manager strives to minimize the expected total of holding costs plus abandonment costs over a finite horizon. Focusing on the Halfin-Whitt heavy traffic regime, we derive an approximating diffusion control problem, and building on earlier work by Beck et al. (2021), develop a simulation-based computational method for solution of such problems, one that relies heavily on deep neural network technology. Using this computational method, we propose a policy for the original (pre-limit) call center scheduling problem. Finally, the performance of this policy is assessed using test problems based on publicly available call center data. For the test problems considered so far, our policy does as well as or better than the best benchmark we could find. Moreover, our method is computationally feasible at least up to dimension 500, that is, for call centers with 500 or more distinct customer classes.

Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems

TL;DR

Abstract

Paper Structure (71 sections, 121 equations, 103 figures, 40 tables, 3 algorithms)

This paper contains 71 sections, 121 equations, 103 figures, 40 tables, 3 algorithms.

Introduction
Model
A Brownian control problem in the Halfin-Whitt regime
HJB equation
Computational method
Data, test problems, and benchmark policies
Data
Main test problem and its variants
Low dimensional test problems
High dimensional test problems that have pathwise optimal policies
High dimensional test problems that do not admit a pathwise optimal policy
Benchmark policies
Benchmark policies for the main test problem and its variants
Computational results
Results for the low dimensional test problems
...and 56 more sections

Figures (103)

Figure 1: Hourly arrival rate of callers on weekdays during May - July 2003. The resolution of the horizontal axis is five minutes. That is, arrival rates are calculated over five-minute intervals. The solid line shows the hourly average arrival rate across all days. The two dashed lines that enclose it are computed by taking the average plus/minus 2 times the standard deviation of the rates for each five-minute interval across all days.
Figure 2: A non-stationary system model.
Figure 3: The number of agents (after the aforementioned service capacity adjustment) on weekdays during May-July 2003. The resolution of the horizontal axis is five minutes. The solid line depicts the average staffing level observed throughout all days. The two dashed lines that enclose it are derived by calculating the average staffing level plus or minus twice the standard deviation for each five-minute interval across all days.
Figure 4: The average traffic intensity observed throughout the day. The resolution of the horizontal axis is five minutes. That is, traffic intensity is calculated over five-minute intervals.
Figure 5: Hourly accumulation of the system cost.
...and 98 more figures

Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems

TL;DR

Abstract

Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems

Authors

TL;DR

Abstract

Table of Contents

Figures (103)