Table of Contents
Fetching ...

Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems

Baris Ata, Ebru Kasikaralar

TL;DR

This paper develops a diffusion-approximation framework for dynamic, multiclass queueing with nonstationary data in the Halfin-Whitt regime and solves the resulting high-dimensional Brownian control problem using a novel deep splitting neural-network method. By translating the HJB PDE into a sequence of tractable subproblems solved backward in time with neural approximations of the value function and its gradient, the authors produce a data-driven scheduling policy that performs competitively with or better than state-of-the-art benchmarks, even in problems with up to 500 classes. The methodology is validated on13 data-driven test problems derived from US Bank call-center data, showcasing the approach’s scalability, computational feasibility, and practical impact for large-scale call centers. The work contributes a computationally feasible pathway to diffusion-controlled policies in high dimensions, bridging stochastic control theory, deep learning, and real-world operations management.

Abstract

We consider a multi-class queueing model of a telephone call center, in which a system manager dynamically allocates available servers to customer calls. Calls can terminate through either service completion or customer abandonment, and the manager strives to minimize the expected total of holding costs plus abandonment costs over a finite horizon. Focusing on the Halfin-Whitt heavy traffic regime, we derive an approximating diffusion control problem, and building on earlier work by Beck et al. (2021), develop a simulation-based computational method for solution of such problems, one that relies heavily on deep neural network technology. Using this computational method, we propose a policy for the original (pre-limit) call center scheduling problem. Finally, the performance of this policy is assessed using test problems based on publicly available call center data. For the test problems considered so far, our policy does as well as or better than the best benchmark we could find. Moreover, our method is computationally feasible at least up to dimension 500, that is, for call centers with 500 or more distinct customer classes.

Dynamic Scheduling of a Multiclass Queue in the Halfin-Whitt Regime: A Computational Approach for High-Dimensional Problems

TL;DR

This paper develops a diffusion-approximation framework for dynamic, multiclass queueing with nonstationary data in the Halfin-Whitt regime and solves the resulting high-dimensional Brownian control problem using a novel deep splitting neural-network method. By translating the HJB PDE into a sequence of tractable subproblems solved backward in time with neural approximations of the value function and its gradient, the authors produce a data-driven scheduling policy that performs competitively with or better than state-of-the-art benchmarks, even in problems with up to 500 classes. The methodology is validated on13 data-driven test problems derived from US Bank call-center data, showcasing the approach’s scalability, computational feasibility, and practical impact for large-scale call centers. The work contributes a computationally feasible pathway to diffusion-controlled policies in high dimensions, bridging stochastic control theory, deep learning, and real-world operations management.

Abstract

We consider a multi-class queueing model of a telephone call center, in which a system manager dynamically allocates available servers to customer calls. Calls can terminate through either service completion or customer abandonment, and the manager strives to minimize the expected total of holding costs plus abandonment costs over a finite horizon. Focusing on the Halfin-Whitt heavy traffic regime, we derive an approximating diffusion control problem, and building on earlier work by Beck et al. (2021), develop a simulation-based computational method for solution of such problems, one that relies heavily on deep neural network technology. Using this computational method, we propose a policy for the original (pre-limit) call center scheduling problem. Finally, the performance of this policy is assessed using test problems based on publicly available call center data. For the test problems considered so far, our policy does as well as or better than the best benchmark we could find. Moreover, our method is computationally feasible at least up to dimension 500, that is, for call centers with 500 or more distinct customer classes.
Paper Structure (71 sections, 121 equations, 103 figures, 40 tables, 3 algorithms)

This paper contains 71 sections, 121 equations, 103 figures, 40 tables, 3 algorithms.

Figures (103)

  • Figure 1: Hourly arrival rate of callers on weekdays during May - July 2003. The resolution of the horizontal axis is five minutes. That is, arrival rates are calculated over five-minute intervals. The solid line shows the hourly average arrival rate across all days. The two dashed lines that enclose it are computed by taking the average plus/minus 2 times the standard deviation of the rates for each five-minute interval across all days.
  • Figure 2: A non-stationary system model.
  • Figure 3: The number of agents (after the aforementioned service capacity adjustment) on weekdays during May-July 2003. The resolution of the horizontal axis is five minutes. The solid line depicts the average staffing level observed throughout all days. The two dashed lines that enclose it are derived by calculating the average staffing level plus or minus twice the standard deviation for each five-minute interval across all days.
  • Figure 4: The average traffic intensity observed throughout the day. The resolution of the horizontal axis is five minutes. That is, traffic intensity is calculated over five-minute intervals.
  • Figure 5: Hourly accumulation of the system cost.
  • ...and 98 more figures