Service Rate Control in Queues with Abandonments
Runhua Wu, Hayriye Ayhan
TL;DR
This paper develops a continuous-time Markov decision process model for a single-server queue with customer abandonments and two reward-timing variants (reward at arrival or at service completion). It establishes that, for infinite buffer capacity, the optimal service-rate policy is monotone in the number of customers and can be found by minimizing over the lower boundary of the convex hull of the action space, enabling an efficient policy-iteration algorithm; the policy converges as the state grows, allowing state-space truncation for computation. For finite capacity, monotonicity can fail, with optimal service rates rising then falling in the state, though the long-run average problem exhibits analogous structural properties. The paper also shows equivalence between reward-timing schemes under appropriate cost transformations and provides computational methods and results validating the theory on truncated-state models. These results offer practical guidance for rate-control in queues with abandonments, with implications for operations in settings like call centers and healthcare.
Abstract
We consider a Markovian single server queue with impatient customers. There is a customer abandonment cost and a holding cost for customers in the system. We consider two versions of the problem. In the first version, customers pay a reward at the time of arrival whereas in the second version, reward is received at the time of service completion. Service rate attains values in a compact set and there is a cost associated with each service rate. Under these assumptions, our objective is to characterize the service rate policy that maximizes the infinite-horizon discounted reward and the long-run average reward. We show that for systems with an infinite buffer, the optimal service rate policy is monotone. However, the optimal policy is not necessarily monotone when capacity is finite. Furthermore, we prove that the set of possible optimal actions can be reduced to the lower boundary of the convex hull of the action space and develop an efficient policy iteration algorithm. Finally, we show that the optimal service rate converges as the state goes to infinity which allows us to truncate the state space to numerically compute the optimal service rate when system has infinite buffer space.
