Strongly Tail-Optimal Scheduling in the Light-Tailed M/G/1
George Yu, Ziv Scully
TL;DR
This paper addresses minimizing the tail of the response time in an M/G/1 queue with light-tailed job sizes by introducing Boost, a family of scheduling policies that adjust the effective arrival times via a boost function. The authors derive a closed-form expression for the optimal tail constant C and prove that the γ-Boost policy is strongly tail-optimal in the full-information setting, with extensions to partial information and promising practical performance. A key insight is the reduction to a batch-optimization problem using WDSPT/WDSEPT concepts and a cheating M/G/1 as a lower-bound tool, enabling rigorous tail-analysis via a tagged-job framework. Simulations corroborate substantial tail improvements over FCFS, Nudge, and SRPT across distributions and information regimes, suggesting both theoretical significance and practical applicability for high-reliability queueing systems.
Abstract
We study the problem of scheduling jobs in a queueing system, specifically an M/G/1 with light-tailed job sizes, to asymptotically optimize the response time tail. This means scheduling to make $\mathbf{P}[T > t]$, the chance a job's response time exceeds $t$, decay as quickly as possible in the $t \to \infty$ limit. For some time, the best known policy was First-Come First-Served (FCFS), which has an asymptotically exponential tail: $\mathbf{P}[T > t] \sim C e^{-γt}$. FCFS achieves the optimal *decay rate* $γ$, but its *tail constant* $C$ is suboptimal. Only recently have policies that improve upon FCFS's tail constant been discovered. But it is unknown what the optimal tail constant is, let alone what policy might achieve it. In this paper, we derive a closed-form expression for the optimal tail constant $C$, and we introduce *$γ$-Boost*, a new policy that achieves this optimal tail constant. Roughly speaking, $γ$-Boost operates similarly to FCFS, but it pretends that small jobs arrive earlier than their true arrival times. This significantly reduces the response time of small jobs without unduly delaying large jobs, improving upon FCFS's tail constant by up to 50% with only moderate job size variability, with even larger improvements for higher variability. While these results are for systems with full job size information, we also introduce and analyze a version of $γ$-Boost that works in settings with partial job size information, showing it too achieves significant gains over FCFS. Finally, we show via simulation that $γ$-Boost has excellent practical performance.
