Tail Optimality and Performance Analysis of the Nudge-M Scheduling Algorithm
Nils Charlet, Benny Van Houdt
TL;DR
This work introduces Nudge-M, a scheduling policy for two-type job systems where decisions rely only on arrival types and their order, to achieve strong tail performance improvements over FCFS. The authors establish that Nudge-M is asymptotically tail-optimal within a large family of Nudge-like policies by deriving explicit ATIR expressions and proving a unique optimal parameter M_{opt} that matches the Nudge-K optimum from prior work. In heavy traffic, ATIR(M_{opt}) converges to a Gamma-Boost–like limit, and the optimal M is shown to depend only on coarse job-size statistics, not on full distributions. The paper also provides a practical numerical framework using phase-type distributions and Markov-modulated fluid queues to compute mean and distributional performance for both type-1 and type-2 jobs, highlighting significant mean and tail gains with Nudge-M. The results offer a rigorous, broadly applicable approach to tail optimization under limited information, with potential extensions to more types and general arrival processes.
Abstract
Recently it was shown that the response time of First-Come-First-Served (FCFS) scheduling can be stochastically and asymptotically improved upon by the {\it Nudge} scheduling algorithm in case of light-tailed job size distributions. Such improvements are feasible even when the jobs are partitioned into two types and the scheduler only has information about the type of incoming jobs (but not their size). In this paper we introduce Nudge-$M$ scheduling, where basically any incoming type-1 job is allowed to pass any type-2 job that is still waiting in the queue given that it arrived as one of the last $M$ jobs. We prove that Nudge-$M$ has an asymptotically optimal response time within a large family of Nudge scheduling algorithms when job sizes are light-tailed. Simple explicit results for the asymptotic tail improvement ratio (ATIR) of Nudge-$M$ over FCFS are derived as well as explicit results for the optimal parameter $M$. An expression for the ATIR that only depends on the type-1 and type-2 mean job sizes and the fraction of type-1 jobs is presented in the heavy traffic setting. The paper further presents a numerical method to compute the response time distribution and mean response time of Nudge-$M$ scheduling provided that the job size distribution of both job types follows a phase-type distribution (by making use of the framework of Markov modulated fluid queues with jumps).
