Heavy-traffic Optimality of Skip-the-Longest-Queues in Heterogeneous Service Systems

Yishun Luo; Martin Zubeldia

Heavy-traffic Optimality of Skip-the-Longest-Queues in Heterogeneous Service Systems

Yishun Luo, Martin Zubeldia

TL;DR

The paper tackles scalable load balancing in heterogeneous parallel queues with ultra-low communication overhead by introducing the $k$-Skip-the-$d$-Longest-Queues policy. It establishes a stability region that is independent of the sampling parameter $k$ and shows that, in classical heavy-traffic, the policy is delay-optimal for any fixed $k$ under a suitable $d$, with the overhead vanishing as $k$ grows. State Space Collapse is used to connect the multi-queue dynamics to a one-dimensional projection, enabling a sharp delay analysis and showing a pathway to vanishing communication while preserving performance. In the many-server regime, it derives necessary conditions on $d$ for throughput and provides tight delay-optimality criteria, including phase-transition insights in homogeneous systems, thereby offering a practical load-balancing scheme with tunable communication cost.

Abstract

We consider a discrete-time parallel service system consisting of $n$ heterogeneous single server queues with infinite capacity. Jobs arrive to the system as an i.i.d. process with rate proportional to $n$, and must be immediately dispatched in the time slot that they arrive. The dispatcher is assumed to be able to exchange messages with the servers to obtain their queue lengths and make dispatching decisions, introducing an undesirable communication overhead. In this setting, we propose a ultra-low communication overhead load balancing policy dubbed $k$-Skip-the-$d$-Longest-Queues ($k$-SLQ-$d$), where queue lengths are only observed every $k(n-d)$ time slots and, between observations, incoming jobs are sent to a queue that is not one of the $d$ longest ones at the time that the queues were last observed. For this policy, we establish conditions on $d$ for it to be throughput optimal and we show that, under that condition, it is asymptotically delay-optimal in heavy-traffic for arbitrarily low communication overheads (i.e., for arbitrarily large $k$).

Heavy-traffic Optimality of Skip-the-Longest-Queues in Heterogeneous Service Systems

TL;DR

The paper tackles scalable load balancing in heterogeneous parallel queues with ultra-low communication overhead by introducing the

-Skip-the-

-Longest-Queues policy. It establishes a stability region that is independent of the sampling parameter

and shows that, in classical heavy-traffic, the policy is delay-optimal for any fixed

under a suitable

, with the overhead vanishing as

grows. State Space Collapse is used to connect the multi-queue dynamics to a one-dimensional projection, enabling a sharp delay analysis and showing a pathway to vanishing communication while preserving performance. In the many-server regime, it derives necessary conditions on

for throughput and provides tight delay-optimality criteria, including phase-transition insights in homogeneous systems, thereby offering a practical load-balancing scheme with tunable communication cost.

Abstract

We consider a discrete-time parallel service system consisting of

heterogeneous single server queues with infinite capacity. Jobs arrive to the system as an i.i.d. process with rate proportional to

, and must be immediately dispatched in the time slot that they arrive. The dispatcher is assumed to be able to exchange messages with the servers to obtain their queue lengths and make dispatching decisions, introducing an undesirable communication overhead. In this setting, we propose a ultra-low communication overhead load balancing policy dubbed

-Skip-the-

-Longest-Queues (

-SLQ-

), where queue lengths are only observed every

time slots and, between observations, incoming jobs are sent to a queue that is not one of the

longest ones at the time that the queues were last observed. For this policy, we establish conditions on

for it to be throughput optimal and we show that, under that condition, it is asymptotically delay-optimal in heavy-traffic for arbitrarily low communication overheads (i.e., for arbitrarily large

Heavy-traffic Optimality of Skip-the-Longest-Queues in Heterogeneous Service Systems

TL;DR

Abstract

Heavy-traffic Optimality of Skip-the-Longest-Queues in Heterogeneous Service Systems

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (8)