Power-of-$d$ Choices Load Balancing in the Sub-Halfin Whitt Regime

Sushil Mahavir Varma, Francisco Castro, Siva Theja Maguluri

TL;DR

This work analyzes Power-of-$d$ choices routing in a many-server load-balancing system under the sub-Halfin-Whitt regime with arrival rate $\lambda = n - n^{1-\gamma}$, $\gamma\in(0,0.5)$. It develops an iterative state-space collapse framework driven by Lyapunov drift to obtain sharp, high-probability bounds on queue lengths across a broad range of $d$-scalings, revealing phase transitions: zero-delay for $d \ge n^{\gamma}\log n$, finite-delay with queue length $m$ when $d = \Theta((n^{\gamma}\log n)^{1/m})$, and infinite delay for polylogarithmic $d$. By linking the stochastic steady-state to the fixed point of a mean-field ODE, the paper characterizes the dominant term $s_i/n \approx (\lambda/n)^{(d^i-1)/(d-1)}$ and provides matching upper and lower bounds up to lower-order corrections. Simulations corroborate the fixed-point predictions and illustrate the phase transitions. The approach sidesteps Stein’s method, using iterative SSC with Lyapunov drift to achieve a comprehensive, rigorous understanding of Power-of-$d$ dynamics in this regime, with implications for selecting $d$ in large-scale load-balancing systems.
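The dominant term quoted above, $s_i/n \approx (\lambda/n)^{(d^i-1)/(d-1)}$, can be evaluated directly. The following is a minimal sketch, not the authors' code; the function names `sub_halfin_whitt_load` and `tail_fraction` are hypothetical, and $s_i$ is read here as the number of servers holding at least $i$ jobs.

```python
import math


def sub_halfin_whitt_load(n: int, gamma: float) -> float:
    """Per-server load lambda/n for lambda = n - n**(1 - gamma)."""
    return 1.0 - n ** (-gamma)


def tail_fraction(load: float, d: float, i: int) -> float:
    """Dominant term (lambda/n)**((d**i - 1)/(d - 1)) for s_i/n.

    For d == 1 the exponent is taken as its limit, i (assumption made
    here only to avoid a division by zero).
    """
    exponent = float(i) if d == 1 else (d ** i - 1) / (d - 1)
    return load ** exponent


if __name__ == "__main__":
    n, gamma = 10_000, 0.25
    rho = sub_halfin_whitt_load(n, gamma)
    # Compare a constant d with the zero-delay threshold d = n**gamma * log(n).
    for d in (2, n ** gamma * math.log(n)):
        approx_counts = [n * tail_fraction(rho, d, i) for i in (1, 2, 3)]
        print(f"d = {d:.1f}: approximate s_1, s_2, s_3 = {approx_counts}")
```

With $n = 10{,}000$ and $\gamma = 0.25$ the load is $0.9$, and $d = 2$ gives exponents $1, 3, 7$ for $i = 1, 2, 3$. At $d = n^{\gamma}\log n$ the approximation puts $s_2$ below one server, which is consistent with the zero-delay phase described above.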

Abstract

We consider a load balancing system with Poisson arrivals, exponential services, and homogeneous servers. Upon arrival, a job is routed to one of the servers, where it is queued until service. We consider the Power-of-$d$ choices routing algorithm, which chooses the queue with minimum length among $d$ randomly sampled queues. We study this system in the many-server heavy-traffic regime, where the number of servers goes to infinity while the load simultaneously approaches capacity. In particular, we consider a sequence of systems with $n$ servers in which the arrival rate is $\lambda = n - n^{1-\gamma}$ for some $\gamma \in (0, 0.5)$, known as the sub-Halfin-Whitt regime. It was shown by [Liu Ying (2020)] that under Power-of-$d$ choices routing with $d \geq n^\gamma \log n$, the queue length behaves similarly to that of JSQ and that there are asymptotically zero queueing delays. The focus of this paper is to characterize the behavior when $d$ is below this threshold. We obtain high-probability bounds on the queue lengths for various values of $d$ and large enough $n$. In particular, we show that when $d$ grows polynomially in $n$ but slower than in [Liu Ying (2020)], i.e., if $d$ is $\Theta\left((n^\gamma \log n)^{1/m}\right)$ for some integer $m>1$, then the asymptotic queue length is $m$ with high probability. Moreover, if $d$ grows polylogarithmically in $n$, i.e., slower than any polynomial, but is at least $\Omega(\log (n)^3)$, the queue length blows up to infinity asymptotically. We obtain these results by using an iterative state-space collapse approach. We first establish a weak state-space collapse (SSC) on the queue lengths. Then, we bootstrap on the weak SSC to iteratively narrow down the region of the collapse. After enough steps, this inductive refinement provides the bounds we seek. We establish this sequence of collapses using Lyapunov drift arguments.
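The routing rule described in the abstract can be sketched in a few lines. This is an illustrative sketch only: the function name `route_power_of_d` is hypothetical, and sampling the $d$ queues without replacement is an assumption made here, since the abstract does not say which sampling scheme is used.

```python
import random


def route_power_of_d(queue_lengths, d, rng=random):
    """Return the index of the shortest queue among d sampled queues.

    Assumption: the d queues are sampled uniformly without replacement.
    Ties go to whichever tied queue appears first in the random sample.
    """
    sampled = rng.sample(range(len(queue_lengths)), d)
    return min(sampled, key=lambda i: queue_lengths[i])


if __name__ == "__main__":
    rng = random.Random(0)
    queues = [3, 0, 2, 5, 1, 4]
    # d = 1 is uniformly random routing; d = len(queues) inspects every
    # queue and therefore reduces to join-the-shortest-queue (JSQ).
    for d in (1, 2, len(queues)):
        target = route_power_of_d(queues, d, rng)
        print(f"d = {d}: route to server {target} (length {queues[target]})")
```

Setting $d = n$ recovers JSQ, which matches the $d = n$ labelling used for JSQ in Figure 1.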

Paper Structure

This paper contains 36 sections, 18 theorems, 174 equations, 17 figures, and 3 tables.

Key Result

Theorem 1

Let $\left\{m_n \in \mathbb{Z}_+ : n \in \mathbb{Z}_+\right\}$ be a sequence such that either $m_n \equiv m \in \mathbb{Z}_+$ or $m_n \rightarrow \infty$. Consider a load balancing model operating under the Power-of-$\lfloor d \rfloor$ routing algorithm with $d = (2m_n n^\gamma)^{1 /m_n} \log (d)^{1/m_n}$ ...

Figures (17)

  • Figure 1: Performance of JSQ $(d=n)$ under many-server-heavy-traffic regimes $(\gamma \in [0, \infty])$, where $\epsilon = n^{-\gamma}$ (left) and performance of Power-of-$d$ for different choices of $d$ under the sub-Halfin-Whitt regime, i.e. $\gamma \in (0, 0.5)$ (right).
  • Figure 2: A homogeneous load balancing model with $n$ servers.
  • Figure 3: Illustration of the fixed point of the ODE approximation in terms of queue occupancy.
  • Figure 4: Approximate ODE trajectory with the initial condition equal to $s_1=0$.
  • Figure 5: Graphical representation of the iterative SSC for the lower bound with $m=2$: the red hatched region is shown to have low steady-state probability, leading to the collapse into the solid blue region. The arrows represent the drift $(ds_1/dt, ds_2/dt)$.
  • ...and 12 more figures

Theorems & Definitions (22)

  • Theorem 1
  • Theorem 2
  • Theorem 3
  • Lemma 1: Base Case
  • Lemma 2: Induction Part $a$
  • Lemma 3: Induction Part $b$
  • Lemma 4: Induction Part $c$
  • Lemma 5: Induction Part $d$
  • Lemma 6: Step 1a
  • Lemma 7: Step 1b
  • ...and 12 more