Performing Load Balancing under Constraints
Andrea Fox, Francesco De Pellegrini, Eitan Altman, Arnob Ghosh, Ness Shroff
TL;DR
This work reframes load balancing under practical constraints as a constrained Markov decision process (CMDP), addressing both communication-rate limits and minimum-queue-activity requirements. It develops a near-optimal, Lyapunov-drift-based framework and introduces three lightweight safe policies—JSVED, JSED-$k$, and JSSQ—that guarantee constraint satisfaction while aiming to minimize system occupancy. The paper also extends these ideas to generic convex gain functions and validates performance through extensive simulations under light and heavy traffic, highlighting trade-offs between safety, memory demand, and scalability. The results offer scalable, provably safe strategies for constrained load balancing with potential applications in IoT, edge computing, and content delivery networks.
Abstract
Join-the-shortest queue (JSQ) and its variants have often been used in solving load balancing problems. The aim of such policies is to minimize the average system occupation, e.g., the customer's system time. In this work we extend the traditional load balancing setting to include constraints that may be imposed, e.g., due to the communication network. We cast the problem into the framework of constrained MDPs, enabling the consideration of both action-dependent constraints, such as, e.g, bandwidth limitation, and state-dependent constraints, such as, e.g., minimum queue utilization. Unlike the state-of-the-art approaches, our load-balancing policies, in particular JSED-$k$ and JSSQ, are both provably safe and yet strive to minimize the system occupancy. Their performance is tested with extensive numerical results under various system settings.
