Exponentially Weighted Algorithm for Online Network Resource Allocation with Long-Term Constraints
Ahmed Sid-Ali, Ioannis Lambadaris, Yiqiang Q. Zhao, Gennady Shaikhet, Amirhossein Asgharnia
TL;DR
This work tackles online resource reservation across a network of servers with possible inter-server transfers under a long-term constraint on combined violation and transfer costs. It introduces a randomized exponentially weighted algorithm that operates on the discrete reservation space, incorporating a history-based penalty to enforce time-averaged constraints and achieving sublinear regret with vanishing constraint violations. The approach is theoretically analyzed to yield explicit regret and constraint-violation bounds and is empirically compared against a tailored reinforcement learning baseline, where the proposed method generally outperforms RL in dynamic-demand scenarios. The results demonstrate the practical viability of history-aware, constraint-aware online optimization for network resource management, with potential for broader application in similar discrete-action, long-horizon constrained problems.
Abstract
This paper studies an online optimal resource reservation problem in communication networks with job transfers where the goal is to minimize the reservation cost while maintaining the blocking cost under a certain budget limit. To tackle this problem, we propose a novel algorithm based on a randomized exponentially weighted method that encompasses long-term constraints. We then analyze the performance of our algorithm by establishing an upper bound for the associated regret and the cumulative constraint violations. Finally, we present numerical experiments where we compare the performance of our algorithm with those of reinforcement learning where we show that our algorithm surpasses it.
