Dynamic Content Caching with Waiting Costs via Restless Multi-Armed Bandits
Ankita Koley, Chandramani Singh
TL;DR
This work addresses dynamic content caching with AoV-based ageing and waiting costs by modeling each content as an arm in a Restless Multi-Armed Bandit (RMAB). It develops a Whittle-index based policy with explicit index expressions, and proves asymptotic optimality for large content sets with capacity scaling. A complete solution is provided for the infinite-cache case, while a finite-cache policy leverages indexability and per-content indices to govern eviction and fetching. Numerical results show the proposed policy closely matches the relaxed RMAB lower bound and significantly outperforms greedy and static push-pull baselines, with the waiting option reducing costs in practice.
Abstract
We consider a system with a local cache connected to a backend server and an end user population. A set of contents are stored at the the server where they continuously get updated. The local cache keeps copies, potentially stale, of a subset of the contents. The users make content requests to the local cache which either can serve the local version if available or can fetch a fresh version or can wait for additional requests before fetching and serving a fresh version. Serving a stale version of a content incurs an age-of-version(AoV) dependent ageing cost, fetching it from the server incurs a fetching cost, and making a request wait incurs a per unit time waiting cost. We focus on the optimal actions subject to the cache capacity constraint at each decision epoch, aiming at minimizing the long term average cost. We pose the problem as a Restless Multi-armed Bandit(RMAB) Problem and propose a Whittle index based policy which is known to be asymptotically optimal. We explicitly characterize the Whittle indices. We numerically evaluate the proposed policy and also compare it to a greedy policy. We show that it is close to the optimal policy and substantially outperforms the exising policies.
