On the Regret of Coded Caching with Adversarial Requests
Anupam Nayak, Kota Srinivas Reddy, Nikhil Karamchandani
TL;DR
This work investigates adversarial regret in coded caching with a broadcast delivery channel within an online-learning framework. It proposes a Follow-The-Perturbed-Leader policy that updates cache placements in a restricted set of time slots and analyzes a non-linear rate expression by a careful transformation of the request vector, achieving sublinear regret $O(\sqrt{T})$ relative to a static oracle. The paper also derives upper bounds on switching costs under unrestricted and restricted switching and validates the theoretical findings through numerical experiments on real data. The results establish the first regret guarantees for coded caching under adversarial requests and provide practical guidance on balancing online learning performance with cache-update costs.
Abstract
We study the well-known coded caching problem in an online learning framework, wherein requests arrive sequentially, and an online policy can update the cache contents based on the history of requests seen thus far. We introduce a caching policy based on the Follow-The-Perturbed-Leader principle and show that for any time horizon T and any request sequence, it achieves a sub-linear regret of \mathcal{O}(\sqrt(T) ) with respect to an oracle that knows the request sequence beforehand. Our study marks the first examination of adversarial regret in the coded caching setup. Furthermore, we also address the issue of switching cost by establishing an upper bound on the expected number of cache updates made by our algorithm under unrestricted switching and also provide an upper bound on the regret under restricted switching when cache updates can only happen in a pre-specified subset of timeslots. Finally, we validate our theoretical insights with numerical results using a real-world dataset
