Recommendation-aided Caching using Combinatorial Multi-armed Bandits

Pavamana K J; Chandramani Kishore Singh

TL;DR

This paper addresses cache optimization in wireless networks with recommendations by modeling joint caching and recommendation as a combinatorial multi-armed bandit (CMAB) problem, with the aim of maximizing cache hits under partial observation. It develops a UCB-based algorithm that estimates per-item request probabilities from observations of cached contents only, caches the top $C$ contents, and recommends $R$ items among them; the confidence interval scales with the average recommendation acceptance $\bar{w}^{\text{rec}}$ and a context parameter $\eta$. A second contribution handles unknown user acceptance by introducing an estimator $\bar{w}^{\text{rec}}(t)$ and adapting the UCB indices accordingly, with a dedicated algorithm and a discussion of regret analysis as future work. Numerical results on synthetic settings (e.g., $N=50$, $C=20$, $U=20$) show improved cache-hit performance over baselines, robustness to Zipf-distributed recommendations, and effective learning of acceptance rates, highlighting practical gains from integrating recommendations into edge caching.

Abstract

We study content caching with recommendations in a wireless network where the users are connected through a base station equipped with a finite-capacity cache. We assume a fixed set of contents with unknown user preferences and content popularities. The base station can cache a subset of the contents and can also recommend subsets of the contents to different users in order to encourage them to request the recommended contents. Recommendations, depending on their acceptability, can thus be used to increase cache hits. We first assume that the users' recommendation acceptabilities are known and formulate the cache hit optimization problem as a combinatorial multi-armed bandit (CMAB). We propose a UCB-based algorithm to decide which contents to cache and recommend and provide an upper bound on the regret of this algorithm. Subsequently, we consider a more general scenario where the users' recommendation acceptabilities are also unknown and propose another UCB-based algorithm that learns these as well. We numerically demonstrate the performance of our algorithms and compare these to state-of-the-art algorithms.
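As a rough illustration of the caching step described above, the following Python sketch ranks contents by a UCB index, caches the top $C$, and recommends the top $R$ of those. It is a minimal sketch under stated assumptions, not the paper's algorithm: the confidence radius is the standard $\sqrt{1.5 \ln t / n_i}$ form rather than the paper's radius (which depends on $\bar{w}^{\text{rec}}$ and $\eta$), and the function names `ucb_indices`, `select_cache`, and `update` are hypothetical.

```python
import math


def ucb_indices(counts, means, t):
    """Return a UCB index per content at round t.

    Assumption: standard UCB1-style radius sqrt(1.5 * ln t / n_i),
    not the paper's radius. Contents never observed get +inf so
    that they are cached (and thus observed) first.
    """
    return [
        float("inf") if n == 0 else m + math.sqrt(1.5 * math.log(t) / n)
        for n, m in zip(counts, means)
    ]


def select_cache(indices, cache_size, num_recommend):
    """Cache the top-C contents by index; recommend the top-R of those."""
    ranked = sorted(range(len(indices)), key=lambda i: indices[i], reverse=True)
    cache = ranked[:cache_size]
    return cache, cache[:num_recommend]


def update(counts, means, cache, requested):
    """Update estimates under partial observation.

    Only cached contents reveal whether they were requested in this
    round; estimates of uncached contents stay unchanged.
    """
    for i in cache:
        reward = 1.0 if i in requested else 0.0
        counts[i] += 1
        means[i] += (reward - means[i]) / counts[i]  # running average
```

In a simulation loop, each round would call `ucb_indices`, then `select_cache`, then observe the set of requested contents and pass it to `update`. Restricting the recommended set to cached contents follows the summary above; how recommendations change the request probabilities is not modeled in this sketch.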

Paper Structure (18 sections, 5 theorems, 47 equations, 1 figure, 3 algorithms)


Introduction
Related Works
System Model and Caching Problem
Content Caching Problem
Algorithm Design
Unknown users’ recommendation acceptability $w_u^{\text{rec}}$
Numerical Results
Performance of Algorithm \ref{alg:ucb-rec} compared with CMAB-UCB \cite{DBLP:journals/corr/KvetonWAEE14}, $\epsilon$-greedy and greedy algorithms with uniform distribution over the recommended contents
Performance of Algorithm \ref{alg:ucb-rec} compared with CMAB-UCB \cite{DBLP:journals/corr/KvetonWAEE14} for various $w_u^{\text{rec}}$
Performance of Algorithm \ref{alg:ucb-rec} compared with CMAB-UCB \cite{DBLP:journals/corr/KvetonWAEE14} for various numbers of users
Performance of Algorithm \ref{alg:ucb-rec} compared with existing algorithms under the Zipf distribution among the recommended contents
Convergence of $\bar{w}^{\text{rec}}(t)$ to true mean $\bar{w}^{\text{rec}}$
Regret performance of Algorithm \ref{alg:ucb-unk-w} compared with Algorithm \ref{alg:ucb-rec} and CMAB-UCB \cite{DBLP:journals/corr/KvetonWAEE14}
Conclusion and Future Work
Proof of Lemma 1
...and 3 more sections

Key Result

Lemma 1

The regret $\operatorname{R}(T)$ can also be written as follows

Figures (1)

Figure 1: Performance of Algorithms \ref{alg:ucb-rec} and \ref{alg:ucb-unk-w}.

Theorems & Definitions (18)

Remark 1
Remark 2
Remark 3
Lemma 1
proof
Theorem 1
Remark 4
Remark 5
proof
proof
...and 8 more
