Table of Contents
Fetching ...

Differentially Private Kernelized Contextual Bandits

Nikola Pavlovic, Sudeep Salgia, Qing Zhao

TL;DR

This work addresses contextual kernel bandits where the reward function resides in an RKHS and contexts are revealed over time. It introduces USCA, a privacy-preserving algorithm that uses data-independent uniform sampling during learning and a novel low-sensitivity estimator to achieve joint differential privacy with respect to contexts and rewards. The main result is a diminishing simple-regret bound of $\mathcal{O}\left(\sqrt{\frac{\gamma_T}{T}} + \frac{\gamma_T}{T\varepsilon}\right)$ for kernels with polynomial eigen-decay, matching non-private rates up to the privacy term and extending applicability beyond the Square Exponential family. The combination of a forward-private sampling strategy and a covariance-based estimator yields strong privacy guarantees (JDP) while maintaining near-optimal learning performance across a broad class of kernels, including Matérn kernels.

Abstract

We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space (RKHS). We study this problem under the additional constraint of joint differential privacy, where the agents needs to ensure that the sequence of query points is differentially private with respect to both the sequence of contexts and rewards. We propose a novel algorithm that improves upon the state of the art and achieves an error rate of $\mathcal{O}\left(\sqrt{\frac{γ_T}{T}} + \frac{γ_T}{T \varepsilon}\right)$ after $T$ queries for a large class of kernel families, where $γ_T$ represents the effective dimensionality of the kernel and $\varepsilon > 0$ is the privacy parameter. Our results are based on a novel estimator for the reward function that simultaneously enjoys high utility along with a low-sensitivity to observed rewards and contexts, which is crucial to obtain an order optimal learning performance with improved dependence on the privacy parameter.

Differentially Private Kernelized Contextual Bandits

TL;DR

This work addresses contextual kernel bandits where the reward function resides in an RKHS and contexts are revealed over time. It introduces USCA, a privacy-preserving algorithm that uses data-independent uniform sampling during learning and a novel low-sensitivity estimator to achieve joint differential privacy with respect to contexts and rewards. The main result is a diminishing simple-regret bound of for kernels with polynomial eigen-decay, matching non-private rates up to the privacy term and extending applicability beyond the Square Exponential family. The combination of a forward-private sampling strategy and a covariance-based estimator yields strong privacy guarantees (JDP) while maintaining near-optimal learning performance across a broad class of kernels, including Matérn kernels.

Abstract

We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space (RKHS). We study this problem under the additional constraint of joint differential privacy, where the agents needs to ensure that the sequence of query points is differentially private with respect to both the sequence of contexts and rewards. We propose a novel algorithm that improves upon the state of the art and achieves an error rate of after queries for a large class of kernel families, where represents the effective dimensionality of the kernel and is the privacy parameter. Our results are based on a novel estimator for the reward function that simultaneously enjoys high utility along with a low-sensitivity to observed rewards and contexts, which is crucial to obtain an order optimal learning performance with improved dependence on the privacy parameter.
Paper Structure (17 sections, 15 theorems, 79 equations, 1 algorithm)

This paper contains 17 sections, 15 theorems, 79 equations, 1 algorithm.

Key Result

Theorem 2.1

SVM_Book Let $\mathcal{W}$ be a compact metric space and $k : \mathcal{W} \times \mathcal{W}\rightarrow \mathbb{R}$ be a continuous kernel. Furthermore, let $\zeta$ be a finite Borel probability measure supported on $\mathcal{W}$. Then, there exists an orthonormal system of functions $\{\psi_j\}_{j\

Theorems & Definitions (29)

  • Theorem 2.1
  • Definition 2.2
  • Definition 2.4
  • Definition 2.5
  • Lemma 3.1
  • proof
  • Definition 3.2
  • Theorem 4.1
  • Lemma 4.2
  • Lemma 4.3
  • ...and 19 more