Table of Contents
Fetching ...

Differentially Private High Dimensional Bandits

Apurv Shukla

TL;DR

This work addresses high-dimensional stochastic contextual linear bandits with a sparse parameter and joint differential privacy by introducing PrivateLASSO, a privacy-preserving LASSO-bandit algorithm. The method combines a sparse hard-thresholding privacy mechanism with an episodic thresholding rule to identify the support of $\theta$, and then applies private $\ell_2$ regression on the identified support via a tree-based aggregation framework, controlled by the privacy budget. The authors prove $(\varepsilon,\delta)$-DP guarantees and derive regret bounds: $\mathcal{R}(T)=\mathcal{O}\left(\frac{s_0^{3/2}\log^3 T}{\varepsilon}\right)$ under a margin condition and $\mathcal{O}\left(\frac{s_0^{3/2}\sqrt{T}\log^2 T}{\varepsilon}\right)$ without it, along with minimax lower bounds that characterize the privacy-accuracy trade-offs. The results demonstrate the feasibility of private learning in high-dimensional contextual bandits with sparse structure and clarify how privacy parameters shape learning performance in practice.

Abstract

We consider a high-dimensional stochastic contextual linear bandit problem when the parameter vector is $s_{0}$-sparse and the decision maker is subject to privacy constraints under both central and local models of differential privacy. We present PrivateLASSO, a differentially private LASSO bandit algorithm. PrivateLASSO is based on two sub-routines: (i) a sparse hard-thresholding-based privacy mechanism and (ii) an episodic thresholding rule for identifying the support of the parameter $θ$. We prove minimax private lower bounds and establish privacy and utility guarantees for PrivateLASSO for the central model under standard assumptions.

Differentially Private High Dimensional Bandits

TL;DR

This work addresses high-dimensional stochastic contextual linear bandits with a sparse parameter and joint differential privacy by introducing PrivateLASSO, a privacy-preserving LASSO-bandit algorithm. The method combines a sparse hard-thresholding privacy mechanism with an episodic thresholding rule to identify the support of , and then applies private regression on the identified support via a tree-based aggregation framework, controlled by the privacy budget. The authors prove -DP guarantees and derive regret bounds: under a margin condition and without it, along with minimax lower bounds that characterize the privacy-accuracy trade-offs. The results demonstrate the feasibility of private learning in high-dimensional contextual bandits with sparse structure and clarify how privacy parameters shape learning performance in practice.

Abstract

We consider a high-dimensional stochastic contextual linear bandit problem when the parameter vector is -sparse and the decision maker is subject to privacy constraints under both central and local models of differential privacy. We present PrivateLASSO, a differentially private LASSO bandit algorithm. PrivateLASSO is based on two sub-routines: (i) a sparse hard-thresholding-based privacy mechanism and (ii) an episodic thresholding rule for identifying the support of the parameter . We prove minimax private lower bounds and establish privacy and utility guarantees for PrivateLASSO for the central model under standard assumptions.
Paper Structure (14 sections, 13 theorems, 13 equations, 2 algorithms)

This paper contains 14 sections, 13 theorems, 13 equations, 2 algorithms.

Key Result

Lemma 1

Suppose $\mathcal{A} : \mathcal{U} \to \mathcal{V}$ is $(\varepsilon,\delta)$-differentially private. Consider any set of functions $f_t: \mathcal{U}_t \times \mathcal{V} \to \mathcal{V}_0$, where $\mathcal{U}_t$ is the portion of the database containing the $t$-th component of the input data. Then

Theorems & Definitions (17)

  • Definition 1: Compatibility
  • Definition 2: Differential Privacy for Streams DMNS06CPW16
  • Definition 3: Joint Differential Privacy for Streams KPRU14shariff2018differentially
  • Lemma 1: Billboard Lemma HHRRW14RR14
  • Proposition 1: Advanced Composition DRV10
  • Proposition 2: Post Processing DMNS06
  • Lemma 2: Privacy of SparseEstimation, Algorithm \ref{['algo:sparse-estimation']}
  • Theorem 1: Privacy Guarantees for Algorithm \ref{['algo:private-lasso']}
  • Definition 4: Estimation Accuracy
  • Lemma 3: Accuracy of Algorithm \ref{['algo:sparse-estimation']}
  • ...and 7 more