Differentially Private High Dimensional Bandits

Apurv Shukla

Differentially Private High Dimensional Bandits

Apurv Shukla

TL;DR

This work addresses high-dimensional stochastic contextual linear bandits with a sparse parameter and joint differential privacy by introducing PrivateLASSO, a privacy-preserving LASSO-bandit algorithm. The method combines a sparse hard-thresholding privacy mechanism with an episodic thresholding rule to identify the support of $\theta$, and then applies private $\ell_2$ regression on the identified support via a tree-based aggregation framework, controlled by the privacy budget. The authors prove $(\varepsilon,\delta)$-DP guarantees and derive regret bounds: $\mathcal{R}(T)=\mathcal{O}\left(\frac{s_0^{3/2}\log^3 T}{\varepsilon}\right)$ under a margin condition and $\mathcal{O}\left(\frac{s_0^{3/2}\sqrt{T}\log^2 T}{\varepsilon}\right)$ without it, along with minimax lower bounds that characterize the privacy-accuracy trade-offs. The results demonstrate the feasibility of private learning in high-dimensional contextual bandits with sparse structure and clarify how privacy parameters shape learning performance in practice.

Abstract

We consider a high-dimensional stochastic contextual linear bandit problem when the parameter vector is $s_{0}$-sparse and the decision maker is subject to privacy constraints under both central and local models of differential privacy. We present PrivateLASSO, a differentially private LASSO bandit algorithm. PrivateLASSO is based on two sub-routines: (i) a sparse hard-thresholding-based privacy mechanism and (ii) an episodic thresholding rule for identifying the support of the parameter $θ$. We prove minimax private lower bounds and establish privacy and utility guarantees for PrivateLASSO for the central model under standard assumptions.

Differentially Private High Dimensional Bandits

TL;DR

, and then applies private

regression on the identified support via a tree-based aggregation framework, controlled by the privacy budget. The authors prove

-DP guarantees and derive regret bounds:

under a margin condition and

without it, along with minimax lower bounds that characterize the privacy-accuracy trade-offs. The results demonstrate the feasibility of private learning in high-dimensional contextual bandits with sparse structure and clarify how privacy parameters shape learning performance in practice.

Abstract

We consider a high-dimensional stochastic contextual linear bandit problem when the parameter vector is

-sparse and the decision maker is subject to privacy constraints under both central and local models of differential privacy. We present PrivateLASSO, a differentially private LASSO bandit algorithm. PrivateLASSO is based on two sub-routines: (i) a sparse hard-thresholding-based privacy mechanism and (ii) an episodic thresholding rule for identifying the support of the parameter

. We prove minimax private lower bounds and establish privacy and utility guarantees for PrivateLASSO for the central model under standard assumptions.

Paper Structure (14 sections, 13 theorems, 13 equations, 2 algorithms)

This paper contains 14 sections, 13 theorems, 13 equations, 2 algorithms.

Introduction
Our Contributions
Related Work
Model and Preliminaries
Model Assumptions
Privacy Preliminaries
PrivateThresholdLASSO
SparseEstimation
Privacy Guarantees
Regret Guarantee
Utility Gaurantees
Regret bounds under the Margin Condition
Regret bounds without the Margin condition
Lower Bounds

Key Result

Lemma 1

Suppose $\mathcal{A} : \mathcal{U} \to \mathcal{V}$ is $(\varepsilon,\delta)$-differentially private. Consider any set of functions $f_t: \mathcal{U}_t \times \mathcal{V} \to \mathcal{V}_0$, where $\mathcal{U}_t$ is the portion of the database containing the $t$-th component of the input data. Then

Theorems & Definitions (17)

Definition 1: Compatibility
Definition 2: Differential Privacy for Streams DMNS06CPW16
Definition 3: Joint Differential Privacy for Streams KPRU14shariff2018differentially
Lemma 1: Billboard Lemma HHRRW14RR14
Proposition 1: Advanced Composition DRV10
Proposition 2: Post Processing DMNS06
Lemma 2: Privacy of SparseEstimation, Algorithm \ref{['algo:sparse-estimation']}
Theorem 1: Privacy Guarantees for Algorithm \ref{['algo:private-lasso']}
Definition 4: Estimation Accuracy
Lemma 3: Accuracy of Algorithm \ref{['algo:sparse-estimation']}
...and 7 more

Differentially Private High Dimensional Bandits

TL;DR

Abstract

Differentially Private High Dimensional Bandits

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (17)