Distribution-Aware Mean Estimation under User-level Local Differential Privacy

Corentin Pla; Hugo Richard; Maxime Vono

Distribution-Aware Mean Estimation under User-level Local Differential Privacy

Corentin Pla, Hugo Richard, Maxime Vono

TL;DR

Based on a distribution-aware mean estimation algorithm, an upper bounds on the worst-case risk over $\mu$ are established and a lower bound is derived for the task of mean estimation under user-level local differential privacy.

Abstract

We consider the problem of mean estimation under user-level local differential privacy, where $n$ users are contributing through their local pool of data samples. Previous work assume that the number of data samples is the same across users. In contrast, we consider a more general and realistic scenario where each user $u \in [n]$ owns $m_u$ data samples drawn from some generative distribution $μ$; $m_u$ being unknown to the statistician but drawn from a known distribution $M$ over $\mathbb{N}^\star$. Based on a distribution-aware mean estimation algorithm, we establish an $M$-dependent upper bounds on the worst-case risk over $μ$ for the task of mean estimation. We then derive a lower bound. The two bounds are asymptotically matching up to logarithmic factors and reduce to known bounds when $m_u = m$ for any user $u$.

Distribution-Aware Mean Estimation under User-level Local Differential Privacy

TL;DR

Based on a distribution-aware mean estimation algorithm, an upper bounds on the worst-case risk over

are established and a lower bound is derived for the task of mean estimation under user-level local differential privacy.

Abstract

We consider the problem of mean estimation under user-level local differential privacy, where

users are contributing through their local pool of data samples. Previous work assume that the number of data samples is the same across users. In contrast, we consider a more general and realistic scenario where each user

owns

data samples drawn from some generative distribution

;

being unknown to the statistician but drawn from a known distribution

over

. Based on a distribution-aware mean estimation algorithm, we establish an

-dependent upper bounds on the worst-case risk over

for the task of mean estimation. We then derive a lower bound. The two bounds are asymptotically matching up to logarithmic factors and reduce to known bounds when

for any user

Distribution-Aware Mean Estimation under User-level Local Differential Privacy

TL;DR

Abstract

Distribution-Aware Mean Estimation under User-level Local Differential Privacy

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (1)

Theorems & Definitions (16)