Differentially Private Worst-group Risk Minimization

Xinyu Zhou; Raef Bassily

Differentially Private Worst-group Risk Minimization

Xinyu Zhou, Raef Bassily

TL;DR

An algorithmic framework for worst-group population risk minimization using any DP online convex optimization algorithm as a subroutine and another excess risk bound is given of $\tilde{O}(\frac{p\sqrt{d}}{K\epsilon})$.

Abstract

We initiate a systematic study of worst-group risk minimization under $(ε, δ)$-differential privacy (DP). The goal is to privately find a model that approximately minimizes the maximal risk across $p$ sub-populations (groups) with different distributions, where each group distribution is accessed via a sample oracle. We first present a new algorithm that achieves excess worst-group population risk of $\tilde{O}(\frac{p\sqrt{d}}{Kε} + \sqrt{\frac{p}{K}})$, where $K$ is the total number of samples drawn from all groups and $d$ is the problem dimension. Our rate is nearly optimal when each distribution is observed via a fixed-size dataset of size $K/p$. Our result is based on a new stability-based analysis for the generalization error. In particular, we show that $Δ$-uniform argument stability implies $\tilde{O}(Δ+ \frac{1}{\sqrt{n}})$ generalization error w.r.t. the worst-group risk, where $n$ is the number of samples drawn from each sample oracle. Next, we propose an algorithmic framework for worst-group population risk minimization using any DP online convex optimization algorithm as a subroutine. Hence, we give another excess risk bound of $\tilde{O}\left( \sqrt{\frac{d^{1/2}}{εK}} +\sqrt{\frac{p}{Kε^2}} \right)$. Assuming the typical setting of $ε=Θ(1)$, this bound is more favorable than our first bound in a certain range of $p$ as a function of $K$ and $d$. Finally, we study differentially private worst-group empirical risk minimization in the offline setting, where each group distribution is observed by a fixed-size dataset. We present a new algorithm with nearly optimal excess risk of $\tilde{O}(\frac{p\sqrt{d}}{Kε})$.

Differentially Private Worst-group Risk Minimization

TL;DR

An algorithmic framework for worst-group population risk minimization using any DP online convex optimization algorithm as a subroutine and another excess risk bound is given of

Abstract

We initiate a systematic study of worst-group risk minimization under

-differential privacy (DP). The goal is to privately find a model that approximately minimizes the maximal risk across

sub-populations (groups) with different distributions, where each group distribution is accessed via a sample oracle. We first present a new algorithm that achieves excess worst-group population risk of

, where

is the total number of samples drawn from all groups and

is the problem dimension. Our rate is nearly optimal when each distribution is observed via a fixed-size dataset of size

. Our result is based on a new stability-based analysis for the generalization error. In particular, we show that

-uniform argument stability implies

generalization error w.r.t. the worst-group risk, where

is the number of samples drawn from each sample oracle. Next, we propose an algorithmic framework for worst-group population risk minimization using any DP online convex optimization algorithm as a subroutine. Hence, we give another excess risk bound of

. Assuming the typical setting of

, this bound is more favorable than our first bound in a certain range of

as a function of

and

. Finally, we study differentially private worst-group empirical risk minimization in the offline setting, where each group distribution is observed by a fixed-size dataset. We present a new algorithm with nearly optimal excess risk of

Paper Structure (35 sections, 12 theorems, 101 equations, 5 algorithms)

This paper contains 35 sections, 12 theorems, 101 equations, 5 algorithms.

Introduction
Contribution
Related Work
Preliminaries
Sample oracle:
$\alpha$-saddle point:
Worst-group population risk minimization
Differential Privacy dwork2006calibrating:
Stability:
Minimax Phased ERM
Algorithm description
Remarks:
Remark:
Privacy:
Utility:
...and 20 more sections

Key Result

Theorem 2

Let $\eta = \frac{M}{D} \min \left\{\frac{\epsilon}{ \sqrt{72d \log (\frac{K}{p}) \log(\frac{1}{\delta})}}, \frac{\sqrt{p}}{\log^{\frac{3}{4}} (K) \sqrt{K}} \right\}$. Algorithm alg:Minimax Phased ERM is $(\epsilon, \delta)$-differentially private and we have where $w_T$ is the output of Algorithm alg:Minimax Phased ERM and the expectation is taken over the sampled data points and the algorithm's

Theorems & Definitions (26)

Definition 1
Theorem 2
Lemma 3
Lemma 4
proof
proof
Definition 5
Theorem 6
proof
Theorem 7
...and 16 more

Differentially Private Worst-group Risk Minimization

TL;DR

Abstract

Differentially Private Worst-group Risk Minimization

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (26)