Table of Contents
Fetching ...

An Empirical Analysis of Fairness Notions under Differential Privacy

Anderson Santana de Oliveira, Caelin Kaplan, Khawla Mallat, Tanmay Chakraborty

TL;DR

Using standard datasets from ML fairness literature, it is shown that by selecting the optimal model architecture for DP-SGD, the differences across groups concerning the relevant fairness metrics more often decrease or are negligibly impacted, compared to the non-private baseline, for which optimal models have been selected to maximize utility.

Abstract

Recent works have shown that selecting an optimal model architecture suited to the differential privacy setting is necessary to achieve the best possible utility for a given privacy budget using differentially private stochastic gradient descent (DP-SGD)(Tramer and Boneh 2020; Cheng et al. 2022). In light of these findings, we empirically analyse how different fairness notions, belonging to distinct classes of statistical fairness criteria (independence, separation and sufficiency), are impacted when one selects a model architecture suitable for DP-SGD, optimized for utility. Using standard datasets from ML fairness literature, we show using a rigorous experimental protocol, that by selecting the optimal model architecture for DP-SGD, the differences across groups concerning the relevant fairness metrics (demographic parity, equalized odds and predictive parity) more often decrease or are negligibly impacted, compared to the non-private baseline, for which optimal model architecture has also been selected to maximize utility. These findings challenge the understanding that differential privacy will necessarily exacerbate unfairness in deep learning models trained on biased datasets.

An Empirical Analysis of Fairness Notions under Differential Privacy

TL;DR

Using standard datasets from ML fairness literature, it is shown that by selecting the optimal model architecture for DP-SGD, the differences across groups concerning the relevant fairness metrics more often decrease or are negligibly impacted, compared to the non-private baseline, for which optimal models have been selected to maximize utility.

Abstract

Recent works have shown that selecting an optimal model architecture suited to the differential privacy setting is necessary to achieve the best possible utility for a given privacy budget using differentially private stochastic gradient descent (DP-SGD)(Tramer and Boneh 2020; Cheng et al. 2022). In light of these findings, we empirically analyse how different fairness notions, belonging to distinct classes of statistical fairness criteria (independence, separation and sufficiency), are impacted when one selects a model architecture suitable for DP-SGD, optimized for utility. Using standard datasets from ML fairness literature, we show using a rigorous experimental protocol, that by selecting the optimal model architecture for DP-SGD, the differences across groups concerning the relevant fairness metrics (demographic parity, equalized odds and predictive parity) more often decrease or are negligibly impacted, compared to the non-private baseline, for which optimal model architecture has also been selected to maximize utility. These findings challenge the understanding that differential privacy will necessarily exacerbate unfairness in deep learning models trained on biased datasets.
Paper Structure (23 sections, 4 equations, 1 figure, 15 tables)

This paper contains 23 sections, 4 equations, 1 figure, 15 tables.

Figures (1)

  • Figure 1: Representation of our predefined model architecture. The number of hidden blocks one of the model design choices included in our grid search.