Table of Contents
Fetching ...

Differentially Private Fair Binary Classifications

Hrad Ghoukasian, Shahab Asoodeh

TL;DR

This work tackles binary classification under dual constraints of differential privacy and fairness, focusing on statistical parity as the fairness objective. It develops a two-stage framework: first training separate classifiers for each demographic group and then applying a post-processing step to enforce $Δ_{SP}(\hat{h})$ with minimal perturbation to the original predictions; it then extends this framework to provide $(ε',δ')$-DP guarantees using Laplace noise and DP-SGD-based base classifiers. Theoretical contributions include a lower bound on prediction changes under SP, a non-private algorithm (Algorithm 1) achieving perfect SP with optimal utility, and a private algorithm (Algorithm 2) delivering DP and SP guarantees along with average- and high-probability utility bounds. Empirically, the approach yields competitive accuracy with markedly improved fairness guarantees on the Adult and Credit Card datasets compared to DP-FERMI across multiple privacy accounting methods, demonstrating practical viability of DP+fairness in real-world tasks.

Abstract

In this work, we investigate binary classification under the constraints of both differential privacy and fairness. We first propose an algorithm based on the decoupling technique for learning a classifier with only fairness guarantee. This algorithm takes in classifiers trained on different demographic groups and generates a single classifier satisfying statistical parity. We then refine this algorithm to incorporate differential privacy. The performance of the final algorithm is rigorously examined in terms of privacy, fairness, and utility guarantees. Empirical evaluations conducted on the Adult and Credit Card datasets illustrate that our algorithm outperforms the state-of-the-art in terms of fairness guarantees, while maintaining the same level of privacy and utility.

Differentially Private Fair Binary Classifications

TL;DR

This work tackles binary classification under dual constraints of differential privacy and fairness, focusing on statistical parity as the fairness objective. It develops a two-stage framework: first training separate classifiers for each demographic group and then applying a post-processing step to enforce with minimal perturbation to the original predictions; it then extends this framework to provide -DP guarantees using Laplace noise and DP-SGD-based base classifiers. Theoretical contributions include a lower bound on prediction changes under SP, a non-private algorithm (Algorithm 1) achieving perfect SP with optimal utility, and a private algorithm (Algorithm 2) delivering DP and SP guarantees along with average- and high-probability utility bounds. Empirically, the approach yields competitive accuracy with markedly improved fairness guarantees on the Adult and Credit Card datasets compared to DP-FERMI across multiple privacy accounting methods, demonstrating practical viability of DP+fairness in real-world tasks.

Abstract

In this work, we investigate binary classification under the constraints of both differential privacy and fairness. We first propose an algorithm based on the decoupling technique for learning a classifier with only fairness guarantee. This algorithm takes in classifiers trained on different demographic groups and generates a single classifier satisfying statistical parity. We then refine this algorithm to incorporate differential privacy. The performance of the final algorithm is rigorously examined in terms of privacy, fairness, and utility guarantees. Empirical evaluations conducted on the Adult and Credit Card datasets illustrate that our algorithm outperforms the state-of-the-art in terms of fairness guarantees, while maintaining the same level of privacy and utility.
Paper Structure (6 sections, 4 theorems, 31 equations, 13 tables, 2 algorithms)

This paper contains 6 sections, 4 theorems, 31 equations, 13 tables, 2 algorithms.

Key Result

Proposition 1

Let $h_0^*: \mathcal{X}\rightarrow \{0,1\}$ and $h_1^*: \mathcal{X}\rightarrow \{0,1\}$, be arbitrary classifiers trained on subgroups specified by the sensitive attribute $A=0$ and $A=1$, respectively. If a predictor $\hat{Y}=\hat{h}(X,A)$ satisfies $\gamma$-statistical parity, then

Theorems & Definitions (10)

  • Definition 1
  • Definition 2
  • Proposition 1
  • Theorem 2
  • Theorem 3
  • Proposition 4
  • proof : Proof of Proposition \ref{['proposition1']}
  • proof : Proof of Theorem \ref{['algorithm1guarantees']}
  • proof : Proof of Theorem \ref{['alg2WHP']}
  • proof : Proof of Proposition \ref{['alg2Exp']}