Differentially Private Post-Processing for Fair Regression

Ruicheng Xian; Qiaobo Li; Gautam Kamath; Han Zhao

Differentially Private Post-Processing for Fair Regression

Ruicheng Xian, Qiaobo Li, Gautam Kamath, Han Zhao

TL;DR

The paper tackles privacy-preserving fairness for regression by proposing a post-processing pipeline that remaps outputs to satisfy statistical parity through privately estimated output distributions, a Wasserstein barycenter, and group-specific optimal transports. Privacy is achieved by privately estimating marginal PMFs with a histogram density estimator using the Laplace mechanism, followed by a re-normalization step, and the entire post-processing preserves DP via the DP post-processing theorem. A key insight is the bias-variance trade-off governed by the histogram bin count $k$: fewer bins improve fairness (via smaller KS deviation) but increase discretization error, while more bins reduce discretization error at the cost of higher variance and DP noise; theory shows an optimal scaling of $k$ with the sample size $n$ in MSE, and experiments on two public datasets illustrate the trade-offs. The approach decouples training from fairness, enabling the pre-trained regressor to be optimized for accuracy while the post-processing enforces fairness with principled guarantees, and it can be extended to approximate statistical parity and to attribute-blind settings in future work.

Abstract

This paper describes a differentially private post-processing algorithm for learning fair regressors satisfying statistical parity, addressing privacy concerns of machine learning models trained on sensitive data, as well as fairness concerns of their potential to propagate historical biases. Our algorithm can be applied to post-process any given regressor to improve fairness by remapping its outputs. It consists of three steps: first, the output distributions are estimated privately via histogram density estimation and the Laplace mechanism, then their Wasserstein barycenter is computed, and the optimal transports to the barycenter are used for post-processing to satisfy fairness. We analyze the sample complexity of our algorithm and provide fairness guarantee, revealing a trade-off between the statistical bias and variance induced from the choice of the number of bins in the histogram, in which using less bins always favors fairness at the expense of error.

Differentially Private Post-Processing for Fair Regression

TL;DR

: fewer bins improve fairness (via smaller KS deviation) but increase discretization error, while more bins reduce discretization error at the cost of higher variance and DP noise; theory shows an optimal scaling of

with the sample size

in MSE, and experiments on two public datasets illustrate the trade-offs. The approach decouples training from fairness, enabling the pre-trained regressor to be optimized for accuracy while the post-processing enforces fairness with principled guarantees, and it can be extended to approximate statistical parity and to attribute-blind settings in future work.

Abstract

Paper Structure (29 sections, 12 theorems, 70 equations, 4 figures, 1 algorithm)

This paper contains 29 sections, 12 theorems, 70 equations, 4 figures, 1 algorithm.

Introduction
Paper Organization.
Related Work
Differential Privacy.
Algorithmic Fairness.
Fairness and Privacy.
Preliminaries
Fair and Private Post-Processing
Fair Post-Processing with Wasserstein-Barycenters
Generalization to Approximate Statistical Parity
Privacy via Discretization and Private PMF Estimation
Step \ref{['it:step.1']} (Estimate Output Distributions).
Steps \ref{['it:step.2']} and \ref{['it:step.3']} (Compute Barycenter and Optimal Transports).
Post-Processed Fair Regressor.
Statistical Analysis
...and 14 more sections

Key Result

theorem 1

Let a regression problem be given by a joint distribution $\mu$ of $(X,A,Y)$, denote $w_a=\mathop{\mathrm{\mathbb{P}}}\nolimits(A=a)$, the Bayes regressor by $f^*(x,a)=\mathop{\mathrm{\mathbb{E}}}\nolimits[Y\mid X=x,A=a]$, and its output distribution conditioned on group $a$ by $r^*_a$. Then

Figures (4)

Figure 1: Illustration of the private fair post-processing steps \ref{['it:step.1']}--\ref{['it:step.3']} performed in \ref{['alg:post.proc']}. The (randomized) transports to the barycenter are represented by (sparse) $k\times k$ matrices, and the value at the $(i,j)$-th entry is the probability of transporting to bin $j$ given bin $i$.
Figure 2: Error-privacy-fairness trade-offs achieved by \ref{['alg:post.proc']} by sweeping $\alpha$ under the indicated number of bins $k$, with different privacy budgets $\varepsilon$. Fairness violation is measured in KS distance as defined in \ref{['def:fair']} ($\Delta_\textnormal{SP}$). Average of 50 random seeds (also for \ref{['fig:results.sweep', 'fig:results.tradeoff']}).
Figure 3: Error-fairness trade-offs achieved by \ref{['alg:post.proc']} on the Law School dataset by sweeping $\alpha$ and $k$ for $\varepsilon=0.1$. The black line is the lower envelope, and ends on the right at $(0.6772,0)$ (outside the cropped figure).
Figure 4: Error-privacy-fairness trade-offs achieved by \ref{['alg:post.proc']} on the Law School dataset by sweeping $\alpha$ and $k$ and taking the lower envelope. The line for $\varepsilon=0.1$ is the black line in \ref{['fig:results.sweep']}. All lines meet and end on the right at $(0.6772,0)$.

Theorems & Definitions (26)

definition 1: Differential Privacy
definition 2: Statistical Parity
theorem 1
theorem 2
theorem 3
definition 3: Markov Kernel
definition 4: Randomized Function
definition 5: Push-Forward Distribution
lemma 1
proof
...and 16 more

Differentially Private Post-Processing for Fair Regression

TL;DR

Abstract

Differentially Private Post-Processing for Fair Regression

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (26)