Table of Contents
Fetching ...

Differentially Private Post-Processing for Fair Regression

Ruicheng Xian, Qiaobo Li, Gautam Kamath, Han Zhao

TL;DR

The paper tackles privacy-preserving fairness for regression by proposing a post-processing pipeline that remaps outputs to satisfy statistical parity through privately estimated output distributions, a Wasserstein barycenter, and group-specific optimal transports. Privacy is achieved by privately estimating marginal PMFs with a histogram density estimator using the Laplace mechanism, followed by a re-normalization step, and the entire post-processing preserves DP via the DP post-processing theorem. A key insight is the bias-variance trade-off governed by the histogram bin count $k$: fewer bins improve fairness (via smaller KS deviation) but increase discretization error, while more bins reduce discretization error at the cost of higher variance and DP noise; theory shows an optimal scaling of $k$ with the sample size $n$ in MSE, and experiments on two public datasets illustrate the trade-offs. The approach decouples training from fairness, enabling the pre-trained regressor to be optimized for accuracy while the post-processing enforces fairness with principled guarantees, and it can be extended to approximate statistical parity and to attribute-blind settings in future work.

Abstract

This paper describes a differentially private post-processing algorithm for learning fair regressors satisfying statistical parity, addressing privacy concerns of machine learning models trained on sensitive data, as well as fairness concerns of their potential to propagate historical biases. Our algorithm can be applied to post-process any given regressor to improve fairness by remapping its outputs. It consists of three steps: first, the output distributions are estimated privately via histogram density estimation and the Laplace mechanism, then their Wasserstein barycenter is computed, and the optimal transports to the barycenter are used for post-processing to satisfy fairness. We analyze the sample complexity of our algorithm and provide fairness guarantee, revealing a trade-off between the statistical bias and variance induced from the choice of the number of bins in the histogram, in which using less bins always favors fairness at the expense of error.

Differentially Private Post-Processing for Fair Regression

TL;DR

The paper tackles privacy-preserving fairness for regression by proposing a post-processing pipeline that remaps outputs to satisfy statistical parity through privately estimated output distributions, a Wasserstein barycenter, and group-specific optimal transports. Privacy is achieved by privately estimating marginal PMFs with a histogram density estimator using the Laplace mechanism, followed by a re-normalization step, and the entire post-processing preserves DP via the DP post-processing theorem. A key insight is the bias-variance trade-off governed by the histogram bin count : fewer bins improve fairness (via smaller KS deviation) but increase discretization error, while more bins reduce discretization error at the cost of higher variance and DP noise; theory shows an optimal scaling of with the sample size in MSE, and experiments on two public datasets illustrate the trade-offs. The approach decouples training from fairness, enabling the pre-trained regressor to be optimized for accuracy while the post-processing enforces fairness with principled guarantees, and it can be extended to approximate statistical parity and to attribute-blind settings in future work.

Abstract

This paper describes a differentially private post-processing algorithm for learning fair regressors satisfying statistical parity, addressing privacy concerns of machine learning models trained on sensitive data, as well as fairness concerns of their potential to propagate historical biases. Our algorithm can be applied to post-process any given regressor to improve fairness by remapping its outputs. It consists of three steps: first, the output distributions are estimated privately via histogram density estimation and the Laplace mechanism, then their Wasserstein barycenter is computed, and the optimal transports to the barycenter are used for post-processing to satisfy fairness. We analyze the sample complexity of our algorithm and provide fairness guarantee, revealing a trade-off between the statistical bias and variance induced from the choice of the number of bins in the histogram, in which using less bins always favors fairness at the expense of error.
Paper Structure (29 sections, 12 theorems, 70 equations, 4 figures, 1 algorithm)

This paper contains 29 sections, 12 theorems, 70 equations, 4 figures, 1 algorithm.

Key Result

theorem 1

Let a regression problem be given by a joint distribution $\mu$ of $(X,A,Y)$, denote $w_a=\mathop{\mathrm{\mathbb{P}}}\nolimits(A=a)$, the Bayes regressor by $f^*(x,a)=\mathop{\mathrm{\mathbb{E}}}\nolimits[Y\mid X=x,A=a]$, and its output distribution conditioned on group $a$ by $r^*_a$. Then

Figures (4)

  • Figure 1: Illustration of the private fair post-processing steps \ref{['it:step.1']}--\ref{['it:step.3']} performed in \ref{['alg:post.proc']}. The (randomized) transports to the barycenter are represented by (sparse) $k\times k$ matrices, and the value at the $(i,j)$-th entry is the probability of transporting to bin $j$ given bin $i$.
  • Figure 2: Error-privacy-fairness trade-offs achieved by \ref{['alg:post.proc']} by sweeping $\alpha$ under the indicated number of bins $k$, with different privacy budgets $\varepsilon$. Fairness violation is measured in KS distance as defined in \ref{['def:fair']} ($\Delta_\textnormal{SP}$). Average of 50 random seeds (also for \ref{['fig:results.sweep', 'fig:results.tradeoff']}).
  • Figure 3: Error-fairness trade-offs achieved by \ref{['alg:post.proc']} on the Law School dataset by sweeping $\alpha$ and $k$ for $\varepsilon=0.1$. The black line is the lower envelope, and ends on the right at $(0.6772,0)$ (outside the cropped figure).
  • Figure 4: Error-privacy-fairness trade-offs achieved by \ref{['alg:post.proc']} on the Law School dataset by sweeping $\alpha$ and $k$ and taking the lower envelope. The line for $\varepsilon=0.1$ is the black line in \ref{['fig:results.sweep']}. All lines meet and end on the right at $(0.6772,0)$.

Theorems & Definitions (26)

  • definition 1: Differential Privacy
  • definition 2: Statistical Parity
  • theorem 1
  • theorem 2
  • theorem 3
  • definition 3: Markov Kernel
  • definition 4: Randomized Function
  • definition 5: Push-Forward Distribution
  • lemma 1
  • proof
  • ...and 16 more