To Pool or Not To Pool: Analyzing the Regularizing Effects of Group-Fair Training on Shared Models

Cyrus Cousins; I. Elizabeth Kumar; Suresh Venkatasubramanian

To Pool or Not To Pool: Analyzing the Regularizing Effects of Group-Fair Training on Shared Models

Cyrus Cousins, I. Elizabeth Kumar, Suresh Venkatasubramanian

TL;DR

This work derives group-specific bounds on the generalization error of welfare-centric fair machine learning that benefit from the larger sample size of the majority group, by considering group-specific Rademacher averages over a restricted hypothesis class.

Abstract

In fair machine learning, one source of performance disparities between groups is over-fitting to groups with relatively few training samples. We derive group-specific bounds on the generalization error of welfare-centric fair machine learning that benefit from the larger sample size of the majority group. We do this by considering group-specific Rademacher averages over a restricted hypothesis class, which contains the family of models likely to perform well with respect to a fair learning objective (e.g., a power-mean). Our simulations demonstrate these bounds improve over a naive method, as expected by theory, with particularly significant improvement for smaller group sizes.

To Pool or Not To Pool: Analyzing the Regularizing Effects of Group-Fair Training on Shared Models

TL;DR

Abstract

Paper Structure (19 sections, 8 theorems, 44 equations, 4 figures, 2 tables)

This paper contains 19 sections, 8 theorems, 44 equations, 4 figures, 2 tables.

INTRODUCTION
BACKGROUND
Preliminaries
Group-Fair Learning
On Malfare Functions
Statistical Background
Related work
Multitask learning
To pool or not to pool
BOUNDING GENERALIZATION ERROR IN FAIR TRAINING
Theoretical Restricted Classes
Empirical Restricted Classes
Monte-Carlo Rademacher Averages of Linear Hypothesis Classes
Estimation
Visualizing $\hat{\mathcal{H}}_i$ in least-squares regression
...and 4 more sections

Key Result

Theorem 2

Suppose a monotonic malfare function$\newline\raisebox{1.53ex}{$\mathrm{W}$}(\cdot): \mathbb{R}^{g} \to \mathbb{R}$, hypothesis class$\mathcal{H} \subseteq \mathcal{X} \to \mathcal{Y}'$, loss function$\ell: \mathcal{Y}' \times \mathcal{Y} \to \mathbb{R}$, per-group distributions$\mathcal{D}_{1:g}$ o

Figures (4)

Figure 1: Visualization of unrestricted class $\mathcal{H}$, theoretical restricted class $\mathcal{H}_{i}^{*}$, and samples of empirical restricted class $\hat{\mathcal{H}}_{i}$ (varying $\bm{z}_{i}$). One possible empirical malfare minimizer $\hat{h}$ (contained by $\hat{\mathcal{H}}_{i}$ and $\mathcal{H}_{i}^{*}$ with high probability), as well as the true malfare minimzer $h^{*}$ (which may fall outside of $\mathcal{H}_{i}^{*}$ or $\hat{\mathcal{H}}_{i}$ due to overfitting to groups other than $i$) are also shown.
Figure 2: Rademacher average samples in the parameter space of $\hat{\mathcal{H}}_i$ for each group $i \in \{1, 2, 3\}$.
Figure 3: Average test risk of pooled and separately trained models on three groups (see \ref{['tab:logistic-regression-data']}).
Figure 4: Generalization error bounds derived from original hypothesis class $\mathcal{H}$ and restricted hypothesis classes $\hat{\mathcal{H}}_i$, compared with shared model $\hat{h}$ train-test gap over 7 independent runs, with quartiles and median trend lines.

Theorems & Definitions (14)

Definition 1: Power-Mean Malfare
Definition 2: Rademacher Averages
Theorem 2: Theoretical Group-Regularized Malfare Bounds
Theorem 2: Empirical Group-Regularized Malfare Bounds
Corollary 2: Empirical Malfare Generalization Bounds
Lemma 2: Convex Optimization for Monte-Carlo Rademacher Averages
Theorem 2: Theoretical Group-Regularized Malfare Bounds
proof
Theorem 2: Empirical Group-Regularized Malfare Bounds
proof
...and 4 more

To Pool or Not To Pool: Analyzing the Regularizing Effects of Group-Fair Training on Shared Models

TL;DR

Abstract

To Pool or Not To Pool: Analyzing the Regularizing Effects of Group-Fair Training on Shared Models

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (14)