Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification

A. Feder Cooper; Katherine Lee; Madiha Zahrah Choksi; Solon Barocas; Christopher De Sa; James Grimmelmann; Jon Kleinberg; Siddhartha Sen; Baobao Zhang

Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification

A. Feder Cooper, Katherine Lee, Madiha Zahrah Choksi, Solon Barocas, Christopher De Sa, James Grimmelmann, Jon Kleinberg, Siddhartha Sen, Baobao Zhang

TL;DR

Most fair binary classification benchmarks are close-to-fair when taking into account the amount of arbitrariness present in predictions -- before the authors even try to apply any fairness interventions, which calls into question the practical utility of common algorithmic fairness methods.

Abstract

Variance in predictions across different trained models is a significant, under-explored source of error in fair binary classification. In practice, the variance on some data examples is so large that decisions can be effectively arbitrary. To investigate this problem, we take an experimental approach and make four overarching contributions: We: 1) Define a metric called self-consistency, derived from variance, which we use as a proxy for measuring and reducing arbitrariness; 2) Develop an ensembling algorithm that abstains from classification when a prediction would be arbitrary; 3) Conduct the largest to-date empirical study of the role of variance (vis-a-vis self-consistency and arbitrariness) in fair binary classification; and, 4) Release a toolkit that makes the US Home Mortgage Disclosure Act (HMDA) datasets easily usable for future research. Altogether, our experiments reveal shocking insights about the reliability of conclusions on benchmark datasets. Most fair binary classification benchmarks are close-to-fair when taking into account the amount of arbitrariness present in predictions -- before we even try to apply any fairness interventions. This finding calls into question the practical utility of common algorithmic fairness methods, and in turn suggests that we should reconsider how we choose to measure fairness in binary classification.

Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification

TL;DR

Abstract

Paper Structure (47 sections, 26 equations, 68 figures, 7 tables, 1 algorithm)

This paper contains 47 sections, 26 equations, 68 figures, 7 tables, 1 algorithm.

Introduction
Preliminaries on Fair Binary Classification
Problem formulation
Empirical approximation of the formulation
Variance, Self-Consistency, and Arbitrariness
Arbitrariness resembles statistical variance
Defining self-consistency from variance
Illustrating self-consistency in practice
Accounting for Self-Consistency
Experiments
Validating Algorithm \ref{['algo:bagging-confidently']}
A problem of empiriclal algorithmic fairness
Discussion and Related Work
Extended Preliminaries
Notes on notation and on our choice of terminology
...and 32 more sections

Figures (68)

Figure 1: 100 bootstrapped logistic regression models show models can be very consistent in predictions $\hat{y}$ for some individuals (Ind. 1) and arbitrary for others (Ind. 2).
Figure 2: COMPAS split by $\texttt{race}$; random forests (RFs)
Figure 3: Old Adult split by $\texttt{sex}$; random forests (RFs)
Figure 5: Old Adult split by $\texttt{sex}$
Figure 6: HMDA-NY-2017 split by $\texttt{ethnicity}$
...and 63 more figures

Theorems & Definitions (8)

Definition 1
Definition 2
Definition 3
proof
proof
Definition 4
Definition 5
proof

Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification

TL;DR

Abstract

Arbitrariness and Social Prediction: The Confounding Role of Variance in Fair Classification

Authors

TL;DR

Abstract

Table of Contents

Figures (68)

Theorems & Definitions (8)