Aleatoric and Epistemic Discrimination: Fundamental Limits of Fairness Interventions
Hao Wang, Luxi He, Rui Gao, Flavio P. Calmon
TL;DR
The paper tackles algorithmic discrimination by separating data-inherent (aleatoric) biases from model-development (epistemic) choices. It defines the fairness Pareto frontier $ontname{phv} extit{FairFront}(oldsymbol{ extalpha}_{ ext{SP}},oldsymbol{ extalpha}_{ ext{EO}},oldsymbol{ extalpha}_{ ext{OAE}})$ as the maximum achievable accuracy under group-fairness constraints and characterizes the feasible set of conditional prediction distributions $P_{ exthat{Y}| extS, extY}$ using Blackwell's comparison theorems. A greedy algorithm with convergence guarantees provides an upper-bound approximation to the frontier, enabling benchmarking of existing fairness interventions. Empirical studies across standard tabular datasets show that state-of-the-art (SOTA) fairness methods closely approach the information-theoretic frontier for epistemic discrimination under pristine data, but disparate missing data patterns reveal substantial aleatoric discrimination and reduce intervention effectiveness. The framework offers guidance for data collection and missing-data handling to promote fair and accurate downstream decisions."
Abstract
Machine learning (ML) models can underperform on certain population groups due to choices made during model development and bias inherent in the data. We categorize sources of discrimination in the ML pipeline into two classes: aleatoric discrimination, which is inherent in the data distribution, and epistemic discrimination, which is due to decisions made during model development. We quantify aleatoric discrimination by determining the performance limits of a model under fairness constraints, assuming perfect knowledge of the data distribution. We demonstrate how to characterize aleatoric discrimination by applying Blackwell's results on comparing statistical experiments. We then quantify epistemic discrimination as the gap between a model's accuracy when fairness constraints are applied and the limit posed by aleatoric discrimination. We apply this approach to benchmark existing fairness interventions and investigate fairness risks in data with missing values. Our results indicate that state-of-the-art fairness interventions are effective at removing epistemic discrimination on standard (overused) tabular datasets. However, when data has missing values, there is still significant room for improvement in handling aleatoric discrimination.
