Intrinsic Fairness-Accuracy Tradeoffs under Equalized Odds
Meiyu Zhong, Ravi Tandon
TL;DR
The paper addresses the fundamental problem of fairness-accuracy tradeoffs for binary classification under equalized odds (EO). It derives a classifier-independent unconstrained upper bound on accuracy and, crucially, an EO-budget bound that incorporates subgroup TV distances and subgroup proportions, linking accuracy limits to data statistics via $d_{TV}(P_1^a,P_0^a)$, $d_{TV}(P_1^b,P_0^b)$, $\alpha$, and $\beta$. The bounds are demonstrated to be tight in simple regimes and are estimable from real data using variational TV estimators; experiments on COMPAS, Adult, and Law School datasets compare these bounds to the performance of EO-fair classifiers, illustrating fundamental limits when subgroup disparities are large. Overall, the work provides theoretical guarantees and practical guidance for designing and evaluating fair classifiers under EO constraints, with implications for policy and deployment in high-stakes domains.
Abstract
With the growing adoption of machine learning (ML) systems in areas like law enforcement, criminal justice, finance, hiring, and admissions, it is increasingly critical to guarantee the fairness of decisions assisted by ML. In this paper, we study the tradeoff between fairness and accuracy under the statistical notion of equalized odds. We present a new upper bound on the accuracy (that holds for any classifier), as a function of the fairness budget. In addition, our bounds also exhibit dependence on the underlying statistics of the data, labels and the sensitive group attributes. We validate our theoretical upper bounds through empirical analysis on three real-world datasets: COMPAS, Adult, and Law School. Specifically, we compare our upper bound to the tradeoffs that are achieved by various existing fair classifiers in the literature. Our results show that achieving high accuracy subject to a low-bias could be fundamentally limited based on the statistical disparity across the groups.
