Fairness-Accuracy Trade-Offs: A Causal Perspective
Drago Plecko, Elias Bareinboim
TL;DR
This work reframes fairness-accuracy trade-offs through causal reasoning, introducing path-specific excess loss (PSEL) and total excess loss (TEL) to quantify predictor loss when enforcing fairness along chosen causal pathways, and a causal fairness/utility ratio (CFUR) to compare fairness gains against cost in accuracy. It proves TEL can be decomposed into a sum of path-specific losses, and provides an algorithm (PSPL-attribution) with a Shapley-value interpretation to attribute excess loss to direct, indirect, and spurious pathways. A practical training approach, Causal Fair Constrained Learning (CFCL), uses a Lagrangian objective and data-driven lambda selection to learn causally-fair predictors $\widehat{Y}^S(\lambda)$, scalable to multiple pathways. Experiments on Census 2018 (and extended analyses on COMPAS and UCI Credit) show that removing causal pathways can reduce group disparity with varying costs in predictive utility, and CFUR serves as a useful metric for comparing trade-offs. Overall, the paper provides a principled framework linking causal discrimination measures with predictive utility, grounded in both algorithmic design and legal notions of business necessity.
Abstract
Systems based on machine learning may exhibit discriminatory behavior based on sensitive characteristics such as gender, sex, religion, or race. In light of this, various notions of fairness and methods to quantify discrimination were proposed, leading to the development of numerous approaches for constructing fair predictors. At the same time, imposing fairness constraints may decrease the utility of the decision-maker, highlighting a tension between fairness and utility. This tension is also recognized in legal frameworks, for instance in the disparate impact doctrine of Title VII of the Civil Rights Act of 1964 -- in which specific attention is given to considerations of business necessity -- possibly allowing the usage of proxy variables associated with the sensitive attribute in case a high-enough utility cannot be achieved without them. In this work, we analyze the tension between fairness and accuracy from a causal lens for the first time. We introduce the notion of a path-specific excess loss (PSEL) that captures how much the predictor's loss increases when a causal fairness constraint is enforced. We then show that the total excess loss (TEL), defined as the difference between the loss of predictor fair along all causal pathways vs. an unconstrained predictor, can be decomposed into a sum of more local PSELs. At the same time, enforcing a causal constraint often reduces the disparity between demographic groups. Thus, we introduce a quantity that summarizes the fairness-utility trade-off, called the causal fairness/utility ratio, defined as the ratio of the reduction in discrimination vs. the excess loss from constraining a causal pathway. This quantity is suitable for comparing the fairness-utility trade-off across causal pathways. Finally, as our approach requires causally-constrained fair predictors, we introduce a new neural approach for causally-constrained fair learning.
