How Far Can Fairness Constraints Help Recover From Biased Data?
Mohit Sharma, Amit Deshpande
TL;DR
This work shows that carefully chosen fairness constraints, notably equal opportunity, can recover optimal and fair classifiers on the original distribution even when training data are severely biased. By revealing that several data-bias mechanisms induce a linear fractional transformation of the group-aware regression function, the authors derive threshold-based characterizations that align biased-distribution optima with the Bayes optimal fair classifier under concrete conditions. The contributions include extending recovery to Massart noise, introducing reject-option and ε-robustness frameworks for arbitrary distributions and hypothesis classes, and modeling time-varying bias in multi-stage pipelines with both infinite and finite horizons. These results highlight a principled route to mitigating bias-induced fairness-accuracy trade-offs in practical settings, with implications for robust, fair decision-making in dynamic environments.
Abstract
A general belief in fair classification is that fairness constraints incur a trade-off with accuracy, which biased data may worsen. Contrary to this belief, Blum & Stangl (2019) show that fair classification with equal opportunity constraints even on extremely biased data can recover optimally accurate and fair classifiers on the original data distribution. Their result is interesting because it demonstrates that fairness constraints can implicitly rectify data bias and simultaneously overcome a perceived fairness-accuracy trade-off. Their data bias model simulates under-representation and label bias in underprivileged population, and they show the above result on a stylized data distribution with i.i.d. label noise, under simple conditions on the data distribution and bias parameters. We propose a general approach to extend the result of Blum & Stangl (2019) to different fairness constraints, data bias models, data distributions, and hypothesis classes. We strengthen their result, and extend it to the case when their stylized distribution has labels with Massart noise instead of i.i.d. noise. We prove a similar recovery result for arbitrary data distributions using fair reject option classifiers. We further generalize it to arbitrary data distributions and arbitrary hypothesis classes, i.e., we prove that for any data distribution, if the optimally accurate classifier in a given hypothesis class is fair and robust, then it can be recovered through fair classification with equal opportunity constraints on the biased distribution whenever the bias parameters satisfy certain simple conditions. Finally, we show applications of our technique to time-varying data bias in classification and fair machine learning pipelines.
