Reranking individuals: The effect of fair classification within-groups
Sofie Goethals, Marco Favier, Toon Calders
TL;DR
This paper investigates the within-group effects of bias mitigation in fair classification, arguing that traditional between-group fairness analyses miss important intra-group reranking dynamics. By formalizing a framework with a biased score $S(x,a)$ and a fair probability $p(Y=1\vert X=x,A=a)$, it shows that, in the absence of within-group bias, fair decisions can be decomposed into group-specific thresholds, making threshold optimization a powerful baseline. Using five real-world datasets and a suite of bias mitigation methods from AIF360, the study demonstrates that preprocessing and inprocessing methods often substantially alter intra-group rankings, while postprocessing methods mainly adjust labels without changing the underlying score rankings. It concludes that evaluating bias mitigation should prioritize prediction scores (AUC) and per-group performance, rather than solely relying on final labels, to capture real-world constraints and fairness outcomes, and discusses the affirmative-action-like decomposition as a condition under which within-group reranking is unnecessary.
Abstract
Artificial Intelligence (AI) finds widespread application across various domains, but it sparks concerns about fairness in its deployment. The prevailing discourse in classification often emphasizes outcome-based metrics comparing sensitive subgroups without a nuanced consideration of the differential impacts within subgroups. Bias mitigation techniques not only affect the ranking of pairs of instances across sensitive groups, but often also significantly affect the ranking of instances within these groups. Such changes are hard to explain and raise concerns regarding the validity of the intervention. Unfortunately, these effects remain under the radar in the accuracy-fairness evaluation framework that is usually applied. Additionally, we illustrate the effect of several popular bias mitigation methods, and how their output often does not reflect real-world scenarios.
