When mitigating bias is unfair: multiplicity and arbitrariness in algorithmic group fairness
Natasa Krco, Thibault Laugel, Vincent Grari, Jean-Michel Loubes, Marcin Detyniecki
TL;DR
This paper addresses how bias mitigation in machine learning can be arbitrary and multiplicity-driven, even when global fairness and accuracy metrics are similar. It introduces the FRAME framework, a five-dimension evaluation tool that analyzes impact size, change direction, decision rates, affected subpopulations, and neglected subpopulations to reveal nuanced effects of debiasing methods. By applying FRAME to five tabular datasets and a range of pre-processing, in-processing, and post-processing debiasing strategies, the study demonstrates substantial differences in which individuals are affected and how global metrics may mask local unfairness. The findings argue for more transparent, multi-dimensional evaluation of debiasing processes and propose directions toward designing fairer, less arbitrary models with better consideration of individual and subpopulation impacts.
Abstract
Most research on fair machine learning has prioritized optimizing criteria such as Demographic Parity and Equalized Odds. Despite these efforts, there remains a limited understanding of how different bias mitigation strategies affect individual predictions and whether they introduce arbitrariness into the debiasing process. This paper addresses these gaps by exploring whether models that achieve comparable fairness and accuracy metrics impact the same individuals and mitigate bias in a consistent manner. We introduce the FRAME (FaiRness Arbitrariness and Multiplicity Evaluation) framework, which evaluates bias mitigation through five dimensions: Impact Size (how many people were affected), Change Direction (positive versus negative changes), Decision Rates (impact on models' acceptance rates), Affected Subpopulations (who was affected), and Neglected Subpopulations (where unfairness persists). This framework is intended to help practitioners understand the impacts of debiasing processes and make better-informed decisions regarding model selection. Applying FRAME to various bias mitigation approaches across key datasets allows us to exhibit significant differences in the behaviors of debiasing methods. These findings highlight the limitations of current fairness criteria and the inherent arbitrariness in the debiasing process.
