Beyond Incompatibility: Trade-offs between Mutually Exclusive Fairness Criteria in Machine Learning and Law
Meike Zehlike, Alex Loosley, Håkan Jonsson, Emil Wiedemann, Philipp Hacker
TL;DR
This work tackles the incompatibility of three key ML fairness notions—calibration within groups, balance for the negative class, and balance for the positive class—by introducing FAIM, a post-processing algorithm that continuously interpolates between them using optimal transport. FAIM constructs a per-group mapping via three criterion-specific distributions, computes their Wasserstein-2 barycenter with weights $\theta^A,\theta^B,\theta^C$, and applies an OT map to obtain fair scores, thereby producing a tunable compromise rather than an absolute trade-off. The authors validate FAIM on synthetic data, the COMPAS recidivism dataset, and a Zalando e-commerce ranking dataset, illustrating how different weightings yield calibrated, balanced, or intermediate fairness outcomes while preserving as much predictive performance as possible. They also discuss the normative and legal implications, showing how FAIM can help align ML systems with EU regulatory regimes such as the AI Act and the Digital Markets Act, albeit with careful attention to ground-truth reliability and the limits of post-hoc fairness adjustments. Overall, FAIM provides a rigorous, flexible framework for operationalizing fairness trade-offs in high-stakes domains and for translating technical fairness objectives into legally interpretable and auditable policies.
Abstract
Fair and trustworthy AI is becoming ever more important in both machine learning and legal domains. One important consequence is that decision makers must seek to guarantee a 'fair', i.e., non-discriminatory, algorithmic decision procedure. However, there are several competing notions of algorithmic fairness that have been shown to be mutually incompatible under realistic factual assumptions. This concerns, for example, the widely used fairness measures of 'calibration within groups' and 'balance for the positive/negative class'. In this paper, we present a novel algorithm (FAir Interpolation Method: FAIM) for continuously interpolating between these three fairness criteria. Thus, an initially unfair prediction can be remedied to, at least partially, meet a desired, weighted combination of the respective fairness conditions. We demonstrate the effectiveness of our algorithm when applied to synthetic data, the COMPAS data set, and a new, real-world data set from the e-commerce sector. Finally, we discuss to what extent FAIM can be harnessed to comply with conflicting legal obligations. The analysis suggests that it may operationalize duties in traditional legal fields, such as credit scoring and criminal justice proceedings, but also for the latest AI regulations put forth in the EU, like the Digital Markets Act and the recently enacted AI Act.
