Standardized Interpretable Fairness Measures for Continuous Risk Scores

Ann-Kristin Becker; Oana Dumitrasc; Klaus Broelemann

Standardized Interpretable Fairness Measures for Continuous Risk Scores

Ann-Kristin Becker, Oana Dumitrasc, Klaus Broelemann

TL;DR

This work tackles fairness for continuous risk scores by extending three fundamental parity concepts—independence, separation, and sufficiency—to score-based disparities using a standardized Wasserstein-distance framework over the joint distribution $(S,A,Y)$. It introduces two main measures, bias$^{\mathcal{U}}$ and bias$^{S}$, which relate to the Wasserstein-1 distance and are invariant to monotone transformations, enabling meaningful comparisons across models and datasets. The authors connect these score biases to ROC-based metrics, show theoretical bounds and decompositions, and validate the approach on COMPAS, German Credit, UCI Adult, and synthetic data, illustrating practical insight for detecting and interpreting bias in scoring systems. The proposed methodology offers interpretable, distribution-aware fairness diagnostics that can guide debiasing decisions and monitoring across time and different populations, with potential extensions to broader Wasserstein distances and causal fairness analyses.

Abstract

We propose a standardized version of fairness measures for continuous scores with a reasonable interpretation based on the Wasserstein distance. Our measures are easily computable and well suited for quantifying and interpreting the strength of group disparities as well as for comparing biases across different models, datasets, or time points. We derive a link between the different families of existing fairness measures for scores and show that the proposed standardized fairness measures outperform ROC-based fairness measures because they are more explicit and can quantify significant biases that ROC-based fairness measures miss.

Standardized Interpretable Fairness Measures for Continuous Risk Scores

TL;DR

. It introduces two main measures, bias

and bias

, which relate to the Wasserstein-1 distance and are invariant to monotone transformations, enabling meaningful comparisons across models and datasets. The authors connect these score biases to ROC-based metrics, show theoretical bounds and decompositions, and validate the approach on COMPAS, German Credit, UCI Adult, and synthetic data, illustrating practical insight for detecting and interpreting bias in scoring systems. The proposed methodology offers interpretable, distribution-aware fairness diagnostics that can guide debiasing decisions and monitoring across time and different populations, with potential extensions to broader Wasserstein distances and causal fairness analyses.

Abstract

Paper Structure (29 sections, 22 theorems, 49 equations, 7 figures, 5 tables)

This paper contains 29 sections, 22 theorems, 49 equations, 7 figures, 5 tables.

Introduction
Parity concepts and fairness measures for classifiers
Independence (selection rate parity)
Separation (error rate parity)
Sufficiency (predictive value parity)
Generalization to continuous risk scores
Expected classifier bias with uniformly weighted thresholds
Standardized Measures
Interpretation of the score bias
Positive and negative components of the score bias
ROC-based fairness measures and relations
Relating Wasserstein and ROC biases
Experiments
COMPAS
German Credit Data
...and 14 more sections

Key Result

Theorem 3.2

For the concepts independence and separation, i.e for $x \in$ {IND, PE, EO}, it holds:

Figures (7)

Figure 2: Changing bias measures with increasing distance between the groups and classes.
Figure : (a) $\mathop{\mathrm{bias}}\nolimits_{\text{EO}}^\mathcal{U}$
Figure C1: Distribution of logistic regression scores, trained on Adult data.
Figure C2: Distribution of logistic regression scores, trained on Adult data without protected attribute.
Figure C3: Distribution of XGBoost scores trained on Adult data.
...and 2 more figures

Theorems & Definitions (44)

Definition 3.1
Theorem 3.2
Definition 3.3
Theorem 3.4
Definition 4.1: ROC
Definition 4.2
Definition 4.3
Theorem 4.4
Proposition 4.5
Lemma 4.6
...and 34 more

Standardized Interpretable Fairness Measures for Continuous Risk Scores

TL;DR

Abstract

Standardized Interpretable Fairness Measures for Continuous Risk Scores

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (44)