BaBE: Enhancing Fairness via Estimation of Latent Explaining Variables
Ruta Binkyte, Daniele Gorla, Catuscia Palamidessi
TL;DR
BaBE tackles unfair discrimination when the legitimate explaining variable $E$ is latent and only a biased proxy $Z$ is observed. By combining Bayesian inference with the Expectation-Maximization (EM) algorithm, BaBE estimates $\mathbb{P}[E|S]$ from data and then derives $\hat{\mathbb{P}}[E|Z,S]$, so that decisions based on the inferred $E$ can satisfy conditional statistical parity (CSP) and equal opportunity (EO). The method includes two practical decision strategies. It is evaluated on synthetic data with distribution shifts and on the NHANES dataset, where it shows good fairness and accuracy and remains robust when $\mathbb{P}[E|S]$ changes across populations. BaBE does not assume independence between $E$ and $S$, and it can transfer causal knowledge through the bias mechanism $\mathbb{P}[Z|E,S]$, which makes it a principled pre-processing approach for fair decision-making across domains. Overall, BaBE offers a scalable, data-efficient framework for fairness with a latent explaining variable, targeting CSP and EO while preserving predictive performance.
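As a reading aid, the step from the estimated $\hat{\mathbb{P}}[E|S]$ to $\hat{\mathbb{P}}[E|Z,S]$ can be written as a direct application of Bayes' theorem. The form below assumes that $E$ and $Z$ are discrete and that the bias mechanism $\mathbb{P}[Z|E,S]$ is available; the summation index $e'$ is notation introduced here for illustration.

$$
\hat{\mathbb{P}}[E=e \mid Z=z, S=s] \;=\; \frac{\mathbb{P}[Z=z \mid E=e, S=s]\;\hat{\mathbb{P}}[E=e \mid S=s]}{\sum_{e'} \mathbb{P}[Z=z \mid E=e', S=s]\;\hat{\mathbb{P}}[E=e' \mid S=s]}
$$

The denominator is the marginal probability of observing $Z=z$ in group $s$, so the posterior sums to one over the values of $E$.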
Abstract
We consider the problem of unfair discrimination between two groups and propose a pre-processing method to achieve fairness. Corrective methods such as statistical parity usually lead to poor accuracy and do not truly achieve fairness when there is a correlation between the sensitive attribute S and the legitimate attribute E (the explanatory variable) that should determine the decision. To overcome these drawbacks, other notions of fairness have been proposed, in particular conditional statistical parity and equal opportunity. However, E is often not directly observable in the data, i.e., it is a latent variable. We may observe some other variable Z that represents E, but Z may also be affected by S, and hence may itself be biased. To deal with this problem, we propose BaBE (Bayesian Bias Elimination), an approach that combines Bayesian inference with the Expectation-Maximization method to estimate the most likely value of E for a given Z in each group. The decision can then be based directly on the estimated E. We show, through experiments on synthetic and real data sets, that our approach provides a good level of fairness as well as high accuracy.
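The estimation idea described above can be illustrated with a minimal sketch for a single group. It is not the authors' implementation. It assumes that E and Z are discrete, that the bias mechanism P[Z|E,S=s] for the group is known, and that only counts of Z are observed. The function names `em_prior` and `posterior` and all numbers are hypothetical, chosen only for illustration.

```python
def em_prior(likelihood, z_counts, n_iter=2000, tol=1e-12):
    """Estimate P[E | S=s] by EM for one group s (illustrative sketch).

    likelihood[e][z] is the assumed-known bias mechanism P[Z=z | E=e, S=s].
    z_counts[z] is how many times Z=z was observed in group s.
    """
    n_e, n_z = len(likelihood), len(z_counts)
    total = float(sum(z_counts))
    q = [c / total for c in z_counts]      # empirical distribution of Z
    prior = [1.0 / n_e] * n_e              # start from a uniform guess
    for _ in range(n_iter):
        # Marginal P[Z=z] implied by the current prior estimate.
        marg = [sum(prior[e] * likelihood[e][z] for e in range(n_e))
                for z in range(n_z)]
        # E-step and M-step combined: average the posterior of E over the data.
        new_prior = [
            sum(q[z] * prior[e] * likelihood[e][z] / marg[z]
                for z in range(n_z) if marg[z] > 0)
            for e in range(n_e)
        ]
        delta = max(abs(a - b) for a, b in zip(new_prior, prior))
        prior = new_prior
        if delta < tol:
            break
    return prior


def posterior(likelihood, prior):
    """Return post[z][e] = P[E=e | Z=z, S=s] via Bayes' theorem."""
    n_e, n_z = len(prior), len(likelihood[0])
    post = []
    for z in range(n_z):
        joint = [prior[e] * likelihood[e][z] for e in range(n_e)]
        m = sum(joint)
        post.append([j / m if m > 0 else 0.0 for j in joint])
    return post


# Hypothetical bias mechanism for one group: rows are E, columns are Z.
likelihood = [
    [0.80, 0.15, 0.05],
    [0.10, 0.80, 0.10],
    [0.05, 0.15, 0.80],
]
z_counts = [440, 345, 215]                 # made-up observed counts of Z

prior_hat = em_prior(likelihood, z_counts)
post = posterior(likelihood, prior_hat)
# Most likely value of E for each observed value of Z in this group.
e_hat = [max(range(len(row)), key=row.__getitem__) for row in post]
```

Running the same procedure separately for each value of S yields one estimated prior and one posterior table per group, which is what allows the most likely E to differ between groups for the same observed Z.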
