Does Machine Bring in Extra Bias in Learning? Approximating Fairness in Models Promptly
Yijun Bian, Yujie Luo
TL;DR
This work defines a manifold-based fairness framework that unifies data-related and algorithmic discrimination through set-distance measures. The core idea is to compute a harmonic fairness measure $\mathbf{df}(f)=\frac{\mathbf{D}_f(S_0,S_1)}{\mathbf{D}(S_0,S_1)}-1$ using distances between groups, and to make this computation practical with ApproxDist, a projection-based algorithm that reduces complexity to $\mathcal{O}(n\log n)$. The authors provide theoretical guarantees for ApproxDist and validate them empirically on multiple datasets, showing that HFM can capture discriminative risk and offers competitive or complementary insights beyond traditional group fairness metrics. The approach enables scalable, dual-perspective fairness analysis suitable for high-stakes domains and real-world deployment, with findings supported by extensive experiments and parameter analyses.
Abstract
Providing various machine learning (ML) applications in the real world, concerns about discrimination hidden in ML models are growing, particularly in high-stakes domains. Existing techniques for assessing the discrimination level of ML models include commonly used group and individual fairness measures. However, these two types of fairness measures are usually hard to be compatible with each other, and even two different group fairness measures might be incompatible as well. To address this issue, we investigate to evaluate the discrimination level of classifiers from a manifold perspective and propose a "harmonic fairness measure via manifolds (HFM)" based on distances between sets. Yet the direct calculation of distances might be too expensive to afford, reducing its practical applicability. Therefore, we devise an approximation algorithm named "Approximation of distance between sets (ApproxDist)" to facilitate accurate estimation of distances, and we further demonstrate its algorithmic effectiveness under certain reasonable assumptions. Empirical results indicate that the proposed fairness measure HFM is valid and that the proposed ApproxDist is effective and efficient.
