Table of Contents
Fetching ...

Wasserstein projection distance for fairness testing of regression models

Wanxin Li, Yongjin P. Park, Khanh Dao Duc

TL;DR

This work extends Wasserstein-projection fairness testing from classification to regression, focusing on expectation-based fairness criteria. It formulates a hypothesis test and an optimal data-perturbation mechanism to push models toward fairness while balancing accuracy, underpinned by a dual reformulation and asymptotic theory with a chi-square limiting paradigm. The proposed framework is validated on synthetic data and real cases (student grades and housing prices), demonstrating higher specificity than permutation tests and the ability to detect and mitigate biases. Together, these contributions provide a principled, transport-based toolkit for auditing and correcting fairness in continuous-valued predictive models with practical impact for education and housing analytics.

Abstract

Fairness in machine learning is a critical concern, yet most research has focused on classification tasks, leaving regression models underexplored. This paper introduces a Wasserstein projection-based framework for fairness testing in regression models, focusing on expectation-based criteria. We propose a hypothesis-testing approach and an optimal data perturbation method to improve fairness while balancing accuracy. Theoretical results include a detailed categorization of fairness criteria for regression, a dual reformulation of the Wasserstein projection test statistic, and the derivation of asymptotic bounds and limiting distributions. Experiments on synthetic and real-world datasets demonstrate that the proposed method offers higher specificity compared to permutation-based tests, and effectively detects and mitigates biases in real applications such as student performance and housing price prediction.

Wasserstein projection distance for fairness testing of regression models

TL;DR

This work extends Wasserstein-projection fairness testing from classification to regression, focusing on expectation-based fairness criteria. It formulates a hypothesis test and an optimal data-perturbation mechanism to push models toward fairness while balancing accuracy, underpinned by a dual reformulation and asymptotic theory with a chi-square limiting paradigm. The proposed framework is validated on synthetic data and real cases (student grades and housing prices), demonstrating higher specificity than permutation tests and the ability to detect and mitigate biases. Together, these contributions provide a principled, transport-based toolkit for auditing and correcting fairness in continuous-valued predictive models with practical impact for education and housing analytics.

Abstract

Fairness in machine learning is a critical concern, yet most research has focused on classification tasks, leaving regression models underexplored. This paper introduces a Wasserstein projection-based framework for fairness testing in regression models, focusing on expectation-based criteria. We propose a hypothesis-testing approach and an optimal data perturbation method to improve fairness while balancing accuracy. Theoretical results include a detailed categorization of fairness criteria for regression, a dual reformulation of the Wasserstein projection test statistic, and the derivation of asymptotic bounds and limiting distributions. Experiments on synthetic and real-world datasets demonstrate that the proposed method offers higher specificity compared to permutation-based tests, and effectively detects and mitigates biases in real applications such as student performance and housing price prediction.

Paper Structure

This paper contains 26 sections, 10 theorems, 79 equations, 3 figures, 4 tables.

Key Result

Theorem 4.1

Suppose $\mathbb{Q} \in \mathcal{F}_\mathcal{R}$ satisfies $W_c^2(\hat{\mathbb{P}}^N, \mathbb{Q}) < \infty$, then

Figures (3)

  • Figure 1: Visualization of simulations to examine the relationship between (A) Power vs. Sample size, (B) Power vs. Effect size, (C) Power vs. Significance level and (D) Specificity vs. Significance level.
  • Figure 2: Empirical distribution of the Wasserstein projection distance versus limiting distribution for (A) N=1000 and (B) N=10000.
  • Figure 3: Scatterplot of difference in means and relative MAE scores versus correction strength $\eta$.

Theorems & Definitions (23)

  • Definition 3.1: Exact Expectation Equivalence
  • Definition 3.2: Expectation Equivalence Within a Tolerance
  • Definition 3.3: Exact Distributional Equivalence
  • Definition 3.4: Distributional Equivalence Within a Tolerance
  • Theorem 4.1
  • Theorem 4.2: Dual reformulation
  • Corollary 4.2.1: Special case of \ref{['thm:dual']}
  • Theorem 4.3: Asymptotic upper bound
  • Theorem 4.4: Limiting distribution
  • Corollary 4.4.1: Special case of \ref{['thm:limiting']}
  • ...and 13 more