Table of Contents
Fetching ...

Fairness via Independence: A (Conditional) Distance Covariance Framework

Ruifan Huang, Haixia Liu

TL;DR

This work frames fairness as statistical independence between predictions and sensitive attributes, leveraging empirical distance covariance (DC) and conditional distance covariance (CDC) as train-time penalties. It introduces matrix forms for efficient parallel computation and provides convergence guarantees for empirical- population distance covariance in batch settings. A Lagrangian dual approach dynamically balances accuracy and fairness, demonstrated across tabular and image datasets (including CelebA and UTKFace) with competitive DP/EO trade-offs. The results highlight the method's versatility, scalability, and applicability to high-dimensional, tensor-valued data without relying on strong distributional assumptions.

Abstract

We explore fairness from a statistical perspective by selectively utilizing either conditional distance covariance or distance covariance statistics as measures to assess the independence between predictions and sensitive attributes. We boost fairness with independence by adding a distance covariance-based penalty to the model's training. Additionally, we present the matrix form of empirical (conditional) distance covariance for parallel calculations to enhance computational efficiency. Theoretically, we provide a proof for the convergence between empirical and population (conditional) distance covariance, establishing necessary guarantees for batch computations. Through experiments conducted on a range of real-world datasets, we have demonstrated that our method effectively bridges the fairness gap in machine learning. Our code is available at \url{https://github.com/liuhaixias1/Fair_dc/}.

Fairness via Independence: A (Conditional) Distance Covariance Framework

TL;DR

This work frames fairness as statistical independence between predictions and sensitive attributes, leveraging empirical distance covariance (DC) and conditional distance covariance (CDC) as train-time penalties. It introduces matrix forms for efficient parallel computation and provides convergence guarantees for empirical- population distance covariance in batch settings. A Lagrangian dual approach dynamically balances accuracy and fairness, demonstrated across tabular and image datasets (including CelebA and UTKFace) with competitive DP/EO trade-offs. The results highlight the method's versatility, scalability, and applicability to high-dimensional, tensor-valued data without relying on strong distributional assumptions.

Abstract

We explore fairness from a statistical perspective by selectively utilizing either conditional distance covariance or distance covariance statistics as measures to assess the independence between predictions and sensitive attributes. We boost fairness with independence by adding a distance covariance-based penalty to the model's training. Additionally, we present the matrix form of empirical (conditional) distance covariance for parallel calculations to enhance computational efficiency. Theoretically, we provide a proof for the convergence between empirical and population (conditional) distance covariance, establishing necessary guarantees for batch computations. Through experiments conducted on a range of real-world datasets, we have demonstrated that our method effectively bridges the fairness gap in machine learning. Our code is available at \url{https://github.com/liuhaixias1/Fair_dc/}.

Paper Structure

This paper contains 26 sections, 12 theorems, 71 equations, 6 figures, 9 tables, 2 algorithms.

Key Result

Theorem 1

Let $\mathbf Y\in\mathbb{R}^p$ and $\mathbf Z\in\mathbb{R}^q$ be two sub-Gaussian random vectors and $Y=[Y_1,\cdots,Y_n]$, $Z=[Z_1,\cdots,Z_n]$ be the sample matrices. For any $\epsilon>0$, we have

Figures (6)

  • Figure 1: The results between accuracy and fairness for the UCI and ACSIncome datasets.
  • Figure 2: Comparison results on the tabular datasets by Lagrangian dual method (Lag dual) and using a fixed balancing parameter (w/o lag dual), where 'w/o' stands for 'without'.
  • Figure 3: Trends in accuracy and fairness metrics over 90 epochs for distance covariance (DC) on the CelebA dataset with specified hyperparameter settings.
  • Figure 4: Trends in accuracy and fairness metrics over 90 epochs for conditional distance covariance (CDC) on the CelebA dataset with specified hyperparameter settings.
  • Figure 5: Trends in accuracy and fairness metrics over 90 epochs for distance covariance (DC) on the UTKFace dataset with specified hyperparameter settings.
  • ...and 1 more figures

Theorems & Definitions (30)

  • Definition 1: Definition 4 in szekelyMeasuringTestingDependence2007
  • Definition 2: Definition of empirical CDC in wang2015conditional
  • Theorem 1: Convergence of DC in probability
  • proof
  • Theorem 2: Convergence of CDC in probability
  • proof
  • Proposition 3
  • proof
  • Proposition 4
  • proof
  • ...and 20 more