Procedural Fairness in Machine Learning

Ziming Wang; Changwu Huang; Xin Yao

Procedural Fairness in Machine Learning

Ziming Wang, Changwu Huang, Xin Yao

TL;DR

This paper addresses the gap in ML fairness research by defining procedural fairness and distinguishing it from distributive fairness, proposing formal definitions for individual and group procedural fairness. It introduces GPFFAE, an FAE-based metric that measures differences in model decision logic across groups by comparing SHAP/FAE explanations with maximum mean discrepancy and permutation testing; the method requires sampling paired data points from two groups. The authors validate GPFFAE across nine datasets (synthetic plus eight real-world), analyze its relationship with distributive fairness, and show how it can identify unfair features to support targeted mitigation. They propose two mitigation strategies—removing unfair features via retraining and reducing their influence through an explanation-loss penalty—demonstrating substantial improvements in procedural fairness with only modest impacts on accuracy and occasional distributive fairness gains, highlighting practical implications for fair, responsible AI.

Abstract

Fairness in machine learning (ML) has received much attention. However, existing studies have mainly focused on the distributive fairness of ML models. The other dimension of fairness, i.e., procedural fairness, has been neglected. In this paper, we first define the procedural fairness of ML models, and then give formal definitions of individual and group procedural fairness. We propose a novel metric to evaluate the group procedural fairness of ML models, called $GPF_{FAE}$, which utilizes a widely used explainable artificial intelligence technique, namely feature attribution explanation (FAE), to capture the decision process of the ML models. We validate the effectiveness of $GPF_{FAE}$ on a synthetic dataset and eight real-world datasets. Our experiments reveal the relationship between procedural and distributive fairness of the ML model. Based on our analysis, we propose a method for identifying the features that lead to the procedural unfairness of the model and propose two methods to improve procedural fairness after identifying unfair features. Our experimental results demonstrate that we can accurately identify the features that lead to procedural unfairness in the ML model, and both of our proposed methods can significantly improve procedural fairness with a slight impact on model performance, while also improving distributive fairness.

Procedural Fairness in Machine Learning

TL;DR

Abstract

, which utilizes a widely used explainable artificial intelligence technique, namely feature attribution explanation (FAE), to capture the decision process of the ML models. We validate the effectiveness of

on a synthetic dataset and eight real-world datasets. Our experiments reveal the relationship between procedural and distributive fairness of the ML model. Based on our analysis, we propose a method for identifying the features that lead to the procedural unfairness of the model and propose two methods to improve procedural fairness after identifying unfair features. Our experimental results demonstrate that we can accurately identify the features that lead to procedural unfairness in the ML model, and both of our proposed methods can significantly improve procedural fairness with a slight impact on model performance, while also improving distributive fairness.

Paper Structure (23 sections, 7 equations, 12 figures, 8 tables, 2 algorithms)

This paper contains 23 sections, 7 equations, 12 figures, 8 tables, 2 algorithms.

Introduction
Related Work
Related Work on Distributive Fairness in ML
Related Work on Procedural Fairness in ML
Related Work on Applying FAE to Fairness
New Definition and Measurement of Procedural Fairness
Notation
Definition and Measurement of Procedural Fairness
Experimental Evaluation of the Procedural Fairness Metric
Experimental Setup
Datasets
Parameter Setting
Evaluation on Procedural-Fair and Procedural-Unfair Models
Evaluating the Relationship between GPFFAE Metric and the Degree of Group Procedural Fairness
Relationship between Procedural and Distributive Fairness
...and 8 more sections

Figures (12)

Figure 1: Pipeline of using GPFFAE metric to evaluate group procedural fairness.
Figure 2: Relationship graph for the synthetic dataset.
Figure 3: The “sex" feature on the Adult dataset is used as an example to illustrate the criteria for constructing a significantly procedural-unfair model.
Figure 4: FAE explanation results obtained for the sensitive attribute on each constructed procedural-unfair model, where red and blue represent the advantaged and disadvantaged groups, respectively.
Figure 5: Explanation results for disadvantaged and advantaged groups on the COMPAS dataset.
...and 7 more figures

Procedural Fairness in Machine Learning

TL;DR

Abstract

Procedural Fairness in Machine Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (12)