Table of Contents
Fetching ...

Transparency and Proportionality in Post-Processing Algorithmic Bias Correction

Juliett Suárez Ferreira, Marija Slavkovik, Jorge Casillas

TL;DR

The study tackles the problem that post-processing bias corrections can still yield unfair outcomes by focusing on the distribution of prediction flips across groups. It introduces a formal set of flip-centered proportionality metrics (e.g., FR, N_flips, DFR, FRD, HDI) and group-specific versions to quantify and compare how corrections affect privileged and unprivileged groups. Through a toy example, it demonstrates that a method can achieve traditional fairness metrics while concentrating harmful flips on a particular group, highlighting the need for transparency and consideration of proportionality in bias mitigation. The proposed methodology provides practitioners with diagnostics that complement standard fairness measures, promoting fairer and more justifiable post-processing interventions, and paves the way for broader routine adoption and extension to more complex settings.

Abstract

Algorithmic decision-making systems sometimes produce errors or skewed predictions toward a particular group, leading to unfair results. Debiasing practices, applied at different stages of the development of such systems, occasionally introduce new forms of unfairness or exacerbate existing inequalities. We focus on post-processing techniques that modify algorithmic predictions to achieve fairness in classification tasks, examining the unintended consequences of these interventions. To address this challenge, we develop a set of measures that quantify the disparity in the flips applied to the solution in the post-processing stage. The proposed measures will help practitioners: (1) assess the proportionality of the debiasing strategy used, (2) have transparency to explain the effects of the strategy in each group, and (3) based on those results, analyze the possibility of the use of some other approaches for bias mitigation or to solve the problem. We introduce a methodology for applying the proposed metrics during the post-processing stage and illustrate its practical application through an example. This example demonstrates how analyzing the proportionality of the debiasing strategy complements traditional fairness metrics, providing a deeper perspective to ensure fairer outcomes across all groups.

Transparency and Proportionality in Post-Processing Algorithmic Bias Correction

TL;DR

The study tackles the problem that post-processing bias corrections can still yield unfair outcomes by focusing on the distribution of prediction flips across groups. It introduces a formal set of flip-centered proportionality metrics (e.g., FR, N_flips, DFR, FRD, HDI) and group-specific versions to quantify and compare how corrections affect privileged and unprivileged groups. Through a toy example, it demonstrates that a method can achieve traditional fairness metrics while concentrating harmful flips on a particular group, highlighting the need for transparency and consideration of proportionality in bias mitigation. The proposed methodology provides practitioners with diagnostics that complement standard fairness measures, promoting fairer and more justifiable post-processing interventions, and paves the way for broader routine adoption and extension to more complex settings.

Abstract

Algorithmic decision-making systems sometimes produce errors or skewed predictions toward a particular group, leading to unfair results. Debiasing practices, applied at different stages of the development of such systems, occasionally introduce new forms of unfairness or exacerbate existing inequalities. We focus on post-processing techniques that modify algorithmic predictions to achieve fairness in classification tasks, examining the unintended consequences of these interventions. To address this challenge, we develop a set of measures that quantify the disparity in the flips applied to the solution in the post-processing stage. The proposed measures will help practitioners: (1) assess the proportionality of the debiasing strategy used, (2) have transparency to explain the effects of the strategy in each group, and (3) based on those results, analyze the possibility of the use of some other approaches for bias mitigation or to solve the problem. We introduce a methodology for applying the proposed metrics during the post-processing stage and illustrate its practical application through an example. This example demonstrates how analyzing the proportionality of the debiasing strategy complements traditional fairness metrics, providing a deeper perspective to ensure fairer outcomes across all groups.

Paper Structure

This paper contains 11 sections, 23 equations, 4 figures, 2 tables.

Figures (4)

  • Figure 1: Part of the ML pipeline highlighting pre-processing, in-processing and post-processing interventions at different stages as well as the ADM process.
  • Figure 2: A post-processing debiasing method that works by changing the label of the instances. Consider y the label of the instance, y predicted the predicted label and y correctedthe label obtained after applying a post-processing debias strategy. The signs, $+$ and $-$represent the value of the class and the colors (light and dark gray) indicate the membership of each instance to a particular group taking into account a protected attribute such as gender, race, etc.
  • Figure 3: Methodology
  • Figure 4: Visual analysis of Flips and Group-Based Flip Proportionality Metrics

Theorems & Definitions (11)

  • Definition 1: Flip
  • Definition 2: Number of flips
  • Definition 3: Flip Rate (FR)
  • Definition 4: Favorable Flips
  • Definition 5: Unfavorable Flips
  • Definition 6: Directional Flip Ratio (DFR)
  • Definition 7: Harmful Flip Proportion(HFP)
  • Definition 8: Flip Rate Difference (FRD) & Harmful Flip Proportion Difference (HFPD)
  • Definition 9: Disparity Index (DI) & Harmful Disparity Index (HDI)
  • Definition 10: Flip Rate Disparity (FD) & Harmful Flip Proportionality Disparity (HFD)
  • ...and 1 more