Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

Lujing Zhang; Aaron Roth; Linjun Zhang

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

Lujing Zhang, Aaron Roth, Linjun Zhang

TL;DR

This work introduces the $(\mathbf{s},\mathcal{G}, α)$-Generalized Multicalibration (GMC) framework to post-process predictive models for multi-group fairness in multi-dimensional outputs. It develops a generalized calibration objective, an algorithm with convergence guarantees, and finite-sample analyses, showing how to enforce fairness across many subgroups and dimensions. The authors demonstrate GMC's versatility through three applications: de-biased text generation, prediction-set conditional coverage in hierarchical classification, and fair false-negative-rate control in image segmentation, with empirical results indicating improved fairness without substantial accuracy loss. Overall, GMC provides a unified, theoretically grounded approach to multi-group fairness across complex, multidimensional prediction tasks relevant to real-world AI systems.

Abstract

This paper introduces a framework for post-processing machine learning models so that their predictions satisfy multi-group fairness guarantees. Based on the celebrated notion of multicalibration, we introduce $(\mathbf{s},\mathcal{G}, α)-$GMC (Generalized Multi-Dimensional Multicalibration) for multi-dimensional mappings $\mathbf{s}$, constraint set $\mathcal{G}$, and a pre-specified threshold level $α$. We propose associated algorithms to achieve this notion in general settings. This framework is then applied to diverse scenarios encompassing different fairness concerns, including false negative rate control in image segmentation, prediction set conditional uncertainty quantification in hierarchical classification, and de-biased text generation in language models. We conduct numerical studies on several datasets and tasks.

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

TL;DR

This work introduces the

-Generalized Multicalibration (GMC) framework to post-process predictive models for multi-group fairness in multi-dimensional outputs. It develops a generalized calibration objective, an algorithm with convergence guarantees, and finite-sample analyses, showing how to enforce fairness across many subgroups and dimensions. The authors demonstrate GMC's versatility through three applications: de-biased text generation, prediction-set conditional coverage in hierarchical classification, and fair false-negative-rate control in image segmentation, with empirical results indicating improved fairness without substantial accuracy loss. Overall, GMC provides a unified, theoretically grounded approach to multi-group fairness across complex, multidimensional prediction tasks relevant to real-world AI systems.

Abstract

GMC (Generalized Multi-Dimensional Multicalibration) for multi-dimensional mappings

, constraint set

, and a pre-specified threshold level

. We propose associated algorithms to achieve this notion in general settings. This framework is then applied to diverse scenarios encompassing different fairness concerns, including false negative rate control in image segmentation, prediction set conditional uncertainty quantification in hierarchical classification, and de-biased text generation in language models. We conduct numerical studies on several datasets and tasks.

Paper Structure (29 sections, 12 theorems, 75 equations, 8 figures, 5 tables, 2 algorithms)

This paper contains 29 sections, 12 theorems, 75 equations, 8 figures, 5 tables, 2 algorithms.

Introduction
Related Work
Notation
Formulation and Algorithm
A generalized notion of Multicalibration
Algorithm and Convergence Results
Finite-Sample Results
Applications
De-Biased Text Generation
Prediction-Set Conditional Coverage in Hierarchical Classification
Fair FNR Control in Image Segmentation
Experiments
De-Biased Text Generation
Prediction-Set Conditional Coverage in Hierarchical Classification
Fair FNR Control in Image Segmentation
...and 14 more sections

Key Result

Theorem 1

Under Assumptions 1-4, the $(\bm s,\mathcal{G}, \alpha)$-GMC Algorithm with a suitably chosen $\eta = \mathcal{O}(\alpha/(K_{\mathcal{L}}B))$ converges in $T=\mathcal{O}(\frac{2K_{\mathcal{L}}(C_u-C_l)B)}{\alpha^2})$ iterations and outputs a function $\bm f$ satisfying

Figures (8)

Figure 1: A demo of hierarchical text classification using a subset of labels from the Web of Science dataset. kowsari2017HDLTex.
Figure 2: The bias on outputting different types of sensitive attributes measured on the corpus data. The results for the synthetic data are deferred to the appendix.
Figure 3: The deviation of prediction-set conditional coverage from the target.
Figure 4: The deviation of the false negative rate from the target in image segmentation.
Figure 5: A demonstration of the input data
...and 3 more figures

Theorems & Definitions (35)

Definition 1: $(\bm s,\mathcal{G}, \alpha)$-GMC
Definition 2: The derivative of a functional
Definition 3: Convexity of a functional
Definition 4: $K_{\mathcal{L}}$-smoothness of a functional
Theorem 1
Definition 5: Dimension of the function class
Theorem 2
Theorem 3
Theorem 4
Theorem 5
...and 25 more

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

TL;DR

Abstract

Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (35)