PAC-Bayesian Generalization Guarantees for Fairness on Stochastic and Deterministic Classifiers

Julien Bastian; Benjamin Leblanc; Pascal Germain; Amaury Habrard; Christine Largeron; Guillaume Metzler; Emilie Morvant; Paul Viallard

TL;DR

This work tackles the challenge of providing theoretical fairness guarantees for learning algorithms, beyond traditional predictive risk bounds. It develops a unified PAC-Bayesian framework that yields generalization bounds for fairness, applicable to both stochastic Gibbs classifiers and deterministic majority votes, by viewing fairness as a risk discrepancy $RF_{\mathcal{D}}(h)$. A key contribution is a self-bounding learning procedure that directly optimizes a bound-based trade-off between predictive risk and fairness, for common group fairness measures such as Demographic Parity, Equalized Odds, and Equal Opportunity. Empirical results on several datasets show tight bounds and favorable risk/fairness trade-offs, supporting the practicality of certifiable fairness in real-world settings.

Abstract

Classical PAC generalization bounds on the prediction risk of a classifier are insufficient to provide theoretical guarantees on fairness when the goal is to learn models balancing predictive risk and fairness constraints. We propose a PAC-Bayesian framework for deriving generalization bounds for fairness, covering both stochastic and deterministic classifiers. For stochastic classifiers, we derive a fairness bound using standard PAC-Bayes techniques. For deterministic classifiers, where the usual PAC-Bayes arguments do not apply directly, we leverage a recent advance in PAC-Bayes to extend the fairness bound beyond the stochastic setting. Our framework has two advantages: (i) it applies to a broad class of fairness measures that can be expressed as a risk discrepancy, and (ii) it leads to a self-bounding algorithm in which the learning procedure directly optimizes a trade-off between generalization bounds on the prediction risk and on the fairness. We empirically evaluate our framework with three classical fairness measures, demonstrating not only its usefulness but also the tightness of our bounds.
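To make the "fairness as a risk discrepancy" idea concrete, the sketch below estimates a Demographic Parity gap on a sample, once for a deterministic weighted majority vote and once for a stochastic Gibbs classifier. It is an illustration only, not the paper's implementation: the function names, the label encoding in $\{-1,+1\}$, the binary group attribute, and the choice to measure the Gibbs gap as the absolute difference of $\rho$-averaged positive rates are all assumptions made here.

```python
import numpy as np

def demographic_parity_gap(preds, groups):
    """Empirical |P[h(x)=+1 | a=0] - P[h(x)=+1 | a=1]| for fixed predictions."""
    preds = np.asarray(preds)
    groups = np.asarray(groups)
    rate_a0 = np.mean(preds[groups == 0] == 1)
    rate_a1 = np.mean(preds[groups == 1] == 1)
    return abs(rate_a0 - rate_a1)

def majority_vote_predict(voter_preds, rho):
    """Deterministic rho-weighted majority vote; ties are broken toward +1."""
    margin = np.asarray(rho) @ np.asarray(voter_preds)
    return np.where(margin >= 0, 1, -1)

def gibbs_demographic_parity_gap(voter_preds, rho, groups):
    """Gap between rho-averaged positive rates (assumed reading of the Gibbs case)."""
    groups = np.asarray(groups)
    positive = (np.asarray(voter_preds) == 1).astype(float)
    rate_a0 = np.asarray(rho) @ positive[:, groups == 0].mean(axis=1)
    rate_a1 = np.asarray(rho) @ positive[:, groups == 1].mean(axis=1)
    return abs(rate_a0 - rate_a1)

# Hypothetical toy data: 3 voters, 4 examples, labels in {-1, +1}.
voter_preds = np.array([[1, 1, -1, -1],
                        [1, -1, 1, -1],
                        [1, 1, 1, -1]])
rho = np.array([0.6, 0.2, 0.2])      # posterior weights over the voters
groups = np.array([0, 0, 1, 1])      # sensitive attribute of each example

mv_gap = demographic_parity_gap(majority_vote_predict(voter_preds, rho), groups)
gibbs_gap = gibbs_demographic_parity_gap(voter_preds, rho, groups)
```

On this toy sample the two classifiers built from the same $\rho$ have different gaps, which is consistent with the abstract treating the stochastic and deterministic cases separately.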

Paper Structure (23 sections, 16 theorems, 76 equations, 3 figures, 5 tables, 1 algorithm)

Introduction
General Supervised Classification Setting
Setting and Notations
Classical PAC-Bayesian Theory
Group Fairness Setting
Setting and Fairness Measures
Generalization Bounds for Fairness
Fairness Generalization Bounds
Stochastic Classifier
Deterministic Classifiers
Specialization to Weighted Majority Votes
A Fair Self-Bounding Algorithm
Experiments
Conclusion
Detailed discussion of the results of oneto2020
...and 8 more sections

Key Result

Theorem 2.1

For any distribution $\mathcal{P}$, any hypothesis set $\mathcal{H}$, any prior $\pi$ on $\mathcal{H}$, and $\delta \in (0,1]$, with probability at least $1-\delta$ over the random choice $S \sim \mathcal{P}^m$, we have for any distribution $\rho$ on $\mathcal{H}$, where $\operatorname{KL}(\rho\|\pi) := \mathbb{E}_{h \sim \rho} \ln\frac{\rho(h)}{\pi(h)}$ …
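The statement of Theorem 2.1 is cut off on this page, so its inequality is not reproduced here. The theorem list attributes it to Seeger02Maurer2004. Assuming it takes the form commonly attributed to Seeger and Maurer, $\operatorname{kl}(\hat{R}_S(\rho)\,\|\,R_{\mathcal{P}}(\rho)) \le \frac{1}{m}\left[\operatorname{KL}(\rho\|\pi) + \ln\frac{2\sqrt{m}}{\delta}\right]$, a numerical risk bound follows by inverting the binary kl divergence. The sketch below does this by bisection. The function names and the symbols $\hat{R}_S$ and $R_{\mathcal{P}}$ are illustrative choices made here, not the paper's notation.

```python
import math

def binary_kl(q, p):
    """kl(q || p) between Bernoulli distributions with means q and p."""
    eps = 1e-12
    q = min(max(q, 0.0), 1.0)
    p = min(max(p, eps), 1.0 - eps)  # keep the logarithms finite
    total = 0.0
    if q > 0:
        total += q * math.log(q / p)
    if q < 1:
        total += (1 - q) * math.log((1 - q) / (1 - p))
    return total

def kl_upper_inverse(q, budget, tol=1e-10):
    """Largest p in [q, 1] with kl(q || p) <= budget, found by bisection."""
    lo, hi = q, 1.0
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if binary_kl(q, mid) <= budget:
            lo = mid   # mid still satisfies the constraint, move right
        else:
            hi = mid   # mid violates the constraint, move left
    return lo

def seeger_maurer_bound(emp_risk, kl_rho_pi, m, delta):
    """Upper bound on the true Gibbs risk, under the assumed form of the theorem."""
    budget = (kl_rho_pi + math.log(2 * math.sqrt(m) / delta)) / m
    return kl_upper_inverse(emp_risk, budget)

# Hypothetical inputs: empirical risk 0.1, KL(rho || pi) = 5, delta = 0.05.
bound_small_sample = seeger_maurer_bound(0.1, 5.0, m=1_000, delta=0.05)
bound_large_sample = seeger_maurer_bound(0.1, 5.0, m=100_000, delta=0.05)
```

With the empirical risk and $\operatorname{KL}(\rho\|\pi)$ held fixed, the bound tightens toward the empirical risk as $m$ grows.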

Figures (3)

Figure 1: Test error and generalization bound of a stochastic majority vote classifier and its deterministic counterpart for Demographic Parity (DP).
Figure 2: Test error and generalization bound of a stochastic majority vote classifier and its deterministic counterpart for Equalized Odds (EO). To compute the Equalized Odds risk, we replace ${\mathbb{P}}[y=+1]$ and ${\mathbb{P}}[y=0]$ by their empirical estimate. In this case, the O2 method cannot be directly generalized because the fairness risk is a linear combination of risks; therefore, we do not report it.
Figure 3: Test error and generalization bound of a stochastic majority vote classifier and its deterministic counterpart for Equal Opportunity (EOP).

Theorems & Definitions (29)

Theorem 2.1: Seeger02, Maurer2004
Theorem 3.1: Generalization of Th. 1 of oneto2020
proof
Theorem 4.1
proof
Example 4.2
Proposition 4.2: Risk decomposition, Leblanc25
proof
Lemma 4.2: Bounds on $\Hrisk{\Dcal}$
proof
...and 19 more
