PAC-Bayesian Generalization Guarantees for Fairness on Stochastic and Deterministic Classifiers
Julien Bastian, Benjamin Leblanc, Pascal Germain, Amaury Habrard, Christine Largeron, Guillaume Metzler, Emilie Morvant, Paul Viallard
TL;DR
This work tackles the challenge of providing theoretical fairness guarantees for learning algorithms, beyond traditional predictive risk bounds. It develops a unified PAC-Bayesian framework that yields generalization bounds for fairness, applicable to both stochastic Gibbs classifiers and deterministic majority votes, by viewing fairness as a risk-discrepancy $RF_{\mathcal{D}}(h)$. A key contribution is a self-bounding learning procedure that directly optimizes a bound- based trade-off between predictive risk and fairness across common group fairness measures such as Demographic Parity, Equalized Odds, and Equal Opportunity. Empirical results on several datasets show tight bounds and favorable risk/fairness trade-offs, supporting the practicality of certifiable fairness in real-world settings.
Abstract
Classical PAC generalization bounds on the prediction risk of a classifier are insufficient to provide theoretical guarantees on fairness when the goal is to learn models balancing predictive risk and fairness constraints. We propose a PAC-Bayesian framework for deriving generalization bounds for fairness, covering both stochastic and deterministic classifiers. For stochastic classifiers, we derive a fairness bound using standard PAC-Bayes techniques. Whereas for deterministic classifiers, as usual PAC-Bayes arguments do not apply directly, we leverage a recent advance in PAC-Bayes to extend the fairness bound beyond the stochastic setting. Our framework has two advantages: (i) It applies to a broad class of fairness measures that can be expressed as a risk discrepancy, and (ii) it leads to a self-bounding algorithm in which the learning procedure directly optimizes a trade-off between generalization bounds on the prediction risk and on the fairness. We empirically evaluate our framework with three classical fairness measures, demonstrating not only its usefulness but also the tightness of our bounds.
