Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks
Lujing Zhang, Aaron Roth, Linjun Zhang
TL;DR
This work introduces the $(\mathbf{s},\mathcal{G}, α)$-Generalized Multicalibration (GMC) framework to post-process predictive models for multi-group fairness in multi-dimensional outputs. It develops a generalized calibration objective, an algorithm with convergence guarantees, and finite-sample analyses, showing how to enforce fairness across many subgroups and dimensions. The authors demonstrate GMC's versatility through three applications: de-biased text generation, prediction-set conditional coverage in hierarchical classification, and fair false-negative-rate control in image segmentation, with empirical results indicating improved fairness without substantial accuracy loss. Overall, GMC provides a unified, theoretically grounded approach to multi-group fairness across complex, multidimensional prediction tasks relevant to real-world AI systems.
Abstract
This paper introduces a framework for post-processing machine learning models so that their predictions satisfy multi-group fairness guarantees. Based on the celebrated notion of multicalibration, we introduce $(\mathbf{s},\mathcal{G}, α)-$GMC (Generalized Multi-Dimensional Multicalibration) for multi-dimensional mappings $\mathbf{s}$, constraint set $\mathcal{G}$, and a pre-specified threshold level $α$. We propose associated algorithms to achieve this notion in general settings. This framework is then applied to diverse scenarios encompassing different fairness concerns, including false negative rate control in image segmentation, prediction set conditional uncertainty quantification in hierarchical classification, and de-biased text generation in language models. We conduct numerical studies on several datasets and tasks.
