FRAPPE: A Group Fairness Framework for Post-Processing Everything
Alexandru Tifrea, Preethi Lahoti, Ben Packer, Yoni Halpern, Ahmad Beirami, Flavien Prost
TL;DR
FRAPPÉ addresses the practical constraints of achieving group fairness by converting regularized in-processing objectives into post-processing modules. It introduces an additive post-hoc correction $T_{ ext{PP}}(X)$ so that $f_{ ext{fair}}(X)=f_{ ext{base}}(X)+T_{ ext{PP}}(X)$ and trains $T_{ ext{PP}}$ via a bi-level objective that couples a discrepancy measure with a fairness regularizer, without retraining the base model or needing sensitive attributes at inference. Theoretical results show an equivalence between IP and PP objectives for GLMs, implying identical fairness-accuracy Pareto frontiers, while extensive experiments on Adult, COMPAS, HSLS, ENEM, and Communities & Crime demonstrate that FRAPPÉ can match or surpass in-processing trade-offs and outperform many post-processing baselines, even with continuous sensitive attributes and partial group labels. The modular design offers computational efficiency and broad applicability across definitions of fairness, problem settings, and model classes, enabling post-processing to be a practical tool in resource-constrained or multi-component systems.
Abstract
Despite achieving promising fairness-error trade-offs, in-processing mitigation techniques for group fairness cannot be employed in numerous practical applications with limited computation resources or no access to the training pipeline of the prediction model. In these situations, post-processing is a viable alternative. However, current methods are tailored to specific problem settings and fairness definitions and hence, are not as broadly applicable as in-processing. In this work, we propose a framework that turns any regularized in-processing method into a post-processing approach. This procedure prescribes a way to obtain post-processing techniques for a much broader range of problem settings than the prior post-processing literature. We show theoretically and through extensive experiments that our framework preserves the good fairness-error trade-offs achieved with in-processing and can improve over the effectiveness of prior post-processing methods. Finally, we demonstrate several advantages of a modular mitigation strategy that disentangles the training of the prediction model from the fairness mitigation, including better performance on tasks with partial group labels.
