Unprocessing Seven Years of Algorithmic Fairness
André F. Cruz, Moritz Hardt
TL;DR
The paper investigates whether postprocessing remains the most effective way to enforce error-rate parity across demographic groups. It introduces unprocessing, the inverse mapping of postprocessing, to enable fair comparisons across methods and constraint relaxations, and it presents an LP-based approach to relaxed equalized odds with open-source tooling. Across thousands of models on tabular ACS datasets, postprocessing of the most accurate unconstrained predictor consistently matches or dominates all examined fairness interventions. The work underscores the importance of rigorous empirical evaluation in fairness research and provides practical tools to compare fairness methods on an equal footing.
Abstract
Seven years ago, researchers proposed a postprocessing method to equalize the error rates of a model across different demographic groups. The work launched hundreds of papers purporting to improve over the postprocessing baseline. We empirically evaluate these claims through thousands of model evaluations on several tabular datasets. We find that the fairness-accuracy Pareto frontier achieved by postprocessing contains all other methods we were feasibly able to evaluate. In doing so, we address two common methodological errors that have confounded previous observations. One relates to the comparison of methods with different unconstrained base models. The other concerns methods achieving different levels of constraint relaxation. At the heart of our study is a simple idea we call unprocessing that roughly corresponds to the inverse of postprocessing. Unprocessing allows for a direct comparison of methods using different underlying models and levels of relaxation.
