Fairness Constraints: Mechanisms for Fair Classification
Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, Krishna P. Gummadi
TL;DR
The paper tackles fairness in binary classification by introducing a convex, interpretable measure of decision boundary fairness, covariance between sensitive attributes and the signed distance to the boundary. It presents two convex optimization frameworks: one maximizing accuracy under fairness constraints to satisfy the p%-rule, and another maximizing fairness under accuracy constraints to honor business necessity. Through synthetic and real-data experiments, including Adult and Bank datasets, the authors demonstrate controllable fairness-accuracy trade-offs with minimal accuracy loss and show the method outperforms certain pre-processing and regularization baselines. The approach extends to multiple and non-binary sensitive attributes and provides a practical, scalable path toward compliant and fair decision-making in real-world systems.
Abstract
Algorithmic decision making systems are ubiquitous across a wide variety of online as well as offline services. These systems rely on complex learning methods and vast amounts of data to optimize the service functionality, satisfaction of the end user and profitability. However, there is a growing concern that these automated decisions can lead, even in the absence of intent, to a lack of fairness, i.e., their outcomes can disproportionately hurt (or, benefit) particular groups of people sharing one or more sensitive attributes (e.g., race, sex). In this paper, we introduce a flexible mechanism to design fair classifiers by leveraging a novel intuitive measure of decision boundary (un)fairness. We instantiate this mechanism with two well-known classifiers, logistic regression and support vector machines, and show on real-world data that our mechanism allows for a fine-grained control on the degree of fairness, often at a small cost in terms of accuracy.
