Fair Supervised Learning Through Constraints on Smooth Nonconvex Unfairness-Measure Surrogates
Zahra Khatti, Daniel P. Robinson, Frank E. Curtis
TL;DR
This paper tackles fairness in supervised learning by shifting from regularization-based approaches to hard constraints that enforce specified unfairness bounds. It introduces smooth, bounded, nonconvex surrogates to approximate discontinuous unfairness measures and proves that small surrogate values imply small actual unfairness, with a bound that tightens as the surrogate is scaled. The proposed training formulation minimizes standard loss plus a regulator under hard constraint bounds and is solved efficiently via sequential quadratic programming, enabling simultaneous enforcement of multiple unfairness criteria. Empirical results on Dutch, Law, and ACSIncome datasets demonstrate tight control over disparate impact and related measures with minimal sacrifice in predictive accuracy, while highlighting the practical benefits and costs of constraint-based training over regularization-based methods.
Abstract
A new strategy for fair supervised machine learning is proposed. The main advantages of the proposed strategy as compared to others in the literature are as follows. (a) We introduce a new smooth nonconvex surrogate to approximate the Heaviside functions involved in discontinuous unfairness measures. The surrogate is based on smoothing methods from the optimization literature, and is new for the fair supervised learning literature. The surrogate is a tight approximation which ensures the trained prediction models are fair, as opposed to other (e.g., convex) surrogates that can fail to lead to a fair prediction model in practice. (b) Rather than rely on regularizers (that lead to optimization problems that are difficult to solve) and corresponding regularization parameters (that can be expensive to tune), we propose a strategy that employs hard constraints so that specific tolerances for unfairness can be enforced without the complications associated with the use of regularization. (c) Our proposed strategy readily allows for constraints on multiple (potentially conflicting) unfairness measures at the same time. Multiple measures can be considered with a regularization approach, but at the cost of having even more difficult optimization problems to solve and further expense for tuning. By contrast, through hard constraints, our strategy leads to optimization models that can be solved tractably with minimal tuning.
