Differential Adjusted Parity for Learning Fair Representations
Bucher Sahyouni, Matthew Vowels, Liqun Chen, Simon Hadfield
TL;DR
The paper introduces Differential Adjusted Parity (DAP) loss, a differentiable, non-adversarial objective based on a differentiable variant of the adjusted parity metric, to jointly optimize task accuracy and invariance across sensitive domains via Soft Balanced Accuracy. It extends the adjusted parity metric to multiple domains and uses per-domain soft accuracy to form a differentiable objective that rewards high average accuracy while reducing cross-domain inconsistency. Empirically, DAP outperforms several adversarial debiasing methods on Adult and COMPAS datasets, achieving larger improvements in adjusted parity, equalized odds, and demographic parity metrics, while maintaining competitive task performance. While showing strong fairness gains, the method exhibits hyperparameter sensitivity, motivating future work on automatic tuning and dynamic adaptation of $\beta$ and $\Omega$ during training.
Abstract
The development of fair and unbiased machine learning models remains an ongoing objective for researchers in the field of artificial intelligence. We introduce the Differential Adjusted Parity (DAP) loss to produce unbiased informative representations. It utilises a differentiable variant of the adjusted parity metric to create a unified objective function. By combining downstream task classification accuracy and its inconsistency across sensitive feature domains, it provides a single tool to increase performance and mitigate bias. A key element in this approach is the use of soft balanced accuracies. In contrast to previous non-adversarial approaches, DAP does not suffer a degeneracy where the metric is satisfied by performing equally poorly across all sensitive domains. It outperforms several adversarial models on downstream task accuracy and fairness in our analysis. Specifically, it improves the demographic parity, equalized odds and sensitive feature accuracy by as much as 22.5\%, 44.1\% and 40.1\%, respectively, when compared to the best performing adversarial approaches on these metrics. Overall, the DAP loss and its associated metric can play a significant role in creating more fair machine learning models.
