Algorithmic Fairness Generalization under Covariate and Dependence Shifts Simultaneously
Chen Zhao, Kai Jiang, Xintao Wu, Haoliang Wang, Latifur Khan, Christan Grant, Feng Chen
TL;DR
This work addresses fairness-aware domain generalization under simultaneous covariate and dependence shifts by introducing FEDORA, a transformation-based framework that disentangles semantic, sensitive, and style factors and augments data with synthetic domains. It formalizes a fairness-invariant objective through a transformation model and a constrained learning problem, and provides empirical duality guarantees and an unseen-domain fairness bound. The FEDORA algorithm co-trains a disentangled transformation T with a fairness-aware classifier, using synthetic domain augmentation and a primal–dual optimization scheme to enforce invariance across domains. The method demonstrates consistent improvements over 19 baselines across four benchmarks, with notable gains in both DP and AUC_fair, and includes thorough ablations and theoretical insights on duality gaps and target-domain fairness.
Abstract
The endeavor to preserve the generalization of a fair and invariant classifier across domains, especially in the presence of distribution shifts, becomes a significant and intricate challenge in machine learning. In response to this challenge, numerous effective algorithms have been developed with a focus on addressing the problem of fairness-aware domain generalization. These algorithms are designed to navigate various types of distribution shifts, with a particular emphasis on covariate and dependence shifts. In this context, covariate shift pertains to changes in the marginal distribution of input features, while dependence shift involves alterations in the joint distribution of the label variable and sensitive attributes. In this paper, we introduce a simple but effective approach that aims to learn a fair and invariant classifier by simultaneously addressing both covariate and dependence shifts across domains. We assert the existence of an underlying transformation model can transform data from one domain to another, while preserving the semantics related to non-sensitive attributes and classes. By augmenting various synthetic data domains through the model, we learn a fair and invariant classifier in source domains. This classifier can then be generalized to unknown target domains, maintaining both model prediction and fairness concerns. Extensive empirical studies on four benchmark datasets demonstrate that our approach surpasses state-of-the-art methods.
