Learning Fair Invariant Representations under Covariate and Correlation Shifts Simultaneously
Dong Li, Chen Zhao, Minglai Shao, Wenjun Wang
TL;DR
This work tackles fairness-aware domain generalization under simultaneous covariate and correlation shifts, enabling robust generalization to unseen test domains. It introduces FLAIR, a framework that disentangles content and style via latent spaces and learns fair content representations using a Gaussian-mixture of prototypes across sensitive subgroups, guided by a transformation model $T$ to enforce invariance. The predictor $f=h_c\circ g\circ \omega$ is trained with a composite loss $R_{total}=R_{cls}+\lambda_1 R_{inv}+\lambda_2 R_{fair}$, leveraging EM for fairness responsibilities and primal-dual optimization to balance invariance and fairness; it demonstrates superior accuracy and both group and individual fairness on RC MNIST, NYPD, and FairFace. Overall, FLAIR offers a practical algorithmic path for reliable, fair generalization across diverse domains under covariate and correlation shifts.
Abstract
Achieving the generalization of an invariant classifier from training domains to shifted test domains while simultaneously considering model fairness is a substantial and complex challenge in machine learning. Existing methods address the problem of fairness-aware domain generalization, focusing on either covariate shift or correlation shift, but rarely consider both at the same time. In this paper, we introduce a novel approach that focuses on learning a fairness-aware domain-invariant predictor within a framework addressing both covariate and correlation shifts simultaneously, ensuring its generalization to unknown test domains inaccessible during training. In our approach, data are first disentangled into content and style factors in latent spaces. Furthermore, fairness-aware domain-invariant content representations can be learned by mitigating sensitive information and retaining as much other information as possible. Extensive empirical studies on benchmark datasets demonstrate that our approach surpasses state-of-the-art methods with respect to model accuracy as well as both group and individual fairness.
