Enforcing Conditional Independence for Fair Representation Learning and Causal Image Generation
Jensen Hwa, Qingyu Zhao, Aditya Lahiri, Adnan Masood, Babak Salimi, Ehsan Adeli
TL;DR
This paper tackles enforcing conditional independence to achieve fair and unconfounded representations and enable causal image generation. It recasts CI as an equality of Jensen-Shannon divergences and introduces a dynamic sampling scheme to apply CI in high-dimensional latent spaces, applicable to any encoder. The authors demonstrate that enforcing CI in the latent space (v-space) yields better fairness-accuracy trade-offs than label-space CI and enables race-controlled generation through partial CI in diffusion autoencoders. The approach provides a model-agnostic training paradigm with practical impact on fair predictive modeling and controllable image synthesis in high-dimensional settings.
Abstract
Conditional independence (CI) constraints are critical for defining and evaluating fairness in machine learning, as well as for learning unconfounded or causal representations. Traditional methods for ensuring fairness either blindly learn invariant features with respect to a protected variable (e.g., race when classifying sex from face images) or enforce CI relative to the protected attribute only on the model output (e.g., the sex label). Neither of these methods are effective in enforcing CI in high-dimensional feature spaces. In this paper, we focus on a nascent approach characterizing the CI constraint in terms of two Jensen-Shannon divergence terms, and we extend it to high-dimensional feature spaces using a novel dynamic sampling strategy. In doing so, we introduce a new training paradigm that can be applied to any encoder architecture. We are able to enforce conditional independence of the diffusion autoencoder latent representation with respect to any protected attribute under the equalized odds constraint and show that this approach enables causal image generation with controllable latent spaces. Our experimental results demonstrate that our approach can achieve high accuracy on downstream tasks while upholding equality of odds.
