CAD-VAE: Leveraging Correlation-Aware Latents for Comprehensive Fair Disentanglement
Chenrui Ma, Xi Xiao, Tianyang Wang, Xiao Wang, Yanning Shen
TL;DR
CAD-VAE introduces a correlation-aware disentanglement framework by incorporating a correlated latent code $z_R$ to capture information shared between target $Y$ and sensitive $S$ attributes. It directly minimizes the conditional mutual information $I(Y; S \,|\, z_R)$ and employs a relevance-learning strategy to ensure $z_R$ encodes only essential shared information, preserving discriminative power in $z_Y$ and $z_S$. The method combines a VAE objective with a CMI loss, a total correlation penalty, and a learning-relevance loss, optimized in a two-step process. Across fair classification, counterfactual generation, and fair image/text editing tasks, CAD-VAE achieves state-of-the-art fairness while maintaining predictive and generative quality, and demonstrates strong applicability to vision-language models via CLIP-based adaptations.
Abstract
While deep generative models have significantly advanced representation learning, they may inherit or amplify biases and fairness issues by encoding sensitive attributes alongside predictive features. Enforcing strict independence in disentanglement is often unrealistic when target and sensitive factors are naturally correlated. To address this challenge, we propose \textbf{CAD-VAE} (\textbf{C}orrelation-\textbf{A}ware \textbf{D}isentangled \textbf{VAE}), which introduces a correlated latent code to capture the information shared between the target and sensitive attributes. Given this correlated latent, our method effectively separates overlapping factors without extra domain knowledge by directly minimizing the conditional mutual information between target and sensitive codes. A relevance-driven optimization strategy refines the correlated code by efficiently capturing essential correlated features and eliminating redundancy. Extensive experiments on benchmark datasets demonstrate that CAD-VAE produces fairer representations, realistic counterfactuals, and improved fairness-aware image editing. Source code is available : https://github.com/merry7cherry/CAD-VAE
