Equalized Generative Treatment: Matching f-divergences for Fairness in Generative Models
Alexandre Verine, Rafael Pinot, Florian Le Bronnec
TL;DR
The paper addresses fairness in generative modeling, arguing that proportion-based criteria like EGO and MGO are brittle because they do not guarantee equal conditional generation quality across sensitive groups. It defines equalized generative treatment (EGT) via δ-EGT, balancing group-wise $f$-divergences $\mathcal{D}_f(P_a\|Q_a)$, and shows that enforcing EGT relates to minimizing the worst-group divergence. The authors propose a practical min–max training approach to enforce EGT and provide theoretical results bounding the global divergence by the worst conditional divergence. Empirically, Min–Max improves per-group fairness metrics across diffusion-based image generation and text generation with competitive overall performance, while highlighting trade-offs and the limited predictive power of proportion-based metrics for conditional quality. This work offers a principled framework and a scalable optimization strategy for fairness in generative systems, with potential extensions to other divergences and modalities.
Abstract
Fairness is a crucial concern for generative models, which not only reflect but can also amplify societal and cultural biases. Existing fairness notions for generative models are largely adapted from classification and focus on balancing the probability of generating samples from each sensitive group. We show that such criteria are brittle, as they can be met even when different sensitive groups are modeled with widely varying quality. To address this limitation, we introduce a new fairness definition for generative models, termed as equalized generative treatment (EGT), which requires comparable generation quality across all sensitive groups, with quality measured via a reference f-divergence. We further analyze the trade-offs induced by EGT, demonstrating that enforcing fairness constraints necessarily couples the overall model quality to that of the most challenging group to approximate. This indicates that a simple yet efficient min-max fine-tuning method should be able to balance f-divergences across sensitive groups to satisfy EGT. We validate this theoretical insight through a set of experiments on both image and text generation tasks. We demonstrate that min-max methods consistently achieve fairer outcomes compared to other approaches from the literature, while maintaining competitive overall performance for both tasks.
