Improving Fairness and Mitigating MADness in Generative Models
Paul Mayer, Lorenzo Luzi, Ali Siahkoohi, Don H. Johnson, Richard G. Baraniuk
TL;DR
This work targets bias in generative modeling by linking MLE-induced estimator bias to fairness gaps and to Model Autophagy Disorder (MADness). It introduces Penalized Autophagy Estimation (PLE) implemented via hypernetworks to enforce that parameter estimates from real and synthetic data share consistent statistics, thereby reducing bias and stabilizing generation. Theoretical formulation pairs a constrained objective with a tractable Lagrangian relaxation and a data-driven hypernetwork $H_{\phi}$ that predicts downstream weights, enabling scalable debiasing across architectures. Empirically, hypernetwork-enabled PLE improves minority-data representation, slows MADness, and yields more robust performance in imbalanced or low-data regimes across MNIST, GMM, and CIFAR-10 settings. Overall, the approach couples unbiased statistical estimation with deep learning to enhance fairness and stability in generative modelling, with potential extensions to diffusion methods and uncertainty quantification.
Abstract
Generative models unfairly penalize data belonging to minority classes, suffer from model autophagy disorder (MADness), and learn biased estimates of the underlying distribution parameters. Our theoretical and empirical results show that training generative models with intentionally designed hypernetworks leads to models that 1) are more fair when generating datapoints belonging to minority classes 2) are more stable in a self-consumed (i.e., MAD) setting, and 3) learn parameters that are less statistically biased. To further mitigate unfairness, MADness, and bias, we introduce a regularization term that penalizes discrepancies between a generative model's estimated weights when trained on real data versus its own synthetic data. To facilitate training existing deep generative models within our framework, we offer a scalable implementation of hypernetworks that automatically generates a hypernetwork architecture for any given generative model.
