Progressive Monitoring of Generative Model Training Evolution
Vidya Prasad, Anna Vilanova, Nicola Pezzotti
TL;DR
This work tackles biases and inefficiencies in deep generative models by proposing a progressive analysis framework that continuously monitors high-dimensional training data via evolutionary embeddings. By extracting latent representations and data distributions at regular checkpoints and projecting them into a 2D evolutionary space (using encoders such as CLIP), the approach enables mid-training bias detection and timely interventions. Applied to AttentionGAN on CelebA hair-color transformations, the method reveals gender- and age-related biases early in training and demonstrates how data augmentation and targeted image collection can mitigate these biases, improve output realism, and reduce computational costs. The results highlight the practical impact of real-time, interpretable monitoring for fairer and more efficient generative modeling, with clear potential extensions to diffusion models and other DGMs.
Abstract
While deep generative models (DGMs) have gained popularity, their susceptibility to biases and other inefficiencies that lead to undesirable outcomes remains an issue. With their growing complexity, there is a critical need for early detection of issues to achieve desired results and optimize resources. Hence, we introduce a progressive analysis framework to monitor the training process of DGMs. Our method utilizes dimensionality reduction techniques to facilitate the inspection of latent representations, the generated and real distributions, and their evolution across training iterations. This monitoring allows us to pause and fix the training method if the representations or distributions progress undesirably. This approach allows for the analysis of a models' training dynamics and the timely identification of biases and failures, minimizing computational loads. We demonstrate how our method supports identifying and mitigating biases early in training a Generative Adversarial Network (GAN) and improving the quality of the generated data distribution.
