Beyond a Single Mode: GAN Ensembles for Diverse Medical Data Generation
Lorenzo Tronchin, Tommy Löfstedt, Paolo Soda, Valerio Guarrasi
TL;DR
This work tackles the fidelity-diversity-efficiency trilemma in medical imaging by constructing ensembles of GANs rather than relying on a single model. It introduces a multi-objective Pareto optimization that selects a compact, non-redundant subset of GANs—varying architectures, losses, and training iterations—to maximize fidelity to the real data distribution while maximizing coverage of its diversity. Across three diverse medical datasets, the Pareto-derived ensemble G* consistently improves downstream diagnostic performance relative to single GANs and naive ensembles, reducing the real-synthetic performance gap. The approach leverages SwAV-based embeddings for distribution quality and demonstrates practical benefits for data-scarce medical contexts, while highlighting limitations such as static ensemble usage and computational costs, with future work aimed at dynamic, budget-aware ensembles.
Abstract
The advancement of generative AI, particularly in medical imaging, confronts the trilemma of ensuring high fidelity, diversity, and efficiency in synthetic data generation. While Generative Adversarial Networks (GANs) have shown promise across various applications, they still face challenges like mode collapse and insufficient coverage of real data distributions. This work explores the use of GAN ensembles to overcome these limitations, specifically in the context of medical imaging. By solving a multi-objective optimisation problem that balances fidelity and diversity, we propose a method for selecting an optimal ensemble of GANs tailored for medical data. The selected ensemble is capable of generating diverse synthetic medical images that are representative of true data distributions and computationally efficient. Each model in the ensemble brings a unique contribution, ensuring minimal redundancy. We conducted a comprehensive evaluation using three distinct medical datasets, testing 22 different GAN architectures with various loss functions and regularisation techniques. By sampling models at different training epochs, we crafted 110 unique configurations. The results highlight the capability of GAN ensembles to enhance the quality and utility of synthetic medical images, thereby improving the efficacy of downstream tasks such as diagnostic modelling.
