Deep Unsupervised Clustering with Gaussian Mixture Variational Autoencoders
Nat Dilokthanakul, Pedro A. M. Mediano, Marta Garnelo, Matthew C. H. Lee, Hugh Salimbeni, Kai Arulkumaran, Murray Shanahan
TL;DR
This work introduces Gaussian Mixture Variational Autoencoders (GMVAE) to enable unsupervised clustering within deep generative models by employing a Gaussian mixture prior over the latent space. It identifies over-regularisation as a core issue arising from the discrete latent prior and demonstrates that a minimum information constraint can stabilize training and preserve cluster structure. A tractable ELBO is derived using a conditional prior term, allowing backpropagation without sampling discrete variables and enabling efficient optimization. Experiments on synthetic data, MNIST, and SVHN show that GMVAE discovers distinct, interpretable clusters and yields competitive unsupervised clustering performance, with latent variables that disentangle class information (z) from style (w).
Abstract
We study a variant of the variational autoencoder model (VAE) with a Gaussian mixture as a prior distribution, with the goal of performing unsupervised clustering through deep generative models. We observe that the known problem of over-regularisation that has been shown to arise in regular VAEs also manifests itself in our model and leads to cluster degeneracy. We show that a heuristic called minimum information constraint that has been shown to mitigate this effect in VAEs can also be applied to improve unsupervised clustering performance with our model. Furthermore we analyse the effect of this heuristic and provide an intuition of the various processes with the help of visualizations. Finally, we demonstrate the performance of our model on synthetic data, MNIST and SVHN, showing that the obtained clusters are distinct, interpretable and result in achieving competitive performance on unsupervised clustering to the state-of-the-art results.
