The VampPrior Mixture Model
Andrew A. Stirn, David A. Knowles
TL;DR
The VampPrior Mixture Model (VMM) introduces a Bayesian Gaussian Mixture prior to replace the standard $p(z)=\mathcal{N}(0,I)$ in deep latent variable models, enabling automatic, DP-GMM–like clustering in latent space. It employs an alternating variational inference and empirical Bayes (MAP-EM) procedure to learn variational parameters and prior hyperparameters, while modeling cluster centers as distributions via $\mu_j \sim q_\phi(\mu_j;u_j)$ with widths $\Lambda_j$. Empirical results show that VMM achieves strong clustering on image benchmarks and substantially improves scRNA-seq integration when embedded in scVI, outperforming both the standard Gaussian prior and VampPrior-based approaches in key metrics. The work demonstrates that a flexible DP-GMM prior and cluster-center distributions yield better interpretability and performance, with the VMM providing intermediate complexity between a plain VAE and the VampPrior, and offering tunable cluster granularity for diverse data modalities.
Abstract
Widely used deep latent variable models (DLVMs), in particular Variational Autoencoders (VAEs), employ overly simplistic priors on the latent space. To achieve strong clustering performance, existing methods that replace the standard normal prior with a Gaussian mixture model (GMM) require defining the number of clusters to be close to the number of expected ground truth classes a-priori and are susceptible to poor initializations. We leverage VampPrior concepts (Tomczak and Welling, 2018) to fit a Bayesian GMM prior, resulting in the VampPrior Mixture Model (VMM), a novel prior for DLVMs. In a VAE, the VMM attains highly competitive clustering performance on benchmark datasets. Integrating the VMM into scVI (Lopez et al., 2018), a popular scRNA-seq integration method, significantly improves its performance and automatically arranges cells into clusters with similar biological characteristics.
