Variational Autoencoders for Efficient Simulation-Based Inference
Mayank Nautiyal, Andrey Shternshis, Andreas Hellander, Prashant Singh
TL;DR
This work tackles likelihood-free simulation-based inference by proposing two variational autoencoder-based approaches: CP-VAE, which uses a data-dependent prior $p(\mathbf{z}\mid \mathbf{y})$ to adapt latent structure to observed data, and UP-VAE, which relies on an unconditional Gaussian prior $p(\mathbf{z})$ for simpler training. Both models aim to efficiently approximate complex posteriors $p(\boldsymbol{\theta} \mid \mathbf{y})$ in an amortized setting, demonstrated on sbibm benchmarks and Hodgkin–Huxley models. CP-VAE tends to offer greater flexibility through the conditional prior, while UP-VAE provides a stable, simpler alternative with competitive accuracy and lower risk of overfitting. The paper positions these VAEs as efficient, interpretable competitors to flow-based, GAN-based, and diffusion SBI methods, with future directions including integrating normalizing flows, richer priors, and architectural enhancements to capture more structure in the data.
Abstract
We present a generative modeling approach based on the variational inference framework for likelihood-free simulation-based inference. The method leverages latent variables within variational autoencoders to efficiently estimate complex posterior distributions arising from stochastic simulations. We explore two variations of this approach distinguished by their treatment of the prior distribution. The first model adapts the prior based on observed data using a multivariate prior network, enhancing generalization across various posterior queries. In contrast, the second model utilizes a standard Gaussian prior, offering simplicity while still effectively capturing complex posterior distributions. We demonstrate the ability of the proposed approach to approximate complex posteriors while maintaining computational efficiency on well-established benchmark problems.
