Table of Contents
Fetching ...

Generative Modeling: A Review

Maria Nareklishvili, Nick Polson, Vadim Sokolov

TL;DR

The paper surveys simulation-based posterior inference methods that avoid explicit likelihood evaluation by learning a posterior map $\theta = G(Y, Z)$ with $Z \sim p(Z)$. It presents a unifying generative framework and covers architectures such as ABC, VAEs, ICA, normalizing flows, invertible networks, GANs, diffusion models, and deep fiducial inference, with theoretical grounding from the Noise Outsourcing Theorem. A central theme is quantile-based and likelihood-free approaches that enable fast, scalable posterior sampling and uncertainty quantification, illustrated through a Ebola outbreak application and various numerical examples. The work highlights the practical significance of generative Bayesian computation for high-dimensional outcomes and incomplete models, offering a roadmap for future research in learning flexible, data-driven posterior maps. $p(\theta|Y)$ is approximated via deterministic encoders/decoders and latent-variable mappings, enabling efficient posterior draws without repeated likelihood evaluation, which is particularly valuable in complex scientific domains and real-time decision tasks.

Abstract

Generative methods (Gen-AI) are reviewed with a particular goal of solving tasks in machine learning and Bayesian inference. Generative models require one to simulate a large training dataset and to use deep neural networks to solve a supervised learning problem. To do this, we require high-dimensional regression methods and tools for dimensionality reduction (a.k.a. feature selection). The main advantage of Gen-AI methods is their ability to be model-free and to use deep neural networks to estimate conditional densities or posterior quintiles of interest. To illustrate generative methods , we analyze the well-known Ebola data set. Finally, we conclude with directions for future research.

Generative Modeling: A Review

TL;DR

The paper surveys simulation-based posterior inference methods that avoid explicit likelihood evaluation by learning a posterior map with . It presents a unifying generative framework and covers architectures such as ABC, VAEs, ICA, normalizing flows, invertible networks, GANs, diffusion models, and deep fiducial inference, with theoretical grounding from the Noise Outsourcing Theorem. A central theme is quantile-based and likelihood-free approaches that enable fast, scalable posterior sampling and uncertainty quantification, illustrated through a Ebola outbreak application and various numerical examples. The work highlights the practical significance of generative Bayesian computation for high-dimensional outcomes and incomplete models, offering a roadmap for future research in learning flexible, data-driven posterior maps. is approximated via deterministic encoders/decoders and latent-variable mappings, enabling efficient posterior draws without repeated likelihood evaluation, which is particularly valuable in complex scientific domains and real-time decision tasks.

Abstract

Generative methods (Gen-AI) are reviewed with a particular goal of solving tasks in machine learning and Bayesian inference. Generative models require one to simulate a large training dataset and to use deep neural networks to solve a supervised learning problem. To do this, we require high-dimensional regression methods and tools for dimensionality reduction (a.k.a. feature selection). The main advantage of Gen-AI methods is their ability to be model-free and to use deep neural networks to estimate conditional densities or posterior quintiles of interest. To illustrate generative methods , we analyze the well-known Ebola data set. Finally, we conclude with directions for future research.
Paper Structure (30 sections, 93 equations, 12 figures)

This paper contains 30 sections, 93 equations, 12 figures.

Figures (12)

  • Figure 1: Approximate Bayesian Computation (ABC) framework.
  • Figure 2: Variational Autoencoders. The encoder ($q_\phi$) and decoder ($p_\theta$) are composite functions that map between data space ($X_i$) and latent space ($Z_i$). The encoder compresses input data into a meaningful latent representation, from which the decoder generates new, similar examples.
  • Figure 3: Linear Independent Component Analysis.
  • Figure 4: Normalizing Flows.
  • Figure 5: Generative Adversarial Networks.
  • ...and 7 more figures

Theorems & Definitions (2)

  • proof
  • proof