Hitchhiker's guide on the relation of Energy-Based Models with other generative models, sampling and statistical physics: a comprehensive review
Davide Carbone
TL;DR
Energy-Based Models (EBMs) provide a probabilistic framework where the target distribution is the Boltzmann-Gibbs density $\rho(x) \propto e^{-\beta U_\theta(x)}$ with partition function $Z_\theta$. This review argues for viewing EBMs as a natural interface between statistical physics, sampling theory, and modern generative models, contrasting them with VAEs, GANs, diffusion models, and normalizing flows. It surveys sampling-based and sampling-free training methods, including Langevin/MCMC approaches and score-based objectives, and explains how non-equilibrium ideas and free energy illuminate training dynamics and evaluation. The work highlights practical considerations, historical context, and open directions, advocating deeper physics-informed methodologies to advance interpretable and scalable generative modeling.
Abstract
Energy-Based Models have emerged as a powerful framework in the realm of generative modeling, offering a unique perspective that aligns closely with principles of statistical mechanics. This review aims to provide physicists with a comprehensive understanding of EBMs, delineating their connection to other generative models such as Generative Adversarial Networks, Variational Autoencoders, and Normalizing Flows. We explore the sampling techniques crucial for EBMs, including Markov Chain Monte Carlo (MCMC) methods, and draw parallels between EBM concepts and statistical mechanics, highlighting the significance of energy functions and partition functions. Furthermore, we delve into recent training methodologies for EBMs, covering recent advancements and their implications for enhanced model performance and efficiency. This review is designed to clarify the often complex interconnections between these models, which can be challenging due to the diverse communities working on the topic.
