Scalable Monte Carlo for Bayesian Learning
Paul Fearnhead, Christopher Nemeth, Chris J. Oates, Chris Sherlock
TL;DR
The book develops scalable Monte Carlo methods for Bayesian learning by unifying traditional MCMC with stochastic-gradient, non-reversible, and continuous-time approaches. It shows how to leverage gradient information, subsampling, and PDMPs to scale sampling to large datasets and high-dimensional parameter spaces, while rigorously addressing bias, variance, and convergence. Key contributions include a detailed treatment of SGMCMC (SGLD/SGHMC/SGRLD), kernel-based tools for analysis, and practical guidance for tuning and assessing convergence using metrics like kernel Stein discrepancies. The frameworks and algorithms presented—ranging from MH-based schemes to PDMP-driven continuous-time samplers—offer scalable, flexible options for Bayesian inference in modern data-rich settings, with empirical demonstrations on logistic regression, Bayesian neural networks, and time-series models. Overall, the text provides a comprehensive roadmap for implementing fast, reliable Bayesian computation at scale, connecting theory to practice through both analytic results and real-data experiments.
Abstract
This book aims to provide a graduate-level introduction to advanced topics in Markov chain Monte Carlo (MCMC) algorithms, as applied broadly in the Bayesian computational context. Most, if not all of these topics (stochastic gradient MCMC, non-reversible MCMC, continuous time MCMC, and new techniques for convergence assessment) have emerged as recently as the last decade, and have driven substantial recent practical and theoretical advances in the field. A particular focus is on methods that are scalable with respect to either the amount of data, or the data dimension, motivated by the emerging high-priority application areas in machine learning and AI.
