A Conceptual Introduction to Hamiltonian Monte Carlo
Michael Betancourt
TL;DR
This paper reframes Hamiltonian Monte Carlo (HMC) as a geometry-driven Markov transition method that leverages phase-space dynamics to efficiently explore the typical set of high-dimensional distributions. By introducing momentum and Hamiltonian dynamics, it explains how to design transitions that coherently traverse parameter space, and it details practical tuning via kinetic energy choices, integration-time strategies, and symplectic integrators with Metropolis corrections. The work highlights diagnostics and robustness results, argues for adaptive methods like Euclidean and Riemannian metric choices, and discusses termination criteria such as No-U-Turn to achieve dynamic, efficient sampling. Collectively, the approach enables scalable, principled Bayesian computation and underpins modern tools like Stan, with broad implications for high-dimensional statistical modeling.
Abstract
Hamiltonian Monte Carlo has proven a remarkable empirical success, but only recently have we begun to develop a rigorous understanding of why it performs so well on difficult problems and how it is best applied in practice. Unfortunately, that understanding is confined within the mathematics of differential geometry which has limited its dissemination, especially to the applied communities for which it is particularly important. In this review I provide a comprehensive conceptual account of these theoretical foundations, focusing on developing a principled intuition behind the method and its optimal implementations rather of any exhaustive rigor. Whether a practitioner or a statistician, the dedicated reader will acquire a solid grasp of how Hamiltonian Monte Carlo works, when it succeeds, and, perhaps most importantly, when it fails.
