The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo

Matthew D. Hoffman; Andrew Gelman

The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo

Matthew D. Hoffman, Andrew Gelman

TL;DR

<3-5 sentence high-level summary> The No-U-Turn Sampler (NUTS) tackles the practical bottleneck of Hamiltonian Monte Carlo (HMC) by eliminating the need to pre-specify the trajectory length L, while retaining HMC’s efficient exploration of high-dimensional posteriors. It achieves this with a recursive, binary-tree doubling trajectory-building procedure that stops when the trajectory would turn back on itself, preserving detailed balance, and with a dual averaging scheme to adapt the step size ε automatically. Empirical results show NUTS matches or surpasses tuned HMC in efficiency across several challenging models, while offering turnkey applicability suitable for automatic inference engines like Stan. The work also outlines memory-efficient implementations and discusses extensions such as mass matrix adaptations and windowed sampling for future improvements. This approach significantly broadens the practical usability of gradient-based MCMC in complex Bayesian models.}

Abstract

Hamiltonian Monte Carlo (HMC) is a Markov chain Monte Carlo (MCMC) algorithm that avoids the random walk behavior and sensitivity to correlated parameters that plague many MCMC methods by taking a series of steps informed by first-order gradient information. These features allow it to converge to high-dimensional target distributions much more quickly than simpler methods such as random walk Metropolis or Gibbs sampling. However, HMC's performance is highly sensitive to two user-specified parameters: a step size ε and a desired number of steps L. In particular, if L is too small then the algorithm exhibits undesirable random walk behavior, while if L is too large the algorithm wastes computation. We introduce the No-U-Turn Sampler (NUTS), an extension to HMC that eliminates the need to set a number of steps L. NUTS uses a recursive algorithm to build a set of likely candidate points that spans a wide swath of the target distribution, stopping automatically when it starts to double back and retrace its steps. Empirically, NUTS perform at least as efficiently as and sometimes more efficiently than a well tuned standard HMC method, without requiring user intervention or costly tuning runs. We also derive a method for adapting the step size parameter ε on the fly based on primal-dual averaging. NUTS can thus be used with no hand-tuning at all. NUTS is also suitable for applications such as BUGS-style automatic inference engines that require efficient "turnkey" sampling algorithms.

The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo

TL;DR

Abstract

The No-U-Turn Sampler: Adaptively Setting Path Lengths in Hamiltonian Monte Carlo

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)