Large deviations for Independent Metropolis Hastings and Metropolis-adjusted Langevin algorithm
Federica Milinanni, Pierre Nyquist
TL;DR
The paper addresses the problem of establishing large deviation principles for the empirical measures of Metropolis-Hastings-based samplers on continuous state spaces, focusing on IMH and MALA and clarifying limitations for RWM. The authors leverage a Lyapunov-function–based framework from prior work to derive LDPs with speed $n$ and rate function $I( u) = \inf_{\gamma\in A(\nu)} R(\gamma \parallel \nu \otimes K)$, proving precise conditions under which IMH and MALA admit LDPs. They show that IMH admits an LDP when the target and proposal tails satisfy specific relations (e.g., $\alpha=\beta$ with $\eta>\gamma$ or $\alpha>\beta$), and that MALA admits an LDP in the regimes $\beta=2$, $\varepsilon\gamma<2$ or $1<\beta<2$, with explicit Lyapunov functions; conversely, for RWM no Lyapunov function of the required form exists, underscoring the limits of the current framework. These results mark the first LDPs for concrete MH dynamics on uncountable spaces and illuminate the link between geometric ergodicity and large deviations, highlighting directions for extending LDP theory to broader MH schemes.
Abstract
In this paper, we prove large deviation principles for the empirical measures associated with the Independent Metropolis Hastings (IMH) sampler and the Metropolis-adjusted Langevin Algorithm (MALA). These are the first large deviation results for empirical measures of Markov chains arising from specific Metropolis-Hastings methods on a continuous state space. Moreover, we show that the existing large deviation framework, that we developed in a previous work (Milinanni and Nyquist, 2024), does not cover the Random Walk Metropolis sampler, even in cases when the underlying Markov chain is geometrically ergodic.
