Some aspects of robustness in modern Markov Chain Monte Carlo
Sam Power, Giorgos Vasdekis
TL;DR
The paper surveys robustness concerns in modern MCMC when target distributions display rough local geometry or heavy tails, identifying two main pathologies and evaluating a spectrum of remedies. It connects standard diffusion-based MCMC (e.g., overdamped Langevin, MALA) and PDMP-based methods to practical algorithms designed for stability, including Truncated, Tamed, Proximal, Barker, and non-quadratic kinetic-energy variants, as well as PDMC. For heavy tails, it analyzes space- and time-transformations as means to obtain lighter-tailed targets or faster tail exploration, with concrete examples like Cauchy and Laplace-type targets. The work emphasizes open problems in local adaptation, discretisation strategies, and principled model problems, and argues for robust methods that maintain performance in well-behaved settings while gracefully degrading under pathologies, with significant practical implications for Bayesian computation and high-dimensional inference.
Abstract
Markov Chain Monte Carlo (MCMC) is a flexible approach to approximate sampling from intractable probability distributions, with a rich theoretical foundation and comprising a wealth of exemplar algorithms. While the qualitative correctness of MCMC algorithms is often easy to ensure, their practical efficiency is contingent on the `target' distribution being reasonably well-behaved. In this work, we concern ourself with the scenario in which this good behaviour is called into question, reviewing an emerging line of work on `robust' MCMC algorithms which can perform acceptably even in the face of certain pathologies. We focus on two particular pathologies which, while simple, can already have dramatic effects on standard `local' algorithms. The first is roughness, whereby the target distribution varies so rapidly that the numerical stability of the algorithm is tenuous. The second is flatness, whereby the landscape of the target distribution is instead so barren and uninformative that one becomes lost in uninteresting parts of the state space. In each case, we formulate the pathology in concrete terms, review a range of proposed algorithmic remedies to the pathology, and outline promising directions for future research.
