On micromodes in Bayesian posterior distributions and their implications for MCMC
Sanket Agrawal, Sebastiano Grazzi, Gareth O. Roberts
TL;DR
This paper reveals that heavy-tailed data induce micromodes—local posterior peaks near isolated extreme observations—in Bayesian posteriors for high-dimensional location models. It connects posterior geometry to ZZP-based MCMC performance, deriving an Arrhenius-type exit law and a tail-mismatch–driven phase transition that can severely degrade sampling when the tail of the model is mis-specified. Central to the analysis are precise characterizations of micromode location and width via extreme-value theory and a high-precision empirical-score approximation, enabling tractable analysis of exit times. The results highlight practical cautions for robust Bayesian modeling: allowing sufficiently heavy tails can mitigate micromode effects, while underdispersed tails can trap samplers and substantially slow convergence. The work thus links posterior landscape geometry with computational dynamics in a principled way, with potential extensions to other MCMC algorithms and more complex regression settings.
Abstract
We investigate the existence and severity of local modes in posterior distributions from Bayesian analyses. These are known to occur in posterior tails resulting from heavy-tailed error models such as those used in robust regression. To understand this phenomenon clearly, we consider in detail location models with Student-$t$ errors in dimension $d$ with sample size $n$. For sufficiently heavy-tailed data-generating distributions, extreme observations become increasingly isolated as $n \to \infty$. We show that each such observation induces a unique local posterior mode with probability tending to $1$. We refer to such a local mode as a micromode. These micromodes are typically small in height but their domains of attraction are large and grow polynomially with $n$. We then connect this posterior geometry to computation. We establish an Arrhenius law for the time taken by one-dimensional piecewise deterministic Monte Carlo algorithms to exit these micromodes. Our analysis identifies a phase transition where a misspecified and overly underdispersed model causes exit times to increase sharply, leading to a pronounced deterioration in sampling performance.
