Scalability of Metropolis-within-Gibbs schemes for high-dimensional Bayesian models
Filippo Ascolani, Gareth O. Roberts, Giacomo Zanella
TL;DR
This work develops a general theory connecting coordinate-wise MCMC convergence to the Gibbs sampler via conditional conductance, enabling dimension-free mixing insights for high-dimensional Bayesian models. By bounding the approximate conductance of MwG updates in terms of the exact Gibbs conductance and analyzing auxiliary perturbation results, the authors establish dimension-free mixing times for MwG in two-level hierarchical structures and related applications. They provide concrete results for MwG with independent MH updates, conditionally log-concave targets, discrete data, binary regression with unknown prior variance, and diffusion data augmentation, showing that fast convergence can be achieved with manageable computational costs. The findings offer practical guidance on when MwG approaches rival gradient-based methods, and they extend the theoretical toolbox for combining MCMC convergence with Bayesian asymptotics in complex, large-scale models.
Abstract
We study general coordinate-wise MCMC schemes (such as Metropolis-within-Gibbs samplers), which are commonly used to fit Bayesian non-conjugate hierarchical models. We relate their convergence properties to the ones of the corresponding (potentially not implementable) Gibbs sampler through the notion of conditional conductance. This allows us to study the performances of popular Metropolis-within-Gibbs schemes for non-conjugate hierarchical models, in high-dimensional regimes where both number of datapoints and parameters increase. Given random data-generating assumptions, we establish dimension-free convergence results, which are in close accordance with numerical evidences. Applications to Bayesian models for binary regression with unknown hyperparameters and discretely observed diffusions are also discussed. Motivated by such statistical applications, auxiliary results of independent interest on approximate conductances and perturbation of Markov operators are provided.
