Normal approximation for the posterior in exponential families
Adrian Fischer, Robert E. Gaunt, Gesine Reinert, Yvik Swan
TL;DR
This work provides non-asymptotic Bernstein–von Mises type bounds for the normal approximation of posteriors in exponential-family models using Stein's method, with explicit total variation and Wasserstein bounds that apply to both univariate and multivariate posteriors and accommodate non-conjugate priors. By allowing arbitrary centering and scaling and leveraging a third-order (and optionally fourth-order) Taylor expansion of the log-likelihood, the authors derive bounds that depend explicitly on priors and sufficient statistics, and they demonstrate faster-than-$O(n^{-1/2})$ rates in certain settings. The framework is illustrated across a range of exponential-family models—Bernoulli, Poisson, normal-with-known-mean, Weibull, multinomial, univariate normal with unknown mean/variance, and linear regression with unknown variance—highlighting how dimension, standardisation, and priors influence Gaussian proximity and convergence rates. The results furnish practical, finite-sample guarantees for Laplace-type approximations in Bayesian inference and offer new insights into high-dimensional posterior behavior.
Abstract
In this paper, we obtain quantitative, non-asymptotic, and data-dependent \textit{Bernstein-von Mises type} bounds on the normal approximation of the posterior distribution in exponential family models with arbitrary centring and scaling. Our bounds, stated in the total variation and Wasserstein distances, are valid for univariate and multivariate posteriors alike, and do not require a conjugate prior setting. They are obtained through a refined version of Stein's method of comparison of operators that allows for improved dimensional dependence in high-dimensional settings and may also be of interest in other problems. Our approach is rather flexible and, in certain settings, allows for the derivation of bounds with rates of convergence faster than the usual \( O(n^{-1/2}) \) rate (when \( n \) is the sample size). We illustrate our findings on a variety of exponential family distributions, including the Weibull, multinomial, and linear regression with unknown variance. The resulting bounds have an explicit dependence on the prior distribution and on sufficient statistics of the data from the sample, and thus provide insight into how these factors affect the quality of the normal approximation. Insights from our examples include identification of conditions under which faster \( O(n^{-1}) \) convergence rates occur for Bernoulli data, illustrations of how the quality of the normal approximation is influenced by the choice of standardisation, and dimensional dependence in high-dimensional settings.
