Validating Bayesian Inference Algorithms with Simulation-Based Calibration
Sean Talts, Michael Betancourt, Daniel Simpson, Aki Vehtari, Andrew Gelman
TL;DR
<3-5 sentences high-level summary>Simulation-Based Calibration (SBC) provides a general, ground-truth–free framework to validate Bayesian inference algorithms by leveraging the Bayesian joint distribution $\pi(y,\theta)=\pi(y|\theta)\pi(\theta)$ and the posterior $\pi(\theta|\tilde{y})$. The method uses rank statistics derived from prior draws and corresponding posteriors across many simulated replications to test whether the data-averaged posterior is calibrated to the prior, with uniform rank distributions indicating correctness. Deviations from uniformity reveal issues such as autocorrelation, mis-specification, or biases in algorithms like MCMC, ADVI, and INLA, and the paper demonstrates this through misspecified priors, biased MCMC, and spatial modeling examples. SBC is computationally intensive but highly parallelizable and serves as a valuable component of a robust Bayesian workflow alongside posterior predictive checks.
Abstract
Verifying the correctness of Bayesian computation is challenging. This is especially true for complex models that are common in practice, as these require sophisticated model implementations and algorithms. In this paper we introduce \emph{simulation-based calibration} (SBC), a general procedure for validating inferences from Bayesian algorithms capable of generating posterior samples. This procedure not only identifies inaccurate computation and inconsistencies in model implementations but also provides graphical summaries that can indicate the nature of the problems that arise. We argue that SBC is a critical part of a robust Bayesian workflow, as well as being a useful tool for those developing computational algorithms and statistical software.
