Amortized Bayesian Workflow
Chengkun Li, Aki Vehtari, Paul-Christian Bürkner, Stefan T. Radev, Luigi Acerbi, Marvin Schmitt
TL;DR
The Amortized Bayesian Workflow tackles the scalability challenge of Bayesian inference across large collections of datasets by integrating fast amortized posterior approximations with gold-standard MCMC when necessary. It introduces a diagnostic-driven pipeline that reuses computations across steps, employing PSIS refinements and amortized initializations to accelerate convergence while preserving posterior quality. Across four diverse problems, the approach achieves substantial runtime reductions (5×–120×) with accuracy close to reference MCMC posteriors, even under distribution shifts. The modular design supports flexible component replacement and offers a practical path toward scalable, trustworthy Bayesian inference in real-world, data-rich settings.
Abstract
Bayesian inference often faces a trade-off between computational speed and sampling accuracy. We propose an adaptive workflow that integrates rapid amortized inference with gold-standard MCMC techniques to achieve a favorable combination of both speed and accuracy when performing inference on many observed datasets. Our approach uses principled diagnostics to guide the choice of inference method for each dataset, moving along the Pareto front from fast amortized sampling via generative neural networks to slower but guaranteed-accurate MCMC when needed. By reusing computations across steps, our workflow synergizes amortized and MCMC-based inference. We demonstrate the effectiveness of this integrated approach on several synthetic and real-world problems with tens of thousands of datasets, showing efficiency gains while maintaining high posterior quality.
