Causal Posterior Estimation
Simon Dirmeier, Antonietta Mira
TL;DR
CPE tackles SBI for simulator-based models with intractable likelihoods by embedding the model's conditional dependencies directly into normalizing-flow architectures. It introduces continuous and discrete NF variants, a prior-aligned base distribution, and a rectified, constant-time sampling scheme that yields accurate posterior inferences with high sampling efficiency. By leveraging causal factorization, block matrix projections, and time/data conditioning, CPE outperforms or matches state-of-the-art baselines across nine SBI benchmarks while using fewer trainable parameters. The approach connects to structured semiseparable matrices for computational efficiency on accelerators and offers a pathway for scalable, exact-like posterior estimation in complex graphical models. Overall, CPE advances simulator-based Bayesian inference by tightly integrating graphical model structure into neural posterior estimation and enabling fast, accurate sampling.
Abstract
We present Causal Posterior Estimation (CPE), a novel method for Bayesian inference in simulator models, i.e., models where the evaluation of the likelihood function is intractable or too computationally expensive, but where one can simulate model outputs given parameter values. CPE utilizes a normalizing flow-based (NF) approximation to the posterior distribution which carefully incorporates the conditional dependence structure induced by the graphical representation of the model into the neural network. Thereby it is possible to improve the accuracy of the approximation. We introduce both discrete and continuous NF architectures for CPE and propose a constant-time sampling procedure for the continuous case which reduces the computational complexity of drawing samples to O(1) as for discrete NFs. We show, through an extensive experimental evaluation, that by incorporating the conditional dependencies induced by the graphical model directly into the neural network, rather than learning them from data, CPE is able to conduct highly accurate posterior inference either outperforming or matching the state of the art in the field.
