Amortised and provably-robust simulation-based inference

Ayush Bharti; Charita Dellaporta; Yuga Hikida; François-Xavier Briol

Amortised and provably-robust simulation-based inference

Ayush Bharti, Charita Dellaporta, Yuga Hikida, François-Xavier Briol

TL;DR

This work tackles the brittleness of likelihood-free SBI under outliers and misspecification by introducing Neural Score-Matching Bayes (NSM-Bayes), which fuses a neural conditional density surrogate with a weighted score-matching loss inside Generalised Bayesian Inference. It offers two implementations: a general NSM-Bayes and a conjugate NSM-Bayes-conj that yields closed-form posterior updates for Gaussian priors, enabling fully amortised inference. The authors prove robustness guarantees for NSM-Bayes and demonstrate empirical strength across g-and-k, SIR with under-reporting, and radio propagation models, showing superior calibration, robustness, and competitive runtime relative to robust SBI baselines. The approach scales to non-differentiable simulators and leverages IMQ weighting to down-weight outliers, with practical calibration of the learning rate β to match nominal coverage. Limitations include challenges with highly multimodal likelihoods and discrete data, suggesting avenues for extending score-matching robustness to broader SBI settings.

Abstract

Complex simulator-based models are now routinely used to perform inference across the sciences and engineering, but existing inference methods are often unable to account for outliers and other extreme values in data which occur due to faulty measurement instruments or human error. In this paper, we introduce a novel approach to simulation-based inference grounded in generalised Bayesian inference and a neural approximation of a weighted score-matching loss. This leads to a method that is both amortised and provably robust to outliers, a combination not achieved by existing approaches. Furthermore, through a carefully chosen conditional density model, we demonstrate that inference can be further simplified and performed without the need for Markov chain Monte Carlo sampling, thereby offering significant computational advantages, with complexity that is only a small fraction of that of current state-of-the-art approaches.

Amortised and provably-robust simulation-based inference

TL;DR

Abstract

Paper Structure (54 sections, 8 theorems, 57 equations, 12 figures, 5 tables, 3 algorithms)

This paper contains 54 sections, 8 theorems, 57 equations, 12 figures, 5 tables, 3 algorithms.

Introduction
Background
Approximate Bayesian Inference for Simulators
Generalised Bayesian Inference for Simulators
Methodology
Neural Score-matching Bayes for Simulation-based Inference
Conjugate Neural Score-matching Bayes
Hyperparameter Selection
Robustness
Lack of Robustness of Neural Likelihood Estimation
Robustness of Neural Score-matching Bayes
Robustness with Mixture Density Networks and Normalising Flows
Mixture density network.
Masked autoregressive flows.
Robustness of Conjugate Neural Score-matching Bayes
...and 39 more sections

Key Result

Proposition 1

Suppose assumption:exptheta holds, $\pi(\theta) \propto \mathcal{N}(\theta;\mu,\Sigma)$, and $W(x) = I_{d_\mathcal{X}} w(x)$ for some $w: \mathcal{X} \rightarrow \mathbb{R}$. Then $\pi_{\text{NSM}}(\theta|x_{1:n}^o, \hat{\phi}_m) \propto \mathcal{N}(\theta;\mu_{n,m},\Sigma_{n,m})$ where:

Figures (12)

Figure 1: Amortised SBI in the presence of outliers. With a few one-sided outliers (10%) in the observed data ( ), shown in the top-right plot, the neural likelihood estimation (NLE ) posterior of the g-and-k distribution is significantly impacted, specifically parameters $\theta_2$ and $\theta_3$ that govern the scale and skewness of the distribution (details in \ref{['sec:gnk']}). In contrast, our proposed method, named neural score-matching Bayes ( ), is robust as we use the function $w(x)$ ( ) to down-weight the effect of the outliers.
Figure 2: The g-and-k distribution. Kernel density estimates of the marginal posterior distributions from NLE ( ), NSM-Bayes ( ), NSM-Bayes-conj ( ), ACE ( ), NPL-MMD ( ), NPE-RS ( ), RSNL ( ), and GBI-SR ( ) from 500 samples, along with the reference NLE posterior ( ) given uncorrupted data. The dashed and dotted gray lines denote the true parameter value and the prior distribution, respectively.
Figure 3: Observed uncorrupted ( ) and undercounted ( ) trajectories of incidence from the SIR model with $\epsilon=5\%$ and $50\%$ retention probability.
Figure 4: SIR model. Performance vs. posterior inference time for NLE ( ), NSM-Bayes ( ), NSM-Bayes-conj ( ), GBI-SR ( ), and ACE ( ) under both undercounting and heavy-tailed contamination. In both cases, NSM-Bayes-conj performs the best whilst incurring the least time.
Figure 4: Ablation study on NSM-Bayes-conj method involving the estimators used in the IMQ weight function $w(x)$.
...and 7 more figures

Theorems & Definitions (16)

Proposition 1
Theorem 1
Theorem 2
Corollary 1
Proposition 2
Proposition 3
Theorem 3
proof
proof
proof
...and 6 more

Amortised and provably-robust simulation-based inference

TL;DR

Abstract

Amortised and provably-robust simulation-based inference

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (16)