Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm

James Neill; Lloyd A. C. Chapman; Chris Jewell

Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm

James Neill, Lloyd A. C. Chapman, Chris Jewell

TL;DR

The paper tackles inference in discrete-valued, high-dimensional epidemic state-transition models by framing the problem as a coupled hidden Markov model and proposing a novel non-centered data-augmentation MCMC algorithm, the Rippler method, to jointly infer latent states and parameters. Rippler perturbs a latent uniform matrix $U$ to generate proposed latent states $X^*$, moving computations of event probabilities into the proposal step and accepting via Metropolis-Hastings. A data-informed extension uses observations to shape proposals and normalizing constants, improving mixing in complex models; Rippler scales linearly with the number of states $S$, outperforming RJMCMC and iFFBS as $S$ grows. Simulations across SIR, SEIR, and multi-strain models show improved latent-space exploration and scalability, with open-source Python code provided for reproducibility.

Abstract

Stochastic state-transition models of infectious disease transmission can be used to deduce relevant drivers of transmission when fitted to data using statistically principled methods. Fitting this individual-level data requires inference on individuals' unobserved disease statuses over time, which form a high-dimensional and highly correlated state space. We introduce a novel Bayesian (data-augmentation Markov chain Monte Carlo) algorithm for jointly estimating the model parameters and unobserved disease statuses, which we call the Rippler algorithm. This is a non-centred method that can be applied to any individual-based state-transition model. We compare the Rippler algorithm to the state-of-the-art inference methods for individual-based stochastic epidemic models and find that it performs better than these methods as the number of disease states in the model increases.

Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm

TL;DR

to generate proposed latent states

, moving computations of event probabilities into the proposal step and accepting via Metropolis-Hastings. A data-informed extension uses observations to shape proposals and normalizing constants, improving mixing in complex models; Rippler scales linearly with the number of states

, outperforming RJMCMC and iFFBS as

grows. Simulations across SIR, SEIR, and multi-strain models show improved latent-space exploration and scalability, with open-source Python code provided for reproducibility.

Abstract

Paper Structure (30 sections, 3 theorems, 41 equations, 15 figures, 2 algorithms)

This paper contains 30 sections, 3 theorems, 41 equations, 15 figures, 2 algorithms.

Introduction
Epidemic Modelling
Coupled Hidden Markov Models
Transition Probabilities for Epidemics
Simulation and Non-centering
Bayesian Inference: The Rippler Algorithm
Overview
Proposal Mechanism
Acceptance Probability
Adaptive Tuning
Summary
Data-informed Rippler Algorithm
Bayesian Inference: Existing Methods
Existing Inference Methods
Computational Complexity Comparison
...and 15 more sections

Key Result

Proposition 1

We have $\pi(\boldsymbol{X}|\boldsymbol{\theta})q_1(\boldsymbol{U}|\boldsymbol{\theta},\boldsymbol{X}) = 1$ and $\pi(\boldsymbol{X}^{\ast}|\boldsymbol{\theta})q_1(\boldsymbol{U}^{\ast}|\boldsymbol{\theta},\boldsymbol{X}^{\ast}) = 1$.

Figures (15)

Figure 1: Visualisation of a coupled hidden Markov model with $\mathcal{N} =3$ (from $t=1$ to $t=4$). Note that only a subset of time-individual points have observations. Based on a figure from touloupou2020scalable.
Figure 2: Visualisation of the SIR model (where $\beta$ is the infection rate, $\gamma$ is the recovery rate, and $I(t)$ is the current number of infective individuals).
Figure 3: Posterior samples of the number of individuals in each state over time, using the Rippler (top left), data-informed Rippler (top right), reversible-jump (bottom left), and iFFBS (bottom right) methods on the SIR model. The median values are shown in the solid lines and the 95% credible intervals are shown in the shaded regions. The true values are shown by the dashed lines.
Figure 4: Visualisation of the SEIR model with $\mathcal{S}_E=3$ exposure steps.
Figure 5: Relative time taken (left) and MAJD per relative unit time (right) for hidden state inference methods on the extended exposure model (from $\mathcal{S}=4$ to $\mathcal{S}=10$).
...and 10 more figures

Theorems & Definitions (3)

Proposition 1
Proposition 2
Proposition 3

Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm

TL;DR

Abstract

Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (15)

Theorems & Definitions (3)