Causal Inference Using Augmented Epidemic Models
Heejong Bong, Valérie Ventura, Larry Wasserman
TL;DR
This paper clarifies how time-varying interventions in epidemics can be analyzed causally by distinguishing augmented epidemic models as data-generating processes from their role as causal models. It demonstrates that standard maximum-likelihood or Bayesian estimation applied to augmented models yields the g-null paradox due to phantom variables, producing biased causal inferences. Four estimation strategies are analyzed, with Method 3 (marginal structural models via estimating equations using propensity scores) identified as the simplest and most natural approach, while Methods 2 and 4 offer principled alternatives that incorporate deeper causal structure. Through simulations and mobility-data applications, the authors show that MSM-based methods can provide consistent causal effects for time-varying interventions, enhance interpretability, and mitigate phantom-bias issues, albeit with modeling challenges in short time series and potential misspecification. The work has practical impact for scenario analysis and policy evaluation in epidemics, offering a principled framework for adjusting for confounders and time-varying interventions when only one or a few epidemic time series are available.
Abstract
Epidemic models describe the evolution of a communicable disease over time. These models are often modified to include the effects of interventions (control measures) such as vaccination, social distancing, school closings etc. Many such models were proposed during the COVID-19 epidemic. Inevitably these models are used to answer the question: What is the effect of the intervention on the epidemic? These models can either be interpreted as data generating models describing observed random variables or as causal models for counterfactual random variables. These two interpretations are often conflated in the literature. We discuss the difference between these two types of models, and then we discuss how to estimate the parameters of the model. Our focus is causal inference for parameters in epidemic models by adjusting for confounders, allowing time varying interventions.
