Unifying Physics- and Data-Driven Modeling via Novel Causal Spatiotemporal Graph Neural Network for Interpretable Epidemic Forecasting
Shuai Han, Lukas Stelz, Thomas R. Sokolowski, Kai Zhou, Horst Stöcker
TL;DR
This work tackles epidemic forecasting by unifying physics-based SIR dynamics with data-driven graph learning in a Causal Spatiotemporal Graph Neural Network (CSTGNN). It introduces the Spatio-Contact SIR (SCSIR) as a causal prior and combines an adaptive static mobility graph with a learned dynamic graph, enhanced by temporal decomposition, to model inter-regional transmission and temporal evolution. Empirical results on China and Germany datasets show CSTGNN achieving strong or state-of-the-art accuracy across short- and long-term horizons while providing interpretable parameters such as the contact matrix $c$ and the effective reproduction number $R_0(t)$. The framework offers actionable epidemiological insights and a principled path to deploy interpretable, data-driven forecasts in public health decision-making.
Abstract
Accurate epidemic forecasting is crucial for effective disease control and prevention. Traditional compartmental models often struggle to estimate temporally and spatially varying epidemiological parameters, while deep learning models typically overlook disease transmission dynamics and lack interpretability in the epidemiological context. To address these limitations, we propose a novel Causal Spatiotemporal Graph Neural Network (CSTGNN), a hybrid framework that integrates a Spatio-Contact SIR model with Graph Neural Networks (GNNs) to capture the spatiotemporal propagation of epidemics. Inter-regional human mobility exhibits continuous and smooth spatiotemporal patterns, leading to adjacent graph structures that share underlying mobility dynamics. To model these dynamics, we employ an adaptive static connectivity graph to represent the stable components of human mobility and utilize a temporal dynamics model to capture fluctuations within these patterns. By integrating the adaptive static connectivity graph with the temporal dynamics graph, we construct a dynamic graph that encapsulates the comprehensive properties of human mobility networks. Additionally, to capture temporal trends and variations in infectious disease spread, we introduce a temporal decomposition model to handle temporal dependence. This model is then integrated with a dynamic graph convolutional network for epidemic forecasting. We validate our model using real-world datasets at the provincial level in China and the state level in Germany. Extensive studies demonstrate that our method effectively models the spatiotemporal dynamics of infectious diseases, providing a valuable tool for forecasting and intervention strategies. Furthermore, analysis of the learned parameters offers insights into disease transmission mechanisms, enhancing the interpretability and practical applicability of our model.
