Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic Response
Agatha Schmidt, Henrik Zunker, Alexander Heinlein, Martin J. Kühn
TL;DR
The paper addresses the need for rapid, reliable forecasts in time-critical pandemic settings by pairing a validated spatially resolved mechanistic metapopulation model with a graph neural network (GNN) surrogate. It trains a seven-layer ARMAConv GNN on a 400-node county graph to reproduce demographically and spatially resolved trajectories under outbreak and persistent-threat regimes, achieving $MAPE$ in the range of $10\%-27\%$ over forecast horizons of $30$–$90$ days. The surrogate delivers near-constant runtime as horizon increases and accelerates inference by up to $2.87\times 10^{4}$ fold compared with the mechanistic model, enabling interactive, web-based scenario exploration. This work demonstrates that GNN surrogates can translate complex metapopulation dynamics into fast, interpretable decision-support tools with explicit ties to expert models, and outlines future directions such as incorporating edge weights and vaccination dynamics for broader applicability.
Abstract
During the COVID-19 crisis, mechanistic models have guided evidence-based decision making. However, time-critical decisions in a dynamical environment limit the time available to gather supporting evidence. We address this bottleneck by developing a graph neural network (GNN) surrogate of a spatially and demographically resolved mechanistic metapopulation simulator. This combined approach advances classical machine learning approaches which are often black box. Our design of experiments spans outbreak and persistent-threat regimes, up to three contact change points, and age-structured contact matrices on a 400-node spatial graph. We benchmark multiple GNN layers and identify an ARMAConv-based architecture that offers a strong accuracy-runtime trade-off. Across 30-90 day horizons and up to three contact change points, the surrogate attains 10-27 % mean absolute percentage error (MAPE) while delivering (near) constant runtime with respect to the forecast horizon. Our approach accelerates evaluation by up to 28,670 times compared with the mechanistic model, allowing responsive decision support in time-critical scenarios and straightforward web integration. These results show how GNN surrogates can translate complex metapopulation models into immediate, reliable tools for pandemic response.
