Learning epidemic trajectories through Kernel Operator Learning: from modelling to optimal control
Giovanni Ziarelli, Nicola Parolini, Marco Verani
TL;DR
This work addresses the need for fast, data-driven forecasting and scenario analysis of epidemic trajectories under NPIs, without heavy calibration of traditional compartmental models. It introduces Kernel Operator Learning with two surrogates, $KOL$-$m$ and $KOL$-$\partial$, and evaluates their performance using Neural Tangent Kernels among others to approximate the state or its derivative as a function of the control $u(t)$, yielding a closed-form surrogate $\bar{\mathcal{G}}(u)(t)$ guided by RKHS theory. Compared to a neural-network-based model-learning method, KOL offers substantially faster training and often superior generalization across SIS, SIR, SEIRD dynamics, and enables rapid optimal-control analyses. The results show that KOL surrogates can reproduce key optimal-control metrics (e.g., eradication times, total infections) with costs comparable to or better than full-model benchmarks, supporting their use for fast policy analyses in epidemiology.
Abstract
Since infectious pathogens start spreading into a susceptible population, mathematical models can provide policy makers with reliable forecasts and scenario analyses, which can be concretely implemented or solely consulted. In these complex epidemiological scenarios, machine learning architectures can play an important role, since they directly reconstruct data-driven models circumventing the specific modelling choices and the parameter calibration, typical of classical compartmental models. In this work, we discuss the efficacy of Kernel Operator Learning (KOL) to reconstruct population dynamics during epidemic outbreaks, where the transmission rate is ruled by an input strategy. In particular, we introduce two surrogate models, named KOL-m and KOL-$\partial$, which reconstruct in two different ways the evolution of the epidemics. Moreover, we evaluate the generalization performances of the two approaches with different kernels, including the Neural Tangent Kernels, and compare them with a classical neural network model learning method. Employing synthetic but semi-realistic data, we show how the two introduced approaches are suitable for realizing fast and robust forecasts and scenario analyses, and how these approaches are competitive for determining optimal intervention strategies with respect to specific performance measures.
