A two-parameter, minimal-data model to predict dengue cases: the 2022-2023 outbreak in Florida, USA
Saman Hosseini, Lee W. Cohnstaedt, Caterina Scoglio
TL;DR
This study introduces a data-parsimonious incidence-vs-cumulative (ICC) framework for dengue forecasting by extending ICC from a basic SIR to a two-population SEIR model and pairing it with a Bayesian uncertainty mechanism. The method relies only on the current season's incidence data, achieving competitive predictive performance with substantially lower data and computational demands than traditional models. Applied to Florida's 2022–2023 outbreaks, it identifies four distinct outbreak seasons and demonstrates robust short-horizon forecasts, with Bayesian predictions providing calibrated uncertainty intervals and often improved accuracy over a censored-Poisson baseline. The approach offers practical value for dengue early detection in data-limited settings and newly affected regions, where extensive entomological or historical data are unavailable.
Abstract
Reliable and timely dengue predictions provide actionable lead time for targeted vector control and clinical preparedness, reducing preventable diseases and health-system costs in at-risk communities. Dengue forecasting often relies on site-specific covariates and entomological data, limiting generalizability in data-sparse settings. We propose a data-parsimonious (DP) framework based on the incidence versus cumulative cases (ICC) curve, extending it from a basic SIR to a two-population SEIR model for dengue. Our DP model uses only the target season's incidence time series and estimates only two parameters, reducing noise and computational burden. A Bayesian extension quantifies the case reporting and fitting uncertainty to produce calibrated predictive intervals. We evaluated the performance of the DP model in the 2022-2023 outbreaks in Florida, where standardized clinical tests and reporting support accurate case determination. The DP framework demonstrates competitive predictive performance at substantially lower computational cost than more elaborate models, making it suitable for dengue early detection where dense surveillance or long historical records are unavailable.
