PoPStat-COVID19: Leveraging Population Pyramids to Quantify Demographic Vulnerability to COVID-19
Buddhi Wijenayake, Athulya Ratnayake, Lelumi Edirisinghe, Uditha Wijeratne, Tharaka Fonseka, Roshan Godaliyadda, Samath Dharmaratne, Parakrama Ekanayake, Vijitha Herath, Insoha Alwis, Supun Manathunga
TL;DR
This study addresses how age-structure shapes COVID-19 burden and critiques single-number summaries like median age as insufficient. It extends the PoPStat framework to COVID-19 by computing PoPDivergence between each country’s age–sex pyramid and an optimized reference pyramid, yielding PoPStat-COVID19 as the Pearson correlation with log-transformed outcomes; the reference is selected via correlation optimization, with Malta repeatedly identified as the optimal old-skewed benchmark. Using 2019 UN World Population Prospects data and cumulative COVID-19 cases and deaths per million up to 5 May 2023, the authors report strong negative associations: PoPStat-COVID19_cases $r = -0.860$ (p < 0.001, $R^2 = 0.74$) and PoPStat-COVID19_deaths $r = -0.821$ (p < 0.001, $R^2 = 0.67$), robust to the choice of reference across 20 alternate pyramids. Benchmarking against conventional indicators shows PoPStat-COVID19 matches or surpasses predictors like GDP per capita, HDI, life expectancy, and SDI, particularly excelling for fatal burden where it outperforms all comparators. Overall, PoPStat-COVID19 provides a distribution-aware, cross-national scalar of demographic vulnerability to COVID-19 with practical utility for pandemic preparedness and application to future age-dependent threats, including influenza or RSV, using full population pyramids rather than single-number summaries.
Abstract
Understanding how population age structure shapes COVID-19 burden is crucial for pandemic preparedness, yet common summary measures such as median age ignore key distributional features like skewness, bimodality, and the proportional weight of high-risk cohorts. We extend the PoPStat framework, originally devised to link entire population pyramids with cause-specific mortality by applying it to COVID-19. Using 2019 United Nations World Population Prospects age-sex distributions together with cumulative cases and deaths per million recorded up to 5 May 2023 by Our World in Data, we calculate PoPDivergence (the Kullback-Leibler divergence from an optimised reference pyramid) for 180+ countries and derive PoPStat-COVID19 as the Pearson correlation between that divergence and log-transformed incidence or mortality. Optimisation selects Malta's old-skewed pyramid as the reference, yielding strong negative correlations for cases (r=-0.86, p<0.001, R^2=0.74) and deaths (r=-0.82, p<0.001, R^2=0.67). Sensitivity tests across twenty additional, similarly old-skewed references confirm that these associations are robust to reference choice. Benchmarking against eight standard indicators like gross domestic product per capita, Gini index, Human Development Index, life expectancy at birth, median age, population density, Socio-demographic Index, and Universal Health Coverage Index shows that PoPStat-COVID19 surpasses GDP per capita, median age, population density, and several other traditional measures, and outperforms every comparator for fatality burden. PoPStat-COVID19 therefore provides a concise, distribution-aware scalar for quantifying demographic vulnerability to COVID-19.
