Modelling Infodemics on a Global Scale: A 30 Countries Study using Epidemiological and Social Listening Data
Edoardo Loru, Marco Delmastro, Francesco Gesualdo, Matteo Cinelli
TL;DR
The paper tackles the global infodemic problem by modeling how epidemic activity drives information production and demand signals across 30 countries. It uses a fixed-effects panel framework with sources from WHO, OxCGRT, WHO-EARS, and Google Trends to quantify the link between $C_{it}$, $D_{it}$, and $NewDocuments_{it}$, reporting an elasticity of about $0.16$ for deaths and notable foreign-burden effects. Results show that neighboring countries' epidemic burden can dominate domestic signals, and that the relationship evolves over time, being strongest in the early vaccine rollout period. The work enables cross-border infodemic monitoring, potential nowcasting, and informs integrated public health communication strategies.
Abstract
Infodemics are a threat to public health, arising from multiple interacting phenomena occurring both online and offline. The continuous feedback loops between the digital information ecosystem and offline contingencies make infodemics particularly challenging to define operationally, measure, and eventually model in quantitative terms. In this study, we present evidence of the effect of various epidemic-related variables on the dynamics of infodemics, using a robust modelling framework applied to data from 30 countries across diverse income groups. We use WHO COVID-19 surveillance data on new cases and deaths, vaccination data from the Oxford COVID-19 Government Response Tracker, infodemic data (volume of public conversations and social media content) from the WHO EARS platform, and Google Trends data to represent information demand. Our findings show that new deaths are the strongest predictor of the infodemic, measured as new document production including social media content and public conversations, and that the epidemic burden in neighbouring countries appears to have a greater impact on document production than the domestic one. Building on these results, we propose a taxonomy that highlights country-specific discrepancies between the evolution of the infodemic and the epidemic. Further, an analysis of the temporal evolution of the relationship between the two phenomena quantifies how much the discussions around vaccine rollouts may have shaped the development of the infodemic. The insights from our quantitative model contribute to advancing infodemic research, highlighting the importance of a holistic approach integrating both online and offline dimensions.
