Evaluation of Country Dietary Habits Using Machine Learning Techniques in Relation to Deaths from COVID-19
María Teresa García-Ordás, Natalia Arias, Carmen Benavides, Oscar García-Olalla, José Alberto Benítez-Andrades
TL;DR
The paper investigates whether country-level dietary patterns influence COVID-19 mortality by integrating 94 dietary features across 170 countries. A two-step ML pipeline uses PCA to reduce features to 23 components explaining $95\%$ of variance, followed by K-Means clustering and Davies–Bouldin validation to identify 20 clusters. Results show high-mortality clusters correlate with greater intake of animal products, fats, and higher total caloric consumption ($3277.5$ Kcal/day on average) compared to lower-mortality clusters ($2764.3$ Kcal/day), with obesity emerging as a risk factor and cereals linked to lower risk. The findings suggest dietary patterns may inform risk profiling and public health strategies, though future work should incorporate race, age, and socioeconomic factors to enhance clustering reliability.
Abstract
COVID-19 disease has affected almost every country in the world. The large number of infected people and the different mortality rates between countries has given rise to many hypotheses about the key points that make the virus so lethal in some places. In this study, the eating habits of 170 countries were evaluated in order to find correlations between these habits and mortality rates caused by COVID-19 using machine learning techniques that group the countries together according to the different distribution of fat, energy, and protein across 23 different types of food, as well as the amount ingested in kilograms. Results shown how obesity and the high consumption of fats appear in countries with the highest death rates, whereas countries with a lower rate have a higher level of cereal consumption accompanied by a lower total average intake of kilocalories.
