Artificial Intelligence for Infectious Disease Prediction and Prevention: A Comprehensive Review
Selestine Melchane, Youssef Elmir, Farid Kacimi, Larbi Boubchir
TL;DR
This review maps AI-driven approaches for infectious disease prediction and prevention across three data-driven objectives: using Public Health Data to forecast regional spread, using Patients' Medical Data to detect infection, and combining both data streams to estimate population-level dynamics. It catalogs a spectrum of methods—Naive Bayes, clustering, LSTM, Transformer, SSL, CNN, and Transformer-based models—applied to numerical time-series, textual, image, and clinical data, highlighting performance trends and data-specific challenges. Key contributions include identifying when specific data types (e.g., geospatial images or routine blood tests) yield strongest signals, and illustrating how hybrid models (e.g., T-SIRGAN, TEMPO, MOAU) improve long-range forecasting and mutation prediction. The paper emphasizes practical implications, such as the need for real-time data fusion, privacy-preserving analyses, and careful handling of data biases to realize AI’s potential in outbreak forecasting and control.
Abstract
Artificial Intelligence (AI) and infectious diseases prediction have recently experienced a common development and advancement. Machine learning (ML) apparition, along with deep learning (DL) emergence, extended many approaches against diseases apparition and their spread. And despite their outstanding results in predicting infectious diseases, conflicts appeared regarding the types of data used and how they can be studied, analyzed, and exploited using various emerging methods. This has led to some ongoing discussions in the field. This research aims not only to provide an overview of what has been accomplished, but also to highlight the difficulties related to the types of data used, and the learning methods applied for each research objective. It categorizes these contributions into three areas: predictions using Public Health Data to prevent the spread of a transmissible disease within a region; predictions using Patients' Medical Data to detect whether a person is infected by a transmissible disease; and predictions using both Public and patient medical data to estimate the extent of disease spread in a population. The paper also critically assesses the potential of AI and outlines its limitations in infectious disease management.
