Modular Deep-Learning-Based Early Warning System for Deadly Heatwave Prediction
Shangqing Xu, Zhiyuan Zhao, Megha Sharma, José María Martín-Olalla, Alexander Rodríguez, Gregory A. Wellenius, B. Aditya Prakash
TL;DR
DeepTherm tackles the lack of heat-related mortality data by introducing a modular dual-prediction framework that separately forecasts all-cause and baseline mortality to infer excess mortality and predict deadly heatwaves. The system combines a Transformer-based all-cause predictor, a Quasi-Poisson baseline predictor, and a decision module with adjustable thresholds, and it leverages near-future synoptic typing data to anticipate heatwave events. Evaluated across 12 Spanish provinces and their capitals (1995–2023), DeepTherm demonstrates robust, regionally diverse performance with strong L1 detection and meaningful L2 results, while showing stability across years and demographic groups. It outperforms baselines that do not utilize historical all-cause mortality data, and its modular design supports flexible deployment under data availability and decision-cost constraints.
Abstract
Severe heatwaves in urban areas significantly threaten public health, calling for establishing early warning strategies. Despite predicting occurrence of heatwaves and attributing historical mortality, predicting an incoming deadly heatwave remains a challenge due to the difficulty in defining and estimating heat-related mortality. Furthermore, establishing an early warning system imposes additional requirements, including data availability, spatial and temporal robustness, and decision costs. To address these challenges, we propose DeepTherm, a modular early warning system for deadly heatwave prediction without requiring heat-related mortality history. By highlighting the flexibility of deep learning, DeepTherm employs a dual-prediction pipeline, disentangling baseline mortality in the absence of heatwaves and other irregular events from all-cause mortality. We evaluated DeepTherm on real-world data across Spain. Results demonstrate consistent, robust, and accurate performance across diverse regions, time periods, and population groups while allowing trade-off between missed alarms and false alarms.
