Deep Learning and Foundation Models for Weather Prediction: A Survey
Jimeng Shi, Azam Shirali, Bowen Jin, Sizhe Zhou, Wei Hu, Rahuul Rangaraj, Shaowen Wang, Jiawei Han, Zhaonan Wang, Upmanu Lall, Yanzhao Wu, Leonardo Bobadilla, Giri Narasimhan
TL;DR
This survey assesses how deep learning and foundation models are transforming weather prediction by contrasting data-driven approaches with physics-based numerical models. It organizes methods into three training paradigms—deterministic predictive learning, probabilistic generative learning, and pre-training with fine-tuning—and examines backbones such as Transformers, Graph Neural Networks, and diffusion/GAN architectures, across general-purpose and domain-specific contexts. The paper catalogues applications (precipitation, air quality, SST, flood, drought, tropical cyclones, wildfire), datasets, and open-source resources, and outlines a roadmap emphasizing trustworthy AI, retrieval-augmented models, weather-constrained generation, multi-modal learning, and data management. Its key contribution lies in offering a comprehensive taxonomy, up-to-date syntheses of models and datasets, and a forward-looking agenda to bridge research with practical, reliable, and scalable weather prediction. The findings underscore the potential of diffusion and foundation-model approaches to provide calibrated uncertainty and transferability, while also highlighting challenges in data quality, resolution mismatch, and computational demands for large-scale pre-trained weather models.
Abstract
Physics-based numerical models have been the bedrock of atmospheric sciences for decades, offering robust solutions but often at the cost of significant computational resources. Deep learning (DL) models have emerged as powerful tools in meteorology, capable of analyzing complex weather and climate data by learning intricate dependencies and providing rapid predictions once trained. While these models demonstrate promising performance in weather prediction, often surpassing traditional physics-based methods, they still face critical challenges. This paper presents a comprehensive survey of recent deep learning and foundation models for weather prediction. We propose a taxonomy to classify existing models based on their training paradigms: deterministic predictive learning, probabilistic generative learning, and pre-training and fine-tuning. For each paradigm, we delve into the underlying model architectures, address major challenges, offer key insights, and propose targeted directions for future research. Furthermore, we explore real-world applications of these methods and provide a curated summary of open-source code repositories and widely used datasets, aiming to bridge research advancements with practical implementations while fostering open and trustworthy scientific practices in adopting cutting-edge artificial intelligence for weather prediction. The related sources are available at https://github.com/JimengShi/ DL-Foundation-Models-Weather.
