Table of Contents
Fetching ...

Interpretable Machine Learning for Weather and Climate Prediction: A Survey

Ruyi Yang, Jingyu Hu, Zihao Li, Jianli Mu, Tingzhao Yu, Jiangjiang Xia, Xuhong Li, Aritra Dasgupta, Haoyi Xiong

TL;DR

This survey addresses the need for interpretable machine learning in weather and climate prediction by contrasting black-box models with explanations that improve trust, debugging, and scientific insight. It categorizes explainability into post-hoc and intrinsically interpretable models, reviewing methods such as SHAP, LRP, Grad-CAM, and LIME, and discusses self-explaining architectures like linear, tree-based, and attention-based models. Key contributions include mapping explainability techniques to meteorological tasks, identifying gaps in mechanistic interpretability and evaluation benchmarks, and outlining how explanations can guide physics-informed improvements. The work underscores the practical impact of explanations for trust, scientific discovery, and the integration of data-driven methods with physical principles, especially in the context of large foundation models for weather forecasting.

Abstract

Advanced machine learning models have recently achieved high predictive accuracy for weather and climate prediction. However, these complex models often lack inherent transparency and interpretability, acting as "black boxes" that impede user trust and hinder further model improvements. As such, interpretable machine learning techniques have become crucial in enhancing the credibility and utility of weather and climate modeling. In this survey, we review current interpretable machine learning approaches applied to meteorological predictions. We categorize methods into two major paradigms: 1) Post-hoc interpretability techniques that explain pre-trained models, such as perturbation-based, game theory based, and gradient-based attribution methods. 2) Designing inherently interpretable models from scratch using architectures like tree ensembles and explainable neural networks. We summarize how each technique provides insights into the predictions, uncovering novel meteorological relationships captured by machine learning. Lastly, we discuss research challenges around achieving deeper mechanistic interpretations aligned with physical principles, developing standardized evaluation benchmarks, integrating interpretability into iterative model development workflows, and providing explainability for large foundation models.

Interpretable Machine Learning for Weather and Climate Prediction: A Survey

TL;DR

This survey addresses the need for interpretable machine learning in weather and climate prediction by contrasting black-box models with explanations that improve trust, debugging, and scientific insight. It categorizes explainability into post-hoc and intrinsically interpretable models, reviewing methods such as SHAP, LRP, Grad-CAM, and LIME, and discusses self-explaining architectures like linear, tree-based, and attention-based models. Key contributions include mapping explainability techniques to meteorological tasks, identifying gaps in mechanistic interpretability and evaluation benchmarks, and outlining how explanations can guide physics-informed improvements. The work underscores the practical impact of explanations for trust, scientific discovery, and the integration of data-driven methods with physical principles, especially in the context of large foundation models for weather forecasting.

Abstract

Advanced machine learning models have recently achieved high predictive accuracy for weather and climate prediction. However, these complex models often lack inherent transparency and interpretability, acting as "black boxes" that impede user trust and hinder further model improvements. As such, interpretable machine learning techniques have become crucial in enhancing the credibility and utility of weather and climate modeling. In this survey, we review current interpretable machine learning approaches applied to meteorological predictions. We categorize methods into two major paradigms: 1) Post-hoc interpretability techniques that explain pre-trained models, such as perturbation-based, game theory based, and gradient-based attribution methods. 2) Designing inherently interpretable models from scratch using architectures like tree ensembles and explainable neural networks. We summarize how each technique provides insights into the predictions, uncovering novel meteorological relationships captured by machine learning. Lastly, we discuss research challenges around achieving deeper mechanistic interpretations aligned with physical principles, developing standardized evaluation benchmarks, integrating interpretability into iterative model development workflows, and providing explainability for large foundation models.
Paper Structure (30 sections, 5 figures, 3 tables)

This paper contains 30 sections, 5 figures, 3 tables.

Figures (5)

  • Figure 1: Explainable machine learning in meteorological prediction and the structure of the survey
  • Figure 2: Three major families of post-hoc explanation methods in weather and climate prediction: (a) Perturbation-based explanation, (b) Game Theory-based explanation (Referred & modified from SHAP lundberg2017unified), (c) Gradient-based explanation.
  • Figure 3: The feature attribution results of different input features in MLP prediction models using LIME and SHAP interpretation methods. The MLP model is used to forecast the weather temperature task. The data comes from the weather data of Lincoln, Nebraska, which is provided by Weather Undergroun and covers 997 days since January 4, 2015 (Here, MLP model and data are both from https://github.com/Rite188/Simple-DNN-on-weather-forcast).
  • Figure 4: Visualization comparison between three explainability methods. Here $\text{LRP}_{comp}$ and $\text{LRP}_{z}$ use two different rules of backpropagation. The figure is adapted from Bommer et al. bommer2023finding.
  • Figure 5: An example of using explainability to guide iterative physical consistency improvement of a machine learning model (DNN model) until no faulty corrections (red boundaries) are detected.