Multidata Causal Discovery for Statistical Hurricane Intensity Forecasting

Saranya Ganesh S.; Frederick Iat-Hin Tam; Milton S. Gomez; Marie McGraw; Mark DeMaria; Kate Musgrave; Jakob Runge; Tom Beucler

TL;DR

This study addresses the difficulty of predicting Atlantic hurricane intensity by applying multidata causal discovery to identify predictors with a direct causal influence on intensity change. The authors replicate the SHIPS predictors using ERA5/TC PRIMED data and compare causal feature selection against correlation and random-forest baselines, finding better generalization at short lead times. They extend SHIPS with six causally selected predictors (SHIPS+) and show that nonlinear modeling with multilayer perceptrons (MLPs) further improves skill, especially beyond 72 hours. A Hurricane Larry case study and tests in an operational-like SHIPS setting indicate that SHIPS+ with nonlinear modeling improves forecasts and aids interpretability by focusing on physically meaningful drivers. The work points toward more empirical, causally grounded hurricane intensity forecasts that generalize better to unseen storms.

Abstract

Improvements in statistical forecasts of Atlantic hurricane intensity are limited by complex nonlinear interactions and the difficulty of identifying relevant predictors. Conventional methods prioritize correlation or fit, often overlooking confounding variables and limiting generalizability to unseen tropical storms. To address this, we leverage a multidata causal discovery framework with a replicated dataset based on the Statistical Hurricane Intensity Prediction Scheme (SHIPS) using ERA5 meteorological reanalysis. We conduct multiple experiments to identify and select predictors causally linked to hurricane intensity changes. We train multiple linear regression models to compare causal feature selection with no selection, correlation, and random forest feature importance across five forecast lead times from 1 to 5 days (24 to 120 hours). Causal feature selection consistently outperforms the other approaches on unseen test cases, especially for lead times shorter than 3 days. The causal features primarily include vertical shear, mid-tropospheric potential vorticity, and surface moisture conditions, which are physically significant yet often underutilized in hurricane intensity predictions. Further, we build an extended predictor set (SHIPS+) by adding selected features to the standard SHIPS predictors. SHIPS+ yields increased short-term predictive skill at lead times of 24, 48, and 72 hours. Adding nonlinearity using a multilayer perceptron further extends skill to longer lead times, despite our framework being purely regional and not requiring global forecast data. Operational SHIPS tests confirm that three of the six added causally discovered predictors improve forecasts, with the largest gains at longer lead times. Our results demonstrate that causal discovery improves hurricane intensity prediction and pave the way toward more empirical forecasts.
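
The abstract's point that correlation-based selection can overlook confounding can be illustrated with a small toy example. The sketch below is not the authors' multidata causal discovery framework; it is a minimal illustration on synthetic data, assuming a single hypothetical common driver (`driver`), a hypothetical proxy variable that merely co-varies with it (`proxy`), and a hypothetical target (`target`). It uses partial correlation as a simple linear conditional-independence check: the proxy correlates strongly with the target, but that association largely vanishes once the driver is controlled for, whereas the driver's link to the target persists.

```python
import math
import random


def pearson(a, b):
    """Pearson correlation of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def partial_corr(x, y, z):
    """Correlation of x and y after linearly controlling for one variable z."""
    r_xy, r_xz, r_yz = pearson(x, y), pearson(x, z), pearson(y, z)
    return (r_xy - r_xz * r_yz) / math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Synthetic data (assumption): 'driver' causes both 'proxy' and 'target';
# 'proxy' has no direct effect on 'target'.
rng = random.Random(0)
n = 2000
driver = [rng.gauss(0, 1) for _ in range(n)]
proxy = [d + rng.gauss(0, 1) for d in driver]
target = [-d + rng.gauss(0, 1) for d in driver]

raw = pearson(proxy, target)                     # looks like a useful predictor
controlled = partial_corr(proxy, target, driver)  # association given the driver
direct = partial_corr(driver, target, proxy)      # driver's link given the proxy

print(f"corr(proxy, target)            = {raw:+.2f}")
print(f"corr(proxy, target | driver)   = {controlled:+.2f}")
print(f"corr(driver, target | proxy)   = {direct:+.2f}")
```

A correlation-ranked selection would keep `proxy`, while a selection based on conditional independence would discard it in favor of `driver`. Real causal discovery on time series involves lagged variables, multiple conditioning sets, and significance testing, none of which this toy example attempts.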
