Forecasting precipitation in the Arctic using probabilistic machine learning informed by causal climate drivers
Madhurima Panja, Dhiman Das, Tanujit Chakraborty, Arnob Ray, R. Athulya, Chittaranjan Hens, Syamal K. Dana, Nuncio Murukesh, Dibakar Ghosh
TL;DR
The paper tackles Arctic precipitation forecasting under data scarcity and strong multivariate drivers by fusing scale-aware causal analysis with probabilistic machine learning. It combines wavelet coherence to characterize driver–precipitation dependencies with a Synergistic-Unique-Redundant Decomposition (SURD) to quantify multivariate information transfer, and then trains exogenous, boosting-based models (notably XGBoostX) with conformal prediction to yield calibrated forecast intervals. The study shows that synergistic multivariate interactions dominate the predictive information, and that XGBoostX with exogenous climatic drivers provides the most accurate and stable forecasts for Bear Island and Ny-Ålesund, accompanied by reliable uncertainty quantification. This framework improves interpretability and operational usefulness for Arctic early warning and risk management, while highlighting areas for future enhancement, including tail modeling via Extreme Value Theory and incorporating spatial dependencies.
Abstract
Understanding and forecasting precipitation events in the Arctic maritime environments, such as Bear Island and Ny-Ålesund, is crucial for assessing climate risk and developing early warning systems in vulnerable marine regions. This study proposes a probabilistic machine learning framework for modeling and predicting the dynamics and severity of precipitation. We begin by analyzing the scale-dependent relationships between precipitation and key atmospheric drivers (e.g., temperature, relative humidity, cloud cover, and air pressure) using wavelet coherence, which captures localized dependencies across time and frequency domains. To assess joint causal influences, we employ Synergistic-Unique-Redundant Decomposition, which quantifies the impact of interaction effects among each variable on future precipitation dynamics. These insights inform the development of data-driven forecasting models that incorporate both historical precipitation and causal climate drivers. To account for uncertainty, we employ the conformal prediction method, which enables the generation of calibrated non-parametric prediction intervals. Our results underscore the importance of utilizing a comprehensive framework that combines causal analysis with probabilistic forecasting to enhance the reliability and interpretability of precipitation predictions in Arctic marine environments.
