DPSformer: A long-tail-aware model for improving heavy rainfall prediction
Zenghui Huang, Ting Shu, Zhonglei Wang, Yang Lu, Yan Yan, Wei Zhong, Hanzi Wang
TL;DR
DPSformer reframes heavy rainfall forecasting as a long-tailed learning problem and introduces a dual-path Transformer architecture with a high-resolution spatial branch to boost representation of rare, high-impact events. By adaptively fusing global backbone features with localized high-resolution cues and employing a dual-loss training scheme, the model significantly improves heavy rainfall prediction, elevating CSI from $0.012$ to $0.067$ for events $\geq$ $50$ mm/6 h and achieving top-tail FSS above $0.4$ for the most extreme cases. The approach is validated with a tropical cyclone case (e.g., Tropical Storm Doksuri) and attribution analyses showing physically consistent feature usage (e.g., emphasis on upper-level moisture and deep lifting), while uncertainty is quantified via conformal prediction. This work offers a practical, long-tailed paradigm for operational heavy rainfall forecasting, with potential applicability to other rare, high-impact meteorological phenomena and integration with multi-source data for broader generalization.
Abstract
Accurate and timely forecasting of heavy rainfall remains a critical challenge for modern society. Precipitation exhibits a highly imbalanced distribution: most observations record no or light rain, while heavy rainfall events are rare. Such an imbalanced distribution obstructs deep learning models from effectively predicting heavy rainfall events. To address this challenge, we treat rainfall forecasting explicitly as a long-tailed learning problem, identifying the insufficient representation of heavy rainfall events as the primary barrier to forecasting accuracy. Therefore, we introduce DPSformer, a long-tail-aware model that enriches representation of heavy rainfall events through a high-resolution branch. For heavy rainfall events $ \geq $ 50 mm/6 h, DPSformer lifts the Critical Success Index (CSI) of a baseline Numerical Weather Prediction (NWP) model from 0.012 to 0.067. For the top 1% coverage of heavy rainfall events, its Fraction Skill Score (FSS) exceeds 0.45, surpassing existing methods. Our work establishes an effective long-tailed paradigm for heavy rainfall prediction, offering a practical tool to enhance early warning systems and mitigate the societal impacts of extreme weather events.
