Self-Supervised Learning with Probabilistic Density Labeling for Rainfall Probability Estimation
Junha Lee, Sojung An, Sujeong You, Namik Cho
TL;DR
This work tackles rainfall probability estimation by post-processing NWP forecasts with a self-supervised framework. It introduces SSLPDL, which leverages masked modeling and a deformable convolution encoder to learn variable dependencies, followed by transfer learning to precipitation segmentation. A key contribution is probabilistic density labeling, which smooths class probabilities near rainfall thresholds to mitigate heavy-rain imbalance. Experiments on the RDAPS dataset show SSLPDL improves spatiotemporal bias correction and extends forecast lead times, highlighting practical gains for regional rainfall prediction and extreme-event awareness.
Abstract
Numerical weather prediction (NWP) models are fundamental in meteorology for simulating and forecasting the behavior of various atmospheric variables. The accuracy of precipitation forecasts and the acquisition of sufficient lead time are crucial for preventing hazardous weather events. However, the performance of NWP models is limited by the nonlinear and unpredictable patterns of extreme weather phenomena driven by temporal dynamics. In this regard, we propose a \textbf{S}elf-\textbf{S}upervised \textbf{L}earning with \textbf{P}robabilistic \textbf{D}ensity \textbf{L}abeling (SSLPDL) for estimating rainfall probability by post-processing NWP forecasts. Our post-processing method uses self-supervised learning (SSL) with masked modeling for reconstructing atmospheric physics variables, enabling the model to learn the dependency between variables. The pre-trained encoder is then utilized in transfer learning to a precipitation segmentation task. Furthermore, we introduce a straightforward labeling approach based on probability density to address the class imbalance in extreme weather phenomena like heavy rain events. Experimental results show that SSLPDL surpasses other precipitation forecasting models in regional precipitation post-processing and demonstrates competitive performance in extending forecast lead times. Our code is available at https://github.com/joonha425/SSLPDL
