Self-Supervised Pre-Training for Precipitation Post-Processor
Sojung An, Junha Lee, Jiyeon Jang, Inchae Na, Wooyeon Park, Sujeong You
TL;DR
This work addresses the challenge of extending forecast lead times for local precipitation, especially for heavy rainfall, under climate-driven uncertainty. It introduces a self-supervised pre-training scheme on masked 3D atmospheric variables to learn physics-informed latent representations, followed by transfer learning to a precipitation segmentation task. A continuous labeling strategy is proposed to mitigate extreme class imbalance by smoothing probabilistic targets rather than using one-hot labels. Empirical results on regional NWP post-processing show improved heavy-rain detection and localization, outperforming baselines such as Metnet and RDAPS, with practical implications for more reliable short-term precipitation forecasts.
Abstract
Obtaining a sufficient forecast lead time for local precipitation is essential in preventing hazardous weather events. Global warming-induced climate change increases the challenge of accurately predicting severe precipitation events, such as heavy rainfall. In this paper, we propose a deep learning-based precipitation post-processor for numerical weather prediction (NWP) models. The precipitation post-processor consists of (i) employing self-supervised pre-training, where the parameters of the encoder are pre-trained on the reconstruction of the masked variables of the atmospheric physics domain; and (ii) conducting transfer learning on precipitation segmentation tasks (the target domain) from the pre-trained encoder. In addition, we introduced a heuristic labeling approach to effectively train class-imbalanced datasets. Our experiments on precipitation correction for regional NWP show that the proposed method outperforms other approaches.
