Oya: Deep Learning for Accurate Global Precipitation Estimation
Emmanuel Asiedu Brempong, Mohammed Alewi Hassen, MohamedElfatih MohamedKhair, Vusumuzi Dube, Santiago Hincapie Potes, Olivia Graham, Amanie Brik, Amy McGovern, George J. Huffman, Jason Hickey
TL;DR
The paper tackles the global need for accurate, timely precipitation estimates in data-sparse regions by introducing Oya, a two-stage UNet framework that leverages the full VIS–IR spectrum from geostationary satellites. By training against high-quality CORRA v07 data and pretraining on IMERG Final, the approach effectively mitigates extreme class imbalance and achieves quasi-global coverage across multiple GEO satellites. Results show that Oya outperforms GEO-only baselines and approaches the accuracy of real-time IMERG Early, while remaining competitive with the research-grade IMERG Final, particularly for light to moderate precipitation. Ablations confirm the value of using all GEO channels, data augmentation, patch context, and LDS losses, and the authors provide a publicly available quasi-global precipitation dataset generated from the models.
Abstract
Accurate precipitation estimation is critical for hydrological applications, especially in the Global South where ground-based observation networks are sparse and forecasting skill is limited. Existing satellite-based precipitation products often rely on the longwave infrared channel alone or are calibrated with data that can introduce significant errors, particularly at sub-daily timescales. This study introduces Oya, a novel real-time precipitation retrieval algorithm utilizing the full spectrum of visible and infrared (VIS-IR) observations from geostationary (GEO) satellites. Oya employs a two-stage deep learning approach, combining two U-Net models: one for precipitation detection and another for quantitative precipitation estimation (QPE), to address the inherent data imbalance between rain and no-rain events. The models are trained using high-resolution GPM Combined Radar-Radiometer Algorithm (CORRA) v07 data as ground truth and pre-trained on IMERG-Final retrievals to enhance robustness and mitigate overfitting due to the limited temporal sampling of CORRA. By leveraging multiple GEO satellites, Oya achieves quasi-global coverage and demonstrates superior performance compared to existing competitive regional and global precipitation baselines, offering a promising pathway to improved precipitation monitoring and forecasting.
