Precipitation nowcasting of satellite data using physically-aligned neural networks
Antônio Catão, Melvin Poveda, Leonardo Voltarelli, Paulo Orenstein
TL;DR
This work tackles the challenge of sparse radar coverage for short-term precipitation nowcasting by introducing TUPANN, a satellite-only neural network with physically aligned structure. TUPANN decomposes forecasting into motion inference via a variational encoder–decoder supervised by optical flow, latent evolution through a lead-time conditioned MaxViT transformer, and a differentiable warp for advection-based reconstruction. Across GOES-16 RRQPE and IMERG datasets in four climates, it achieves state-of-the-art CSI and HSS, especially at higher rainfall thresholds, and demonstrates robust cross-city and multi-city transferability with near real-time latency. The model provides interpretable motion fields and operational practicality, enabling radar-sparse regions to access skillful nowcasts, while future work will address uncertainty quantification and cross-platform generalization.
Abstract
Accurate short-term precipitation forecasts predominantly rely on dense weather-radar networks, limiting operational value in places most exposed to climate extremes. We present TUPANN (Transferable and Universal Physics-Aligned Nowcasting Network), a satellite-only model trained on GOES-16 RRQPE. Unlike most deep learning models for nowcasting, TUPANN decomposes the forecast into physically meaningful components: a variational encoder-decoder infers motion and intensity fields from recent imagery under optical-flow supervision, a lead-time-conditioned MaxViT evolves the latent state, and a differentiable advection operator reconstructs future frames. We evaluate TUPANN on both GOES-16 and IMERG data, in up to four distinct climates (Rio de Janeiro, Manaus, Miami, La Paz) at 10-180min lead times using the CSI and HSS metrics over 4-64 mm/h thresholds. Comparisons against optical-flow, deep learning and hybrid baselines show that TUPANN achieves the best or second-best skill in most settings, with pronounced gains at higher thresholds. Training on multiple cities further improves performance, while cross-city experiments show modest degradation and occasional gains for rare heavy-rain regimes. The model produces smooth, interpretable motion fields aligned with numerical optical flow and runs in near real time due to the low latency of GOES-16. These results indicate that physically aligned learning can provide nowcasts that are skillful, transferable and global.
