DiffObs: Generative Diffusion for Global Forecasting of Satellite Observations
Jason Stock, Jaideep Pathak, Yair Cohen, Mike Pritchard, Piyush Garg, Dale Durran, Morteza Mardani, Noah Brenowitz
TL;DR
DiffObs addresses the challenge of globally forecasting daily precipitation by training an autoregressive diffusion model on satellite-derived data to produce probabilistic day-ahead forecasts and stable multi-month rollouts. The method estimates $p(\mathbf{x}_{t} | \mathbf{x}_{t-1})$ without priors, using a conditioned diffusion process built on an EDM-inspired architecture. Results show DiffObs reproduces key tropical variability features, including the Madden–Julian Oscillation and convectively coupled Kelvin waves, as evidenced by Hovmöller analyses and Wheeler–Kiladis spectra, though artifacts such as dateline discontinuities and excess variance remain. The work demonstrates the potential of diffusion models trained on increasingly sparse and differentiated observations for subseasonal and climate prediction, while outlining future avenues to improve temporal distributions and cross-validated comparisons with reanalysis data.
Abstract
This work presents an autoregressive generative diffusion model (DiffObs) to predict the global evolution of daily precipitation, trained on a satellite observational product, and assessed with domain-specific diagnostics. The model is trained to probabilistically forecast day-ahead precipitation. Nonetheless, it is stable for multi-month rollouts, which reveal a qualitatively realistic superposition of convectively coupled wave modes in the tropics. Cross-spectral analysis confirms successful generation of low frequency variations associated with the Madden--Julian oscillation, which regulates most subseasonal to seasonal predictability in the observed atmosphere, and convectively coupled moist Kelvin waves with approximately correct dispersion relationships. Despite secondary issues and biases, the results affirm the potential for a next generation of global diffusion models trained on increasingly sparse, and increasingly direct and differentiated observations of the world, for practical applications in subseasonal and climate prediction.
