3D Cloud reconstruction through geospatially-aware Masked Autoencoders
Stella Girtsou, Emiliano Diaz Salas-Porras, Lilli Freischem, Joppe Massant, Kyriaki-Margarita Bintsi, Guiseppe Castiglione, William Jones, Michael Eisinger, Emmanuel Johnson, Anna Jungbluth
TL;DR
The paper addresses the need for real-time 3D cloud data to reduce uncertainties in climate models by reconstructing 3D cloud structures from geostationary MSG/SEVIRI imagery. It proposes self-supervised pretraining using Masked Autoencoders (MAE) on unlabeled data and a geospatial SatMAE that encodes time and location, followed by fine-tuning on ~47k image–profile pairs aligned with CloudSat CPR radar profiles to produce 3D cloud volumes of size $90×256×256$. SatMAE with time and coordinate encodings yields the best RMSE, PSNR, and SSIM, especially in the tropical convection belt, outperforming a supervised U-Net baseline. This framework enables higher-fidelity, near-real-time 3D cloud products and can be extended to ESA EarthCARE data for long-term climate-relevant cloud datasets.
Abstract
Clouds play a key role in Earth's radiation balance with complex effects that introduce large uncertainties into climate models. Real-time 3D cloud data is essential for improving climate predictions. This study leverages geostationary imagery from MSG/SEVIRI and radar reflectivity measurements of cloud profiles from CloudSat/CPR to reconstruct 3D cloud structures. We first apply self-supervised learning (SSL) methods-Masked Autoencoders (MAE) and geospatially-aware SatMAE on unlabelled MSG images, and then fine-tune our models on matched image-profile pairs. Our approach outperforms state-of-the-art methods like U-Nets, and our geospatial encoding further improves prediction results, demonstrating the potential of SSL for cloud reconstruction.
