PDE foundation models are skillful AI weather emulators for the Martian atmosphere
Johannes Schmude, Sujit Roy, Liping Wang, Theodore van Kessel, Levente Klein, Marcus Freitag, Eloisa Bentivegna, Robert Manson-Sawko, Bjorn Lutjens, Manil Maskey, Campbell Watson, Rahul Ramachandran, Juan Bernabe-Moreno
TL;DR
This work demonstrates that PDE foundation models pretrained on a diverse set of PDE solutions can be adapted to weather emulation for the Martian atmosphere, addressing data scarcity typical of planetary contexts. By extending a 2D PDE-FM (Poseidon) to 3D via axial vertical attention and preserving pretraining, the authors achieve a 34.4% improvement on a held-out Martian year, even with sparse initial conditions and modest compute (~13 GPU hours). The OpenMARS dataset provides Martian atmospheric fields used for training and evaluation, and the approach shows robust performance under data sparsity and across vertical levels, highlighting data-efficient pathways for real-world, data-limited atmospheric modeling. The results point to broader applicability of PDE-FMs to Earth regional weather in data-poor regimes and suggest architectural directions (e.g., multi-scale graph models) to further enhance performance when forcings and curved geometries are considered.
Abstract
We show that AI foundation models that are pretrained on numerical solutions to a diverse corpus of partial differential equations can be adapted and fine-tuned to obtain skillful predictive weather emulators for the Martian atmosphere. We base our work on the Poseidon PDE foundation model for two-dimensional systems. We develop a method to extend Poseidon from two to three dimensions while keeping the pretraining information. Moreover, we investigate the performance of the model in the presence of sparse initial conditions. Our results make use of four Martian years (approx.~34 GB) of training data and a median compute budget of 13 GPU hours. We find that the combination of pretraining and model extension yields a performance increase of 34.4\% on a held-out year. This shows that PDEs-FMs can not only approximate solutions to (other) PDEs but also anchor models for real-world problems with complex interactions that lack a sufficient amount of training data or a suitable compute budget.
