Simultaneous emulation and downscaling with physically-consistent deep learning-based regional ocean emulators
Leonard Lupin-Jimenez, Moein Darman, Subhashis Hazarika, Tianning Wu, Michael Gray, Ruyoing He, Anthony Wong, Ashesh Chattopadhyay
TL;DR
The paper addresses the challenge of stable, long-term regional ocean emulation by combining a data-driven autoregressive forecast with a physics-constrained downscaling step for the GoM region. It uses a Fourier Neural Operator (FNO2D) forecast trained on low-resolution GLORYS data at $8$ km, paired with two downscaling architectures (UNET and a Variational Autoencoder with PatchGAN) to produce high-resolution outputs at $4$ km, guided by a loss that blends grid-space and spectral information with $\lambda=0.2$. A key contribution is the online fine-tuning of the downscaling models to correct drift and bias introduced by model error and resolution differences, enabling accurate long-term statistics and physically plausible spectra over decadal time scales. The results demonstrate robust short-term skill, faithful spectral properties, and stable long-term means and variability, all while offering substantial computational speedups relative to physics-based models. The framework represents a practical path toward fast, large-scale regional ocean emulation with quantitative uncertainty-relevant metrics suitable for rapid analyses and scenario exploration.
Abstract
Building on top of the success in AI-based atmospheric emulation, we propose an AI-based ocean emulation and downscaling framework focusing on the high-resolution regional ocean over Gulf of Mexico. Regional ocean emulation presents unique challenges owing to the complex bathymetry and lateral boundary conditions as well as from fundamental biases in deep learning-based frameworks, such as instability and hallucinations. In this paper, we develop a deep learning-based framework to autoregressively integrate ocean-surface variables over the Gulf of Mexico at $8$ Km spatial resolution without unphysical drifts over decadal time scales and simulataneously downscale and bias-correct it to $4$ Km resolution using a physics-constrained generative model. The framework shows both short-term skills as well as accurate long-term statistics in terms of mean and variability.
