Data-Driven Probabilistic Air-Sea Flux Parameterization
Jiarong Wu, Pavel Perezhogin, David John Gagne, Brandon Reichl, Aneesh C. Subramanian, Elizabeth Thompson, Laure Zanna
TL;DR
The paper develops a probabilistic framework for air-sea flux parameterization by modeling each flux component as a conditional Gaussian $y \sim \mathcal{N}(\mu(\mathbf{X}),\sigma^{2}(\mathbf{X}))$ and learning $\mu$ and $\sigma$ with separate neural networks trained on eddy-covariance data. The mean flux from the ANN is comparable to established bulk algorithms (e.g., COARE) while the predicted uncertainty $\sigma(\mathbf{X})$ enables stochastic sampling for ensemble simulations. Evaluations show that latent heat flux benefits most from the data-driven mean estimation, with regional scores varying by geography; the framework also reveals nontrivial structure in how $\mu_{Q_S}$ depends on inputs, including nonzero flux when $T_a-T_o=0$. Tests in GOTM demonstrate seasonally varying SST and MLD responses to flux changes, with the largest stochastic spread during spring restratification, highlighting the practical importance of incorporating flux variability into coupled models.
Abstract
Accurately quantifying air-sea fluxes is important for understanding air-sea interactions and improving coupled weather and climate systems. This study introduces a probabilistic framework to represent the highly variable nature of air-sea fluxes, which is missing in deterministic bulk algorithms. Assuming Gaussian distributions conditioned on the input variables, we use artificial neural networks and eddy-covariance measurement data to estimate the mean and variance by minimizing negative log-likelihood loss. The trained neural networks provide alternative mean flux estimates to existing bulk algorithms, and quantify the uncertainty around the mean estimates. Stochastic parameterization of air-sea turbulent fluxes can be constructed by sampling from the predicted distributions. Tests in a single-column forced upper-ocean model suggest that changes in flux algorithms influence sea surface temperature and mixed layer depth seasonally. The ensemble spread in stochastic runs is most pronounced during spring restratification.
