Static and auto-regressive neural emulation of phytoplankton biomass dynamics from physical predictors in the global ocean
Mahima Lakra, Ronan Fablet, Lucas Drumetz, Etienne Pauthenet, Elodie Martinez
TL;DR
This study addresses the challenge of modeling global phytoplankton biomass dynamics by leveraging physical ocean predictors and deep learning. It systematically compares static and auto-regressive neural emulators (CNN, ConvLSTM, AFNO/4CastNet, and UNet) using OC-CCI Chlorophyll-a satellite data, with eight supporting physical drivers, and evaluates performance via EOF-based diagnostics and standard metrics. The UNet architecture emerges as the strongest static emulator, while auto-regressive UNet variants extend forecast skill up to about six months, highlighting a Lyapunov-time–scale limit in Chl dynamics; however, low-frequency amplitude remains challenging to capture. Overall, the work demonstrates that physics-informed neural emulators can reconstruct and forecast phytoplankton dynamics, offering a data-driven tool to monitor ocean health and support ecosystem management in a changing climate, with direct applicability to long-term satellite data fusion and short-term forecasting.
Abstract
Phytoplankton is the basis of marine food webs, driving both ecological processes and global biogeochemical cycles. Despite their ecological and climatic significance, accurately simulating phytoplankton dynamics remains a major challenge for biogeochemical numerical models due to limited parameterizations, sparse observational data, and the complexity of oceanic processes. Here, we explore how deep learning models can be used to address these limitations predicting the spatio-temporal distribution of phytoplankton biomass in the global ocean based on satellite observations and environmental conditions. First, we investigate several deep learning architectures. Among the tested models, the UNet architecture stands out for its ability to reproduce the seasonal and interannual patterns of phytoplankton biomass more accurately than other models like CNNs, ConvLSTM, and 4CastNet. When using one to two months of environmental data as input, UNet performs better, although it tends to underestimate the amplitude of low-frequency changes in phytoplankton biomass. Thus, to improve predictions over time, an auto-regressive version of UNet was also tested, where the model uses its own previous predictions to forecast future conditions. This approach works well for short-term forecasts (up to five months), though its performance decreases for longer time scales. Overall, our study shows that combining ocean physical predictors with deep learning allows for reconstruction and short-term prediction of phytoplankton dynamics. These models could become powerful tools for monitoring ocean health and supporting marine ecosystem management, especially in the context of climate change.
