AIFL: A Global Daily Streamflow Forecasting Model Using Deterministic LSTM Pre-trained on ERA5-Land and Fine-tuned on IFS
Maria Luisa Taccari, Kenza Tazi, Oisín M. Morrison, Andreas Grafberger, Juan Colonese, Corentin Carton de Wiart, Christel Prudhomme, Cinzia Mazzetti, Matthew Chantry, Florian Pappenberger
TL;DR
AIFL addresses the reanalysis-to-forecast domain shift in global streamflow forecasting by training a deterministic LSTM on ERA5-Land data and then fine-tuning on IFS forecasts, using a CARAVAN-based global basin set. The two-stage transfer learning approach yields robust skill, achieving a median $KGE'$ of $0.66$ and NSE of $0.53$ on 2021–2024 tests across 2,003 basins, while maintaining near-perfect volume balance and zero-false-alarm flood detection (precision = 1.0) for 1.5–50 year events. In head-to-head benchmarking with Google’s global model, AIFL is competitive overall and outperforms at roughly 43% of shared stations, particularly in smaller basins where its single-stage deterministic pipeline remains stable. The work establishes a practical, reproducible baseline for global flood forecasting and highlights future directions toward probabilistic forecasts and multi-source forcing to enhance recall of extremes. $KGE'$ and $NSE$ metrics demonstrate statistically meaningful predictive skill under operational forcing, underscoring the model’s potential for real-world deployment.
Abstract
Reliable global streamflow forecasting is essential for flood preparedness and water resource management, yet data-driven models often suffer from a performance gap when transitioning from historical reanalysis to operational forecast products. This paper introduces AIFL (Artificial Intelligence for Floods), a deterministic LSTM-based model designed for global daily streamflow forecasting. Trained on 18,588 basins curated from the CARAVAN dataset, AIFL utilises a novel two-stage training strategy to bridge the reanalysis-to-forecast domain shift. The model is first pre-trained on 40 years of ERA5-Land reanalysis (1980-2019) to capture robust hydrological processes, then fine-tuned on operational Integrated Forecasting System (IFS) control forecasts (2016-2019) to adapt to the specific error structures and biases of operational numerical weather prediction. To our knowledge, this is the first global model trained end-to-end within the CARAVAN ecosystem. On an independent temporal test set (2021-2024), AIFL achieves high predictive skill with a median modified Kling-Gupta Efficiency (KGE') of 0.66 and a median Nash-Sutcliffe Efficiency (NSE) of 0.53. Benchmarking results show that AIFL is highly competitive with current state-of-the-art global systems, achieving comparable accuracy while maintaining a transparent and reproducible forcing pipeline. The model demonstrates exceptional reliability in extreme-event detection, providing a streamlined and operationally robust baseline for the global hydrological community.
