The Merit of River Network Topology for Neural Flood Forecasting
Nikolas Kirschstein, Yixuan Sun
TL;DR
This paper examines whether river network topology improves neural flood forecasting. Using an end-to-end GNN on the LamaH-CE dataset, it systematically varies adjacency definitions (including isolated, binary, physical, and learned weights) and evaluates performance across multiple splits. The key finding is that topology provides negligible benefits over isolated gauges, and learned edge weights do not meaningfully correlate with static physical relationships, even under deep or subnetworks. The work suggests focusing on spike forecasting and richer metadata (e.g., inter-gauge propagation times) or DAG-specific models to realize any potential gains from graph structure.
Abstract
Climate change exacerbates riverine floods, which occur with higher frequency and intensity than ever. The much-needed forecasting systems typically rely on accurate river discharge predictions. To this end, the SOTA data-driven approaches treat forecasting at spatially distributed gauge stations as isolated problems, even within the same river network. However, incorporating the known topology of the river network into the prediction model has the potential to leverage the adjacency relationship between gauges. Thus, we model river discharge for a network of gauging stations with GNNs and compare the forecasting performance achieved by different adjacency definitions. Our results show that the model fails to benefit from the river network topology information, both on the entire network and small subgraphs. The learned edge weights correlate with neither of the static definitions and exhibit no regular pattern. Furthermore, the GNNs struggle to predict sudden, narrow discharge spikes. Our work hints at a more general underlying phenomenon of neural prediction not always benefitting from graphical structure and may inspire a systematic study of the conditions under which this happens.
