RiverScope: High-Resolution River Masking Dataset
Rangel Daroya, Taylor Rowley, Jonathan Flores, Elisa Friedmann, Fiona Bennitt, Heejin An, Travis Simmons, Marissa Jean Hughes, Camryn L Kluetmeier, Solomon Kica, J. Daniel Vélez, Sarah E. Esenther, Thomas E. Howard, Yanqi Ye, Audrey Turcotte, Colin Gleason, Subhransu Maji
TL;DR
RiverScope addresses the need for fine-scale river monitoring by delivering a global 3 m/pixel PlanetScope dataset with expert water masks (1,145 images over 2,577 km^2) co-registered to SWOT, SWORD, and Sentinel-2 for cross-sensor benchmarking. The work benchmarks 27 segmentation and width-estimation models across architectures and pretraining regimes, and introduces a global river width benchmark achieving a median error of 7.2 meters, far outperforming Landsat, Sentinel, and SWOT-derived widths. It demonstrates that 4-channel multispectral inputs with learned linear adaptors and high-resolution training yield state-of-the-art segmentation and width estimates, while also analyzing cost-accuracy trade-offs among sensors. This resource enables fine-scale hydrological modeling, supports climate adaptation, and invites the ML community to advance multi-sensor river monitoring.
Abstract
Surface water dynamics play a critical role in Earth's climate system, influencing ecosystems, agriculture, disaster resilience, and sustainable development. Yet monitoring rivers and surface water at fine spatial and temporal scales remains challenging -- especially for narrow or sediment-rich rivers that are poorly captured by low-resolution satellite data. To address this, we introduce RiverScope, a high-resolution dataset developed through collaboration between computer science and hydrology experts. RiverScope comprises 1,145 high-resolution images (covering 2,577 square kilometers) with expert-labeled river and surface water masks, requiring over 100 hours of manual annotation. Each image is co-registered with Sentinel-2, SWOT, and the SWOT River Database (SWORD), enabling the evaluation of cost-accuracy trade-offs across sensors -- a key consideration for operational water monitoring. We also establish the first global, high-resolution benchmark for river width estimation, achieving a median error of 7.2 meters -- significantly outperforming existing satellite-derived methods. We extensively evaluate deep networks across multiple architectures (e.g., CNNs and transformers), pretraining strategies (e.g., supervised and self-supervised), and training datasets (e.g., ImageNet and satellite imagery). Our best-performing models combine the benefits of transfer learning with the use of all the multispectral PlanetScope channels via learned adaptors. RiverScope provides a valuable resource for fine-scale and multi-sensor hydrological modeling, supporting climate adaptation and sustainable water management.
