SuperBench: A Super-Resolution Benchmark Dataset for Scientific Machine Learning
Pu Ren, N. Benjamin Erichson, Junyi Guo, Shashank Subramanian, Omer San, Zarija Lukic, Michael W. Mahoney
TL;DR
SuperBench introduces the first standardized, high-resolution SR benchmark for Scientific ML, combining fluid turbulence (NSKT), cosmology hydrodynamics (Nyx), and ERA5 weather data up to $2048\times2048$ with ~439 GB total. It defines realistic degradation modes (bicubic, uniform down-sampling with noise, LR simulations) and evaluates SR methods with pixel-level, perceptual, and domain-specific physics metrics, highlighting challenges in preserving fundamental laws when using purely data-driven approaches. Baselines including SwinIR, FNO, and physics-constrained variants reveal that incorporating domain knowledge (e.g., continuity constraints, energy spectrum alignment) improves physical fidelity over pure pixel accuracy. The dataset and evaluation framework are openly hosted and designed to extend to temporal/spatiotemporal and 3D SR, aiming to accelerate science-driven SR research while encouraging responsible use and reproducibility.
Abstract
Super-resolution (SR) techniques aim to enhance data resolution, enabling the retrieval of finer details, and improving the overall quality and fidelity of the data representation. There is growing interest in applying SR methods to complex spatiotemporal systems within the Scientific Machine Learning (SciML) community, with the hope of accelerating numerical simulations and/or improving forecasts in weather, climate, and related areas. However, the lack of standardized benchmark datasets for comparing and validating SR methods hinders progress and adoption in SciML. To address this, we introduce SuperBench, the first benchmark dataset featuring high-resolution datasets, including data from fluid flows, cosmology, and weather. Here, we focus on validating spatial SR performance from data-centric and physics-preserved perspectives, as well as assessing robustness to data degradation tasks. While deep learning-based SR methods (developed in the computer vision community) excel on certain tasks, despite relatively limited prior physics information, we identify limitations of these methods in accurately capturing intricate fine-scale features and preserving fundamental physical properties and constraints in scientific data. These shortcomings highlight the importance and subtlety of incorporating domain knowledge into ML models. We anticipate that SuperBench will help to advance SR methods for science.
