Globus Service Enhancements for Exascale Applications and Facilities
Weijian Zheng, Jack Kordas, Tyler J. Skluzacek, Raj Kettimuthu, Ian Foster
TL;DR
The paper tackles the challenge of moving extremely large data files in exascale workflows where traditional Globus/GridFTP optimizations for many small files are insufficient. It proposes client-side chunking to partition large files across multiple data movers (DTNs) and to overlap transfer with integrity checking via ERET/ESTO; performance is evaluated on three facilities using Lustre file systems. The experiments show substantial gains: up to 9.5× speedups for single large-file transfers, significant reductions in checksum overhead when chunking is used, and large influence of Lustre striping configurations. The work highlights practical implications for exascale data workflows and points to automation and further optimization of integrity checks and storage-system configurations.
Abstract
Many extreme-scale applications require the movement of large quantities of data to, from, and among leadership computing facilities, as well as other scientific facilities and the home institutions of facility users. These applications, particularly when leadership computing facilities are involved, can touch upon edge cases (e.g., terabyte files) that had not been a focus of previous Globus optimization work, which had emphasized rather the movement of many smaller (megabyte to gigabyte) files. We report here on how automated client-driven chunking can be used to accelerate both the movement of large files and the integrity checking operations that have proven to be essential for large data transfers. We present detailed performance studies that provide insights into the benefits of these modifications in a range of file transfer scenarios.
