Accelerating Time-to-Science by Streaming Detector Data Directly into Perlmutter Compute Nodes
Samuel S. Welborn, Bjoern Enders, Chris Harris, Peter Ercius, Deborah J. Bard
TL;DR
The paper tackles the I/O bottlenecks of high-rate detector data by introducing a RAM-to-RAM streaming workflow that transfers data directly from NCEM to NERSC using a ZeroMQ-based pipeline and a Clone-pattern distributed state system. It integrates with Distiller via a streaming session manager, enabling web-initiated streaming jobs through the NERSC Superfacility API. The approach yields up to a 14× throughput increase for small datasets and a substantial reduction in timing variability for large datasets, demonstrating faster and more reliable time-to-analysis while bypassing traditional NFS/scratch I/O. This streaming paradigm holds promise for broader adoption with further automation and decoupled service architectures, reducing dependence on shared file systems for time-sensitive experiments.
Abstract
Recent advancements in detector technology have significantly increased the size and complexity of experimental data, and high-performance computing (HPC) provides a path towards more efficient and timely data processing. However, movement of large data sets from acquisition systems to HPC centers introduces bottlenecks owing to storage I/O at both ends. This manuscript introduces a streaming workflow designed for an high data rate electron detector that streams data directly to compute node memory at the National Energy Research Scientific Computing Center (NERSC), thereby avoiding storage I/O. The new workflow deploys ZeroMQ-based services for data production, aggregation, and distribution for on-the-fly processing, all coordinated through a distributed key-value store. The system is integrated with the detector's science gateway and utilizes the NERSC Superfacility API to initiate streaming jobs through a web-based frontend. Our approach achieves up to a 14-fold increase in data throughput and enhances predictability and reliability compared to a I/O-heavy file-based transfer workflow. Our work highlights the transformative potential of streaming workflows to expedite data analysis for time-sensitive experiments.
