Large Data Acquisition and Analytics at Synchrotron Radiation Facilities
Aashish Panta, Giorgio Scorzelli, Amy A. Gooch, Werner Sun, Katherine S. Shanks, Suchismita Sarker, Devin Bougie, Keara Soloway, Rolf Verberg, Tracy Berman, Glenn Tarcea, John Allison, Michela Taufer, Valerio Pascucci
TL;DR
This work addresses the challenge of managing and making real-time sense of terabytes-to-petabytes of synchrotron data under tight beamtime constraints. It introduces a modular web-based framework anchored by the NSDF EntryPoint, and two dashboards for data acquisition and data evaluation that enable remote, real-time monitoring, quality control, and data-driven decision making. Deployed on CHESS beamlines ID3A, ID3B, and ID4B and tested with 43 research groups, the system processed 50–100 TB and over 10 million files by late 2024, demonstrating scalability, accessibility, and workflow improvements. The approach, built on OpenVisus and NSDF integration, offers a transferable solution for other facilities to enhance scientific productivity and collaboration at scale.
Abstract
Synchrotron facilities like the Cornell High Energy Synchrotron Source (CHESS) generate massive data volumes from complex beamline experiments, but face challenges such as limited access time, the need for on-site experiment monitoring, and managing terabytes of data per user group. We present the design, deployment, and evaluation of a framework that addresses CHESS's data acquisition and management issues. Deployed on a secure CHESS server, our system provides real time, web-based tools for remote experiment monitoring and data quality assessment, improving operational efficiency. Implemented across three beamlines (ID3A, ID3B, ID4B), the framework managed 50-100 TB of data and over 10 million files in late 2024. Testing with 43 research groups and 86 dashboards showed reduced overhead, improved accessibility, and streamlined data workflows. Our paper highlights the development, deployment, and evaluation of our framework and its transformative impact on synchrotron data acquisition.
