Tools and Advancements towards Data Standardization of the MAGIC Collaboration
C. P. Walther, C. Nigro, D. Elsässer, W. Rhode
TL;DR
This work addresses the challenge of long-term, interoperable data stewardship in very-high-energy gamma-ray astronomy by implementing a GADF-aligned DL3 data format for MAGIC data and introducing two open-source tools: magic_dl3, which converts proprietary MAGIC data into standardized DL3 with a Gamma-ray Instrument Response Function, and autoMAGIC, a database-driven automation framework that produces reproducible, scalable analysis configurations. The authors validate these tools against the established MAGIC MARS pipeline and Gammapy using Crab Nebula and Mrk421 datasets, demonstrating good agreement in counts, IRFs, spectra, and light curves, even under varying observing conditions such as moonlight. The results illustrate that standardized DL3 data can be analyzed with existing tools while preserving traceability and reproducibility, supporting legacy data use and future open observatory workflows. Overall, the work provides a practical path toward large-scale standardized data production and long-term data preservation in VHE gamma-ray astronomy, with broad implications for interoperability and FAIR data principles. The introduced automation reduces human error and accelerates data production, enabling more efficient data reuse and multi-messenger collaborations.
Abstract
Gamma-ray astronomy is able to acquire large data volumes that astronomers use to draw scientific conclusions from. Ensuring the possibility of accessing and utilizing this data also after the lifetime of currently running experiments requires the use of a standardized data format. Following the data standardization format proposed by the gamma-ray astronomy community, we present 104 h of the first production of 166 h of data from the MAGIC Imaging Air Cherenkov Telescopes in standardized data format. Six datasets were processed from which three are presented, all of which have been analyzed and validated through comparison using the open-source software Gammapy and the MAGIC analysis software MARS. Furthermore, looking towards a large-scale production of standardized data and a legacy of the data taken by the MAGIC experiment, we have developed and implemented the automated database-driven MAGIC data reduction tool autoMAGIC which offers a reliable and reproducible way to produce high-level datasets. By utilizing the automatization of parameter configuration choices, the software allows for a reduction of human error as well as an acceleration in the production of standardized data. Here, we also show comparable results for data processed with manual and automatic methods.
