LEP Data@EDM4hep: mitigating data loss risks by increasing data FAIRness, with a view on FCC-ee
Jacopo Fanini, Gerardo Ganis, Marcello Maggi
Abstract
The LEP data represents the most precise and highest centre-of-mass energy sample of $e^+e^-$ collision data collected to date. Numerous scientific articles have been published since the conclusion of the experiments, underscoring the ongoing relevance of this dataset and the need to secure its long-term availability according to FAIR data preservation principles. These data could also play a crucial new role in the context of the evaluation of the physics potential of FCC-ee, due to the overlapping centre-of-mass energies, offering a valuable benchmark for detector performance and physics analyses. To fulfill this role, the data should be made available in EDM4hep, the standardized event data format currently developed in the context of the common HEP software ecosystem Key4hep. Migrating to EDM4hep would not only beneficial to future studies but also significantly mitigate the risk of data loss, increase accessibility and interoperability, hence facilitate long-term data preservation. A proof of concept workflow to perform the migration has been developed and successfully applied to ALEPH data.
