A Virtual Laboratory for Managing Computational Experiments
Eleni Adamidi, Panayiotis Deligiannis, Nikos Foutris, Thanasis Vergoulis
TL;DR
SCHEMA lab tackles reproducibility challenges in computational science by capturing rich experiment metadata and integrating containerized task execution with high-level experiment management. The approach combines a front-end and a back-end API that orchestrate tasks and workflows on Kubernetes via TESK, with components for quotas, experiments, and file handling to preserve provenance. Key contributions include a modular architecture, open-source availability, and a roadmap to RO-Crate export and broader workflow-language support. The work has practical impact by enabling scalable, transparent, and shareable computational experiments across diverse infrastructures.
Abstract
Computational experiments have become essential for scientific discovery, allowing researchers to test hypotheses, analyze complex datasets, and validate findings. However, as computational experiments grow in scale and complexity, ensuring reproducibility and managing detailed metadata becomes increasingly challenging, especially when orchestrating complex sequence of computational tasks. To address these challenges we have developed a virtual laboratory called SCHEMA lab, focusing on capturing rich metadata such as experiment configurations and performance metrics, to support computational reproducibility. SCHEMA lab enables researchers to create experiments by grouping together multiple executions and manage them throughout their life cycle. In this demonstration paper, we present the SCHEMA lab architecture, core functionalities, and implementation, emphasizing its potential to significantly enhance reproducibility and efficiency in computational research.
