WorkflowHub: a registry for computational workflows
Ove Johan Ragnar Gustafsson, Sean R. Wilkinson, Finn Bacall, Luca Pireddu, Stian Soiland-Reyes, Simone Leo, Stuart Owen, Nick Juty, José M. Fernández, Björn Grüning, Tom Brown, Hervé Ménager, Salvador Capella-Gutierrez, Frederik Coppens, Carole Goble
TL;DR
WorkflowHub presents a global, domain-agnostic registry that unifies diverse computational workflow resources to improve findability, accessibility, interoperability, and reusability. It establishes a data model that reflects real-world collaborations (Organisations, Teams, Spaces) and integrates with standards (Bioschemas, EDAM, RO-Crate) and platforms (GA4GH TRS, LifeMonitor) to enable end-to-end lifecycle support from development to citation. Key contributions include a wizard-driven workflow registration flow, robust attribution and citation via DOIs and RO-Crates, and extensive integrations that connect workflows to external repositories, execution platforms, and scholarly infrastructures. This registry aims to accelerate reproducible science by providing credit, improving discoverability, and enabling cross-domain workflow reuse, with active community engagement and ongoing onboarding of new domains and communities.
Abstract
The rising popularity of computational workflows is driven by the need for repetitive and scalable data processing, sharing of processing know-how, and transparent methods. As both combined records of analysis and descriptions of processing steps, workflows should be reproducible, reusable, adaptable, and available. Workflow sharing presents opportunities to reduce unnecessary reinvention, promote reuse, increase access to best practice analyses for non-experts, and increase productivity. In reality, workflows are scattered and difficult to find, in part due to the diversity of available workflow engines and ecosystems, and because workflow sharing is not yet part of research practice. WorkflowHub provides a unified registry for all computational workflows that links to community repositories, and supports both the workflow lifecycle and making workflows findable, accessible, interoperable, and reusable (FAIR). By interoperating with diverse platforms, services, and external registries, WorkflowHub adds value by supporting workflow sharing, explicitly assigning credit, enhancing FAIRness, and promoting workflows as scholarly artefacts. The registry has a global reach, with hundreds of research organisations involved, and more than 700 workflows registered.
