Workflow Mini-Apps: Portable, Scalable, Tunable & Faithful Representations of Scientific Workflows
Ozgur Ozan Kilic, Tianle Wang, Matteo Turilli, Mikhail Titov, Andre Merzky, Line Pouchard, Shantenu Jha
TL;DR
The paper addresses the challenge of designing and evaluating complex scientific workflows across heterogeneous HPC platforms by introducing workflow mini-apps. It presents a methodology to derive emulated tasks via a tunable, portable wfMiniAPI, and to assemble these into full workflow mini-apps using middleware such as RADICAL-Cybertools. Using two real-world workflows—the Inverse Problem and DeepDriveMD—the authors demonstrate fidelity in makespan, I/O, and resource utilization, portability across Polaris, Summit, and Frontier, and improved performance reproducibility at reduced cost. The work advances workflow engineering by providing a publishable API and a configurable framework to study scalability and execution-models without deploying full-scale, resource-heavy workflows.
Abstract
Workflows are critical for scientific discovery. However, the sophistication, heterogeneity, and scale of workflows make building, testing, and optimizing them increasingly challenging. Furthermore, their complexity and heterogeneity make performance reproducibility hard. In this paper, we propose workflow mini-apps as a tool to address the challenges in building and testing workflows while controlling the fidelity of representing realworld workflows. Workflow mini-apps are deployed and run on various HPC systems and architectures without workflow-specific constraints. We offer insight into their design and implementation, providing an analysis of their performance and reproducibility. Workflow mini-apps thus advance the science of workflows by providing simple, portable, and managed (fidelity) representations of otherwise complex and difficult-to-control real workflows.
