A GPU-accelerated Molecular Docking Workflow with Kubernetes and Apache Airflow

Daniel Medeiros; Gabin Schieffer; Jacob Wahlgren; Ivy Peng

A GPU-accelerated Molecular Docking Workflow with Kubernetes and Apache Airflow

Daniel Medeiros, Gabin Schieffer, Jacob Wahlgren, Ivy Peng

TL;DR

This study investigates the transition and deployment of a GPU-accelerated molecular docking workflow that was designed for HPC systems onto a cloud-native environment with Kubernetes and Apache Airflow and provides a DAG-based implementation in Apache Airflow.

Abstract

Complex workflows play a critical role in accelerating scientific discovery. In many scientific domains, efficient workflow management can lead to faster scientific output and broader user groups. Workflows that can leverage resources across the boundary between cloud and HPC are a strong driver for the convergence of HPC and cloud. This study investigates the transition and deployment of a GPU-accelerated molecular docking workflow that was designed for HPC systems onto a cloud-native environment with Kubernetes and Apache Airflow. The case study focuses on state-of-of-the-art molecular docking software for drug discovery. We provide a DAG-based implementation in Apache Airflow and technical details for GPU-accelerated deployment. We evaluated the workflow using the SWEETLEAD bioinformatics dataset and executed it in a Cloud environment with heterogeneous computing resources. Our workflow can effectively overlap different stages when mapped onto different computing resources.

A GPU-accelerated Molecular Docking Workflow with Kubernetes and Apache Airflow

TL;DR

Abstract

A GPU-accelerated Molecular Docking Workflow with Kubernetes and Apache Airflow

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)