Enabling High Data Throughput Reinforcement Learning on GPUs: A Domain Agnostic Framework for Data-Driven Scientific Research
Tian Lan, Huan Wang, Caiming Xiong, Silvio Savarese
TL;DR
WarpSci presents a GPU-centered, domain-agnostic framework that eliminates CPU–GPU data transfers and enables thousands of concurrent RL simulations, addressing data-throughput bottlenecks in data-driven science. Built atop a unified in-place GPU data store and CUDA backend, WarpSci runs the entire RL workflow on GPUs and provides Python interfaces to streamline environment construction and interaction. Across classic control, multi-agent economics, and catalytic reaction problems, WarpSci achieves 10–100× throughput improvements and near-linear scaling, with faster and more stable convergence as data throughput increases. The framework demonstrates substantial practical impact for speeding up scientific RL studies, enabling domain-spanning experiments such as hydrogenation pathway exploration in catalysis and Haber–Bosch process optimization, using high-throughput, environment-agnostic simulations.
Abstract
We introduce WarpSci, a domain agnostic framework designed to overcome crucial system bottlenecks encountered in the application of reinforcement learning to intricate environments with vast datasets featuring high-dimensional observation or action spaces. Notably, our framework eliminates the need for data transfer between the CPU and GPU, enabling the concurrent execution of thousands of simulations on a single or multiple GPUs. This high data throughput architecture proves particularly advantageous for data-driven scientific research, where intricate environment models are commonly essential.
