GPUMC: A Stateless Model Checker for GPU Weak Memory Concurrency

Soham Chakraborty; S. Krishna; Andreas Pavlogiannis; Omkar Tuppe

GPUMC: A Stateless Model Checker for GPU Weak Memory Concurrency

Soham Chakraborty, S. Krishna, Andreas Pavlogiannis, Omkar Tuppe

TL;DR

The paper tackles correctness verification for GPU programs under weak memory concurrency by introducing GPUMC, a stateless model checker tailored to the scoped-RC11 ($SRC11$) model. It extends dynamic partial order reduction (DPOR) to GPUs, providing exploration, barrier-aware scheduling, and reads-from management while guaranteeing soundness, completeness, and optimality with polynomial space. A key feature is automatic repair of certain heterogeneous races, enabling transformation of pathological programs into race-free versions. Empirical evaluation shows GPUMC scales to larger GPU programs, detects races that other tools miss, and outperforms bounded-model-checking approaches like Dartagnan in both time and memory, offering a practical pathway to reliable GPU software engineering under weak memory models.

Abstract

GPU computing is embracing weak memory concurrency for performance improvement. However, compared to CPUs, modern GPUs provide more fine-grained concurrency features such as scopes, have additional properties like divergence, and thereby follow different weak memory consistency models. These features and properties make concurrent programming on GPUs more complex and error-prone. To this end, we present GPUMC, a stateless model checker to check the correctness of GPU shared-memory concurrent programs under scoped-RC11 weak memory concurrency model. GPUMC explores all possible executions in GPU programs to reveal various errors - races, barrier divergence, and assertion violations. In addition, GPUMC also automatically repairs these errors in the appropriate cases. We evaluate GPUMC with benchmarks and real-life GPU programs. GPUMC is efficient both in time and memory in verifying large GPU programs where state-of-the-art tools are timed out. In addition, GPUMC identifies all known errors in these benchmarks compared to the state-of-the-art tools.

GPUMC: A Stateless Model Checker for GPU Weak Memory Concurrency

TL;DR

The paper tackles correctness verification for GPU programs under weak memory concurrency by introducing GPUMC, a stateless model checker tailored to the scoped-RC11 (

) model. It extends dynamic partial order reduction (DPOR) to GPUs, providing exploration, barrier-aware scheduling, and reads-from management while guaranteeing soundness, completeness, and optimality with polynomial space. A key feature is automatic repair of certain heterogeneous races, enabling transformation of pathological programs into race-free versions. Empirical evaluation shows GPUMC scales to larger GPU programs, detects races that other tools miss, and outperforms bounded-model-checking approaches like Dartagnan in both time and memory, offering a practical pathway to reliable GPU software engineering under weak memory models.

GPUMC: A Stateless Model Checker for GPU Weak Memory Concurrency

TL;DR

Abstract

GPUMC: A Stateless Model Checker for GPU Weak Memory Concurrency

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (17)