Hardware Acceleration for HPS Algorithms in Two and Three Dimensions

Owen Melia; Daniel Fortunato; Jeremy Hoskins; Rebecca Willett

Hardware Acceleration for HPS Algorithms in Two and Three Dimensions

Owen Melia, Daniel Fortunato, Jeremy Hoskins, Rebecca Willett

TL;DR

The work develops a GPU-accelerated framework for the hierarchical Poincaré–Steklov ($HPS$) direct solvers for variable-coefficient elliptic PDEs, addressing both two- and three-dimensional problems. It introduces a 2D leaf-recomputation strategy to minimize data transfers and a 3D adaptive discretization to dramatically reduce peak memory, all within a high-order spectral collocation and nested-dissection structure. An open-source JAX implementation enables seamless automatic differentiation, enabling forward and inverse problems such as high-frequency wave scattering and linearized Poisson–Boltzmann simulations. The results show substantial GPU speedups and memory savings, demonstrating the practicality of high-order, GPU-accelerated $HPS$ solvers for large-scale 2D/3D applications with potential impact on optimization, inverse problems, and scientific computing workflows.

Abstract

We provide a flexible, open-source framework for hardware acceleration, namely massively-parallel execution on general-purpose graphics processing units (GPUs), applied to the hierarchical Poincaré--Steklov (HPS) family of algorithms for building fast direct solvers for linear elliptic partial differential equations. To take full advantage of the power of hardware acceleration, we propose two variants of HPS algorithms to improve performance on two- and three-dimensional problems. In the two-dimensional setting, we introduce a novel recomputation strategy that minimizes costly data transfers to and from the GPU; in three dimensions, we modify and extend the adaptive discretization technique of Geldermans and Gillman [2019] to greatly reduce peak memory usage. We provide an open-source implementation of these methods written in JAX, a high-level accelerated linear algebra package, which allows for the first integration of a high-order fast direct solver with automatic differentiation tools. We conclude with extensive numerical examples showing our methods are fast and accurate on two- and three-dimensional problems.

Hardware Acceleration for HPS Algorithms in Two and Three Dimensions

TL;DR

The work develops a GPU-accelerated framework for the hierarchical Poincaré–Steklov (

) direct solvers for variable-coefficient elliptic PDEs, addressing both two- and three-dimensional problems. It introduces a 2D leaf-recomputation strategy to minimize data transfers and a 3D adaptive discretization to dramatically reduce peak memory, all within a high-order spectral collocation and nested-dissection structure. An open-source JAX implementation enables seamless automatic differentiation, enabling forward and inverse problems such as high-frequency wave scattering and linearized Poisson–Boltzmann simulations. The results show substantial GPU speedups and memory savings, demonstrating the practicality of high-order, GPU-accelerated

solvers for large-scale 2D/3D applications with potential impact on optimization, inverse problems, and scientific computing workflows.

Hardware Acceleration for HPS Algorithms in Two and Three Dimensions

TL;DR

Abstract

Hardware Acceleration for HPS Algorithms in Two and Three Dimensions

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (2)