Table of Contents
Fetching ...

EXaCTz: Guaranteed Extremum Graph and Contour Tree Preservation for Distributed- and GPU-Parallel Lossy Compression

Yuxiao Li, Mingze Xia, Xin Liang, Bei Wang, Hanqi Guo

Abstract

This paper introduces EXaCTz, a parallel algorithm that concurrently preserves extremum graphs and contour trees in lossy-compressed scalar field data. While error-bounded lossy compression is essential for large-scale scientific simulations and workflows, existing topology-preserving methods suffer from (1) a significant throughput disparity, where topology correction speeds are on the order of MB/s, lagging orders of magnitude behind compression speeds on the order of GB/s, (2) limited support for diverse topological descriptors, and (3) a lack of theoretical convergence bounds. To address these challenges, EXaCTz introduces a high-performance, bounded-iteration algorithm that enforces topological consistency by deriving targeted edits for decompressed data. Unlike prior methods that rely on explicit topology reconstruction, EXaCTz enforces consistent min/max neighbors of all vertices, along with global ordering among critical points. As such, the algorithm enforces consistent critical-point classification, saddle extremum connectivity, and the preservation of merge/split events. We theoretically prove the convergence of our algorithm, bounded by the longest path in a vulnerability graph that characterizes potential cascading effects during correction. Experiments on real-world datasets show that EXaCTz achieves a single-GPU throughput of up to 4.52 GB/s, outperforming the state-of-the-art contour-tree-preserving method (Gorski et al.) by up to 213x (with a single-core CPU implementation for fair comparison) and 3,285x (with a single-GPU version). In distributed environments, EXaCTz scales to 128 GPUs with 55.6\% efficiency (compared with 6.4\% for a naive parallelization), processing datasets of up to 512 GB in under 48 seconds and achieving an aggregate correction throughput of up to 32.69 GB/s.

EXaCTz: Guaranteed Extremum Graph and Contour Tree Preservation for Distributed- and GPU-Parallel Lossy Compression

Abstract

This paper introduces EXaCTz, a parallel algorithm that concurrently preserves extremum graphs and contour trees in lossy-compressed scalar field data. While error-bounded lossy compression is essential for large-scale scientific simulations and workflows, existing topology-preserving methods suffer from (1) a significant throughput disparity, where topology correction speeds are on the order of MB/s, lagging orders of magnitude behind compression speeds on the order of GB/s, (2) limited support for diverse topological descriptors, and (3) a lack of theoretical convergence bounds. To address these challenges, EXaCTz introduces a high-performance, bounded-iteration algorithm that enforces topological consistency by deriving targeted edits for decompressed data. Unlike prior methods that rely on explicit topology reconstruction, EXaCTz enforces consistent min/max neighbors of all vertices, along with global ordering among critical points. As such, the algorithm enforces consistent critical-point classification, saddle extremum connectivity, and the preservation of merge/split events. We theoretically prove the convergence of our algorithm, bounded by the longest path in a vulnerability graph that characterizes potential cascading effects during correction. Experiments on real-world datasets show that EXaCTz achieves a single-GPU throughput of up to 4.52 GB/s, outperforming the state-of-the-art contour-tree-preserving method (Gorski et al.) by up to 213x (with a single-core CPU implementation for fair comparison) and 3,285x (with a single-GPU version). In distributed environments, EXaCTz scales to 128 GPUs with 55.6\% efficiency (compared with 6.4\% for a naive parallelization), processing datasets of up to 512 GB in under 48 seconds and achieving an aggregate correction throughput of up to 32.69 GB/s.

Paper Structure

This paper contains 28 sections, 1 theorem, 15 figures, 6 tables, 1 algorithm.

Key Result

Theorem 4.1

Given a maximum of $N$ allowed edits per vertex, the EXaCTz iterative correction algorithm converges within at most $N \cdot \mathcal{D}_{max}(G_R)$ iterations. $\blacktriangleleft$$\blacktriangleleft$

Figures (15)

  • Figure 1: Distortions introduced by error-bounded lossy compression on the cosmology dataset. (a) Original data. (b) SZ3-decompressed data.
  • Figure 2: Illustration of integral paths and extremum graphs. (a) Extremum graph for minima captures the connectivity between minima (blue nodes) and saddles (white nodes). (b) Extremum graph for maxima captures the connectivity between maxima (red nodes) and saddles.
  • Figure 3: Illustration on merge/contour trees. (a) Scalar field $f$ with a level set $L(c)$. (b) Contour tree. (c) Merge tree of $f$. (d) Merge tree of $-f$.
  • Figure 4: Saddle classifications based on local connectivity: (a) standard, (b) join, and (c) split. $+$/$-$ indicate neighbors with higher/lower scalar values than the center point.
  • Figure 5: Merge tree construction from an extremum graph: (a) a scalar field $f$ with critical points (blue minima, white saddles); (b) initialization of the merge tree; (c) branch extraction via Extremum Graph Pairing (EGP).
  • ...and 10 more figures

Theorems & Definitions (1)

  • Theorem 4.1