A Prediction-Traversal Approach for Compressing Scientific Data on Unstructured Meshes with Bounded Error
Congrong Ren, Xin Liang, Hanqi Guo
TL;DR
The paper tackles the problem of compressing large-scale unstructured-mesh scientific data with bounded error. It introduces a prediction-traversal approach that reorganizes nodal data into sequences via seed-based traversal on the mesh dual graph and uses barycentric extrapolation to predict values within a user-defined bound, framed by SZ3-style encoding. To quantify fidelity on continuous domains, it proposes CMSE, a cellwise integral extension of MSE, and demonstrates strong compression ratios and quality across ocean/climate and CFD datasets. The work enables efficient storage and visualization for unstructured-mesh simulations and provides a foundation for extending bounded-error compression to time-varying multivariate mesh data and other unstructured data types.
Abstract
We explore an error-bounded lossy compression approach for reducing scientific data associated with 2D/3D unstructured meshes. While existing lossy compressors offer a high compression ratio with bounded error for regular grid data, methodologies tailored for unstructured mesh data are lacking; for example, one can compress nodal data as 1D arrays, neglecting the spatial coherency of the mesh nodes. Inspired by the SZ compressor, which predicts and quantizes values in a multidimensional array, we dynamically reorganize nodal data into sequences. Each sequence starts with a seed cell; based on a predefined traversal order, the next cell is added to the sequence if the current cell can predict and quantize the nodal data in the next cell with the given error bound. As a result, one can efficiently compress the quantized nodal data in each sequence until all mesh nodes are traversed. This paper also introduces a suite of novel error metrics, namely continuous mean squared error (CMSE) and continuous peak signal-to-noise ratio (CPSNR), to assess compression results for unstructured mesh data. The continuous error metrics are defined by integrating the error function on all cells, providing objective statistics across nonuniformly distributed nodes/cells in the mesh. We evaluate our methods with several scientific simulations ranging from ocean-climate models and computational fluid dynamics simulations with both traditional and continuous error metrics. We demonstrated superior compression ratios and quality than existing lossy compressors.
