MGARD: A multigrid framework for high-performance, error-controlled data compression and refactoring
Qian Gong, Jieyang Chen, Ben Whitney, Xin Liang, Viktor Reshniak, Tania Banerjee, Jaemoon Lee, Anand Rangarajan, Lipeng Wan, Nicolas Vidal, Qing Liu, Ana Gainaru, Norbert Podhorszki, Richard Archibald, Sanjay Ranka, Scott Klasky
TL;DR
MGARD tackles the data deluge from exascale simulations and large telescopes by introducing a multigrid adaptive reduction framework for error-controlled lossy data compression and data refactoring. It uses a hierarchical representation $u'_{mc}$, quantization, and bitplane encoding, with error estimators guaranteeing fidelity in the $L^2$ norm and for quantities of interest (QoIs), across structured and unstructured grids. The software features two API tiers (high-level and low-level), auto-tuning, multi-device out-of-core operation, and GPU-accelerated kernels, and it integrates with ADIOS for seamless I/O. Across plasma physics, cosmology, and radio astronomy, MGARD demonstrates dramatic storage reductions and I/O speedups while preserving scientific fidelity, including QoI-preserving post-processing and progressive recomposition via refactoring.
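The idea of a hierarchical representation followed by quantization can be illustrated with a minimal 1D sketch. This is not MGARD's API or its actual algorithm: it uses plain linear-interpolation residuals on a uniform grid of $2^L + 1$ points, omits the projection-based correction, the bitplane encoding, and the lossless stage, and bounds the error in the maximum norm rather than $L^2$. The function names `decompose`, `recompose`, `compress`, and `decompress` are hypothetical and chosen only for illustration.

```python
import math

def decompose(u):
    """Replace each fine-grid value by its residual against the linear
    interpolant of its two coarser neighbors, finest level first."""
    c = list(u)
    n = len(c) - 1                      # assumed to be a power of two
    stride = 1
    while stride < n:
        for i in range(stride, n, 2 * stride):
            c[i] -= 0.5 * (c[i - stride] + c[i + stride])
        stride *= 2
    return c

def recompose(c):
    """Invert decompose(): rebuild values from coarsest to finest level."""
    u = list(c)
    n = len(u) - 1
    stride = n // 2
    while stride >= 1:
        for i in range(stride, n, 2 * stride):
            u[i] += 0.5 * (u[i - stride] + u[i + stride])
        stride //= 2
    return u

def compress(u, tol):
    """Quantize the multilevel coefficients to integers.

    Each quantized coefficient is off by at most q/2, and linear
    interpolation passes on at most the larger neighbor error, so the
    error grows by at most q/2 per level. With `levels` refinement
    levels plus the coarsest nodes, the total is (levels + 1) * q / 2,
    which equals tol for the bin width chosen below.
    """
    n = len(u) - 1
    assert n >= 1 and n & (n - 1) == 0, "length must be 2**L + 1"
    levels = n.bit_length() - 1         # log2(n)
    q = 2.0 * tol / (levels + 1)
    return [round(x / q) for x in decompose(u)], q

def decompress(ints, q):
    """Dequantize the integers and rebuild the signal."""
    return recompose([k * q for k in ints])

if __name__ == "__main__":
    u = [math.sin(2 * math.pi * i / 64) for i in range(65)]
    tol = 1e-3
    ints, q = compress(u, tol)
    v = decompress(ints, q)
    max_err = max(abs(a - b) for a, b in zip(u, v))
    print("max error within tolerance:", max_err <= tol * (1 + 1e-9))
```

For smooth data the fine-level residuals are small, so the integers cluster near zero and are cheap to encode in a later lossless stage. Refactoring exploits the same hierarchy: transmitting only the coarser levels or the most significant bits yields a coarser approximation that can be refined progressively.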
Abstract
We describe MGARD, a software package that provides MultiGrid Adaptive Reduction for floating-point scientific data on structured and unstructured grids. With exceptional data compression capability and precise error control, MGARD addresses a wide range of requirements, including storage reduction, high-performance I/O, and in-situ data analysis. It features a unified application programming interface (API) that operates seamlessly across diverse computing architectures. MGARD has been optimized with highly tuned GPU kernels and efficient memory and device management mechanisms, ensuring fast, scalable operation.
