SolarZip: An Efficient and Adaptive Compression Framework for Solar EUV Imaging Data
Zedong Liu, Song Tan, Alexander Warmuth, Frédéric Schuller, Yun Hong, Wenjing Huang, Yida Gu, Bojing Zhu, Guangming Tan, Dingwen Tao
TL;DR
SolarZip addresses the data-deluge challenge in solar EUV imaging by introducing an adaptive, error-bounded lossy compression framework tailored to Solar Orbiter EUI data. It combines a hybrid strategy with enhanced spline interpolation predictors and a two-stage evaluation that integrates distortion metrics with domain-specific post hoc analyses to ensure scientific usability. The framework achieves unprecedented compression ratios (up to $800\times$ for FSI and $500\times$ for HRI_EUV) and can reduce data transmission time by up to $270\times$, while preserving key solar structures and features. This approach provides a practical data management solution for deep-space missions and demonstrates the value of adaptive, observation-driven compression for complex, dynamic astronomical data.
Abstract
Context: With the advancement of solar physics research, next-generation solar space missions and ground-based telescopes face significant challenges in efficiently transmitting and/or storing large-scale observational data. Aims: We develop an efficient compression and evaluation framework for solar EUV data, specifically optimized for Solar Orbiter Extreme Ultraviolet Imager (EUI) data, significantly reducing data volume while preserving scientific usability. Methods: We systematically evaluated four error-bounded lossy compressors across two EUI datasets. However, the existing methods cannot perfectly handle the EUI datasets (with continuously changing distance and significant resolution differences). Motivated by this, we develop an adaptive hybrid compression strategy with optimized interpolation predictors. Moreover, we designed a two-stage evaluation framework integrating distortion analysis with downstream scientific workflows, ensuring that observational analysis is not affected at high compression ratios. Results: Our framework SolarZip achieved up to 800x reduction for Full Sun Imager (FSI) data and 500x for High Resolution Imager (HRI$_{\text{EUV}}$) data. It significantly outperformed both traditional and advanced algorithms, achieving 3-50x higher compression ratios than traditional algorithms, surpassing the second-best algorithm by up to 30%. Simulation experiments verified that SolarZip can reduce data transmission time by up to 270x while ensuring the preservation of scientific usability. Conclusions: The SolarZip framework significantly enhances solar observational data compression efficiency while preserving scientific usability by dynamically selecting optimal compression methods based on observational scenarios and user requirements. This provides a promising data management solution for deep space missions like Solar Orbiter.
