Lightweight posterior construction for gravitational-wave catalogs with the Kolmogorov-Arnold network
Wenshuai Liu, Yiming Dong, Ziming Wang, Lijing Shao
TL;DR
This paper tackles the data-storage and transmission burden of gravitational-wave posterior catalogs by introducing a Kolmogorov-Arnold network (KAN)–based neural density estimator. By replacing fixed activations with learnable edge-wise splines, KAN yields interpretable, efficient density models whose posteriors can be reconstructed from compact neural network weights or closed-form analytic expressions, drastically reducing storage needs. The authors demonstrate two data products—high-fidelity neural network weights and analytic PDF expressions—that enable rapid posterior resampling with minimal loss of fidelity, validated on verifiable benchmark distributions, real GW posteriors, and hierarchical population inference. This lightweight catalog framework promises scalable, real-time, and transmission-efficient GW analyses for next-generation detectors, while preserving the statistical accuracy required for population studies. Key mathematical quantities include the posterior P(oldsymbol{ heta}|d) and the evidence Z(d), both of which are preserved in the surrogate representations.
Abstract
Neural density estimation has seen widespread applications in the gravitational-wave (GW) data analysis, which enables real-time parameter estimation for compact binary coalescences and enhances rapid inference for subsequent analysis such as population inference. In this work, we explore the application of using the Kolmogorov-Arnold network (KAN) to construct efficient and interpretable neural density estimators for lightweight posterior construction of GW catalogs. By replacing conventional activation functions with learnable splines, KAN achieves superior interpretability, higher accuracy, and greater parameter efficiency on related scientific tasks. Leveraging this feature, we propose a KAN-based neural density estimator, which ingests megabyte-scale GW posterior samples and compresses them into model weights of tens of kilobytes. Subsequently, analytic expressions requiring only several kilobytes can be further distilled from these neural network weights with minimal accuracy trade-off. In practice, GW posterior samples with fidelity can be regenerated rapidly using the model weights or analytic expressions for subsequent analysis. Our lightweight posterior construction strategy is expected to facilitate user-level data storage and transmission, paving a path for efficient analysis of numerous GW events in the next-generation GW detectors.
