PETRA: From the LISA global fit to a catalog of Galactic binaries
Aaron D. Johnson, Javier Roulet, Katerina Chatziioannou, Michele Vallisneri, Chris G. Trejo, Kyle A. Gersbach
TL;DR
Petra provides a principled postprocessing method to turn a trans-dimensional, label-switching global fit into a catalog of Galactic binaries by relabeling samples to maximize a product-of-marginals representation $p'_{cat}$ while tracking each source's probability of astrophysical origin $P^*$. It formalizes the problem with an invertible labeling $\ell$ and optimizes the relabeling and auxiliary Gaussian distributions $q(\theta_{\alpha}|\phi_{\alpha})=\mathcal{N}(\theta_{\alpha}|\mu_{\alpha},\Sigma_{\alpha})$ via KL-divergence, connecting to the information loss $I_{loss}=-H(p'_{rel})+\sum_\alpha H(p'_{\alpha})$. Demonstrations on toy models and a mock LISA dataset show Petra can robustly resolve overlapping and multimodal sources, producing catalog posteriors $p_{\alpha}(\theta)$ with astrophysical-origin probabilities $P^*_{\alpha}$ that separate real signals from noise or confusion. Implemented in the open-source package petra_catalogs, Petra operates in postprocessing and is applicable to outputs from any global-fit sampler, offering a practical path forward for constructing interpretable catalogs from complex gravitational-wave data analyses.
Abstract
The Laser Interferometer Space Antenna (LISA) will detect mHz gravitational waves from many astrophysical sources, including millions of compact binaries in the Galaxy, thousands of which may be individually resolvable. The large number of signals overlapping in the LISA dataset requires a \emph{global fit} in which an unknown number of sources are modeled simultaneously. This introduces a \emph{label-switching ambiguity} for sources in the same class, making it challenging to distill a traditional astronomical catalog from global-fit posteriors. We present a method to construct a catalog by post-processing the global-fit posterior, relabeling samples to minimize the statistical divergence between the global fit and a factorized catalog representation. The resulting catalog consists of the source posterior distributions and their probabilities of having an astrophysical origin. We demonstrate our algorithm on two toy models and on a small simulated LISA dataset of Galactic binaries. Our method is implemented in the open-source Python package \textsc{petra\_catalogs}, and it can be applied in postprocessing to the output of any global-fit sampler.
