Enhancing the reliability of machine learning for gravitational wave parameter estimation with attention-based models
Hibiki Iwanaga, Mahoro Matsuyama, Yousuke Itoh
TL;DR
The paper tackles the computational burden of Bayesian gravitational-wave parameter estimation by training two Vision Transformer–based models on spectrograms to estimate the effective spin $\chi_{\text{eff}}$ and chirp mass $\mathcal{M}$ from binary black hole signals. By leveraging attention maps, the authors verify that predictions rely on physically meaningful spectrogram regions and quantify how glitches bias estimates, showing that attention can flag unreliable results. An uncertainty-evaluation pipeline inspired by Monte Carlo ideas demonstrates 90% intervals broadly consistent with reference posterior estimates, with total inference time reduced to around six minutes. The approach offers a pathway to rapid, reliable GW parameter estimation and introduces a practical diagnostic tool for glitch robustness that could inform future training and automatic reliability checks in real data analyses.
Abstract
We introduce a technique to enhance the reliability of gravitational wave parameter estimation results produced by machine learning. We develop two independent machine learning models based on the Vision Transformer to estimate effective spin and chirp mass from spectrograms of gravitational wave signals from binary black hole mergers. To enhance the reliability of these models, we utilize attention maps to visualize the areas our models focus on when making predictions. This approach enables demonstrating that both models perform parameter estimation based on physically meaningful information. Furthermore, by leveraging these attention maps, we demonstrate a method to quantify the impact of glitches on parameter estimation. We show that as the models focus more on glitches, the parameter estimation results become more strongly biased. This suggests that attention maps could potentially be used to distinguish between cases where the results produced by the machine learning model are reliable and cases where they are not.
