Flexible Gravitational-Wave Parameter Estimation with Transformers
Annalena Kofler, Maximilian Dax, Stephen R. Green, Jonas Wildberger, Nihar Gupte, Jakob H. Macke, Jonathan Gair, Alessandra Buonanno, Bernhard Schölkopf
TL;DR
The paper tackles the need for flexible, scalable gravitational-wave parameter estimation as data conditions vary across detectors and frequency ranges. It introduces Dingo-T1, a transformer-based encoder coupled with a normalizing flow, enabling full amortization to handle missing data and diverse analysis settings with a single model. Through extensive tests on LVK O3 data, it demonstrates improved sample efficiency, rapid inference, and the ability to perform IMR consistency tests, highlighting significant practical gains for real-time and catalog-level GW analyses. The work paves the way for generalized, task-agnostic GW inference and robust handling of incomplete data in current and future observatories.
Abstract
Gravitational-wave data analysis relies on accurate and efficient methods to extract physical information from noisy detector signals, yet the increasing rate and complexity of observations represent a growing challenge. Deep learning provides a powerful alternative to traditional inference, but existing neural models typically lack the flexibility to handle variations in data analysis settings. Such variations accommodate imperfect observations or are required for specialized tests, and could include changes in detector configurations, overall frequency ranges, or localized cuts. We introduce a flexible transformer-based architecture paired with a training strategy that enables adaptation to diverse analysis settings at inference time. Applied to parameter estimation, we demonstrate that a single flexible model -- called Dingo-T1 -- can (i) analyze 48 gravitational-wave events from the third LIGO-Virgo-KAGRA Observing Run under a wide range of analysis configurations, (ii) enable systematic studies of how detector and frequency configurations impact inferred posteriors, and (iii) perform inspiral-merger-ringdown consistency tests probing general relativity. Dingo-T1 also improves median sample efficiency on real events from a baseline of 1.4% to 4.2%. Our approach thus demonstrates flexible and scalable inference with a principled framework for handling missing or incomplete data -- key capabilities for current and next-generation observatories.
