AttenGW: A Lightweight Attention-Based Multi-Detector Gravitational-Wave Detection Pipeline
Victoria Tiki, Eliu Huerta
TL;DR
AttenGW addresses robust, low-latency multi-detector gravitational-wave detection by replacing graph-based detector fusion with a lightweight cross-attention module that fuses per-detector representations produced by a Hierarchical Dilated Convolutional Network. The end-to-end pipeline combines GWOSC-based preprocessing, a data generator with injected signals, and a PyTorch Lightning training/inference framework, totaling approximately $193{,}000$ trainable parameters. On real February 2020 O3b data and synthetic O3a data, a single AttenGW model achieves substantially lower background triggers than previous graph-based ensembles while maintaining competitive sensitivity, and a three-model ensemble matches or surpasses the six-model baseline. The work provides a reproducible, open-source software stack and demonstrates the practicality of attention-based aggregation for multi-detector GW searches, with future potential for longer signals, missing detectors, and online fine-tuning.
Abstract
We present AttenGW, an attention-based multi-detector gravitational-wave detection model and accompanying software stack designed for analysis of real LIGO data. AttenGW combines a per-detector hierarchical dilated convolutional network with an attention-based aggregation module that enforces cross-detector coherence, providing an alternative to graph-based aggregation schemes used in previous work. The pipeline adopts a LIGO-style preprocessing and data-loading workflow based on GWOSC time series, with standard whitening and filtering, and is released as a documented Python/PyTorch package. We benchmark AttenGW using simulated injections to estimate sensitive volume and on real O3 data, focusing on the February 2020 segment previously used to evaluate a spatiotemporal graph ensemble. On this month of data, a single AttenGW model reduces the false-positive rate relative to a single graph-based detector by a factor of a few, and an ensemble of three AttenGW models matches the performance of the corresponding six-model ensemble. Injection studies on real LIGO noise further indicate that attention-based aggregation yields stable performance on non-Gaussian backgrounds.
