Template-Free Gravitational Wave Detection with CWT-LSTM Autoencoders: A Case Study of Run-Dependent Calibration Effects in LIGO Data
Jericho Cain
TL;DR
This work presents a template-free approach to gravitational wave detection by combining Continuous Wavelet Transform (CWT) preprocessing with a Long Short-Term Memory (LSTM) autoencoder. The method learns detector-noise characteristics from unlabeled data and flags deviations as potential signals, achieving 97.0% precision and 96.1% recall on LIGO O4 data, with an AUC of 0.994. A key finding is the discovery and resolution of cross-run batch effects in GWOSC data, where multi-run training produced run-dependent reconstruction errors; restricting to a single, well-calibrated run (O4) eliminated these biases and yielded robust performance. The study demonstrates that template-free anomaly detection can rival supervised approaches while preserving discovery potential for signals with unexpected morphologies, and provides methodological guidance for handling multi-epoch astrophysical datasets in ML pipelines.
Abstract
Gravitational wave detection requires sophisticated signal processing to identify weak astrophysical signals buried in instrumental noise. Traditional matched filtering approaches face computational challenges with diverse signal morphologies and non-stationary noise. This work presents an unsupervised deep learning methodology integrating Continuous Wavelet Transform (CWT) preprocessing with Long Short-Term Memory (LSTM) autoencoder architecture for template-free gravitational wave detection. We train and evaluate our model on LIGO H1 data from Observing Run 4 (O4, 2023-2024), comprising 126 confirmed gravitational wave events from the GWTC-4.0 catalog and 1991 noise segments. During development, we discovered that reconstruction errors from multi-run training (O1-O4) clustered by observing run rather than astrophysical parameters, revealing systematic batch effects from GWOSC's evolving calibration procedures. Following LIGO's established practice of per-run optimization, we adopted single-run (O4) training, which eliminated these batch effects and improved recall from 52% to 96% while maintaining 97% precision. The final model achieves 97.0% precision, 96.1% recall, F1-score 96.6%, and ROC-AUC 0.994 on 102 test signals and 399 noise segments. The reconstruction error distribution shows clean separation between noise (mean 0.48) and signals (mean 0.77). This unsupervised, template-free approach demonstrates that anomaly detection can achieve performance competitive with supervised methods while enabling discovery of signals with unexpected morphologies. Our identification and resolution of cross-run batch effects provides methodological guidance for future machine learning applications to multi-epoch gravitational wave datasets.
