Dilated convolutional neural network for detecting extreme-mass-ratio inspirals
Tianyu Zhao, Yue Zhou, Ruijun Shi, Zhoujian Cao, Zhixiang Ren
TL;DR
This work targets detection of extreme mass ratio inspirals (EMRIs) in space-based gravitational-wave data by introducing DECODE, an end-to-end, frequency-domain detector based on a dilated causal convolutional network. By training on synthetic TDI-1.5 data and processing year-long multichannel streams, DECODE achieves high detection rates (e.g., true positive rate around $96$–$97\%$ at a false alarm rate of $1\%$) with inference times under $0.01$ s per sample, across accumulated SNRs from $50$ to $240$. The method leverages causal, dilated convolutions to capture long-range dependencies in frequency domain data, and demonstrates interpretability via activation maps and generalization across waveform models (AAK, AK, XSPEG). The study highlights the potential of fast, robust EMRI detection in future space-based GW analyses and outlines paths for improvement, such as incorporating TDI-2.0 and phase information to further boost performance.
Abstract
The detection of Extreme Mass Ratio Inspirals (EMRIs) is intricate due to their complex waveforms, extended duration, and low signal-to-noise ratio (SNR), making them more challenging to be identified compared to compact binary coalescences. While matched filtering-based techniques are known for their computational demands, existing deep learning-based methods primarily handle time-domain data and are often constrained by data duration and SNR. In addition, most existing work ignores time-delay interferometry (TDI) and applies the long-wavelength approximation in detector response calculations, thus limiting their ability to handle laser frequency noise. In this study, we introduce DECODE, an end-to-end model focusing on EMRI signal detection by sequence modeling in the frequency domain. Centered around a dilated causal convolutional neural network, trained on synthetic data considering TDI-1.5 detector response, DECODE can efficiently process a year's worth of multichannel TDI data with an SNR of around 50. We evaluate our model on 1-year data with accumulated SNR ranging from 50 to 120 and achieve a true positive rate of 96.3% at a false positive rate of 1%, keeping an inference time of less than 0.01 seconds. With the visualization of three showcased EMRI signals for interpretability and generalization, DECODE exhibits strong potential for future space-based gravitational wave data analyses.
