Improving Earth-like planet detection in radial velocity using deep learning
Yinan Zhao, Xavier Dumusque, Michael Cretignier, Andrew Collier Cameron, David W. Latham, Mercedes López-Morales, Michel Mayor, Alessandro Sozzetti, Rosario Cosentino, Isidro Gómez-Vargas, Francesco Pepe, Stephane Udry
TL;DR
Stellar activity obscures low-mass exoplanets in radial velocity measurements, limiting Earth-like planet detection. The authors introduce a CNN that operates on a compact shell spectral representation to model activity-induced line-shape variations and predict RV, FWHM, and BIS, while avoiding fitting planetary signals. They demonstrate substantial gains on the Sun, Alpha Centauri B, and Tau Ceti, achieving detection thresholds down to ~0.2–0.7 m/s over representative period ranges, and show the method can significantly mitigate activity in solar data. The approach is data-driven and star-adaptive, scalable via Optuna hyperparameter tuning, and holds promise for approaching Earth-like planet sensitivity with further enhancements such as flux-effect inputs and transfer learning.
Abstract
Many novel methods have been proposed to mitigate stellar activity for exoplanet detection as the presence of stellar activity in radial velocity (RV) measurements is the current major limitation. Unlike traditional methods that model stellar activity in the RV domain, more methods are moving in the direction of disentangling stellar activity at the spectral level. The goal of this paper is to present a novel convolutional neural network-based algorithm that efficiently models stellar activity signals at the spectral level, enhancing the detection of Earth-like planets. We trained a convolutional neural network to build the correlation between the change in the spectral line profile and the corresponding RV, full width at half maximum (FWHM) and bisector span (BIS) values derived from the classical cross-correlation function. This algorithm has been tested on three intensively observed stars: Alpha Centauri B (HD128621), Tau ceti (HD10700), and the Sun. By injecting simulated planetary signals at the spectral level, we demonstrate that our machine learning algorithm can achieve, for HD128621 and HD10700, a detection threshold of 0.5 m/s in semi-amplitude for planets with periods ranging from 10 to 300 days. This threshold would correspond to the detection of a $\sim$4$\mathrm{M}_{\oplus}$ in the habitable zone of those stars. On the HARPS-N solar dataset, our algorithm is even more efficient at mitigating stellar activity signals and can reach a threshold of 0.2 m/s, which would correspond to a 2.2$\mathrm{M}_{\oplus}$ planet on the orbit of the Earth. To the best of our knowledge, it is the first time that such low detection thresholds are reported for the Sun, but also for other stars, and therefore this highlights the efficiency of our convolutional neural network-based algorithm at mitigating stellar activity in RV measurements.
