Segmented-Polynomial-fitting Least Squares (SPLS): An optimized algorithm to find Earth twins
Shuyue Zheng, Fabo Feng, Yicheng Rui
TL;DR
Segmented-Polynomial-fitting Least Squares (SPLS) introduces a joint model for transits and background trends using a segmented double-polynomial framework to enhance detection of weak, long-period Earth-analog transits. The method employs Bayes-factor–driven trend-order selection and a periodogram significance metric (SDE) derived from a log-likelihood difference, with a three-step approximation to accelerate computation. Injection–recovery tests on Kepler data show SPLS achieving higher true-positive rates and lower false positives than standard detrending-detection pipelines, including a 97% recovery rate for Kepler single-planet systems. SPLS thereby improves sensitivity to Earth twins in current and upcoming missions (Kepler, TESS, PLATO, Earth 2.0), albeit with higher computational costs that could be mitigated by GPU implementation and further methodological refinements.
Abstract
Detecting Earth twins remains challenging because their shallow, long-period transits are difficult to distinguish from background noise. Motivated by the challenge, we developed Segmented-Polynomial-fitting Least Squares (SPLS), a new algorithm that simultaneously fits planetary transits and background trends using a segmented double polynomial model. Prior to signal detection, the optimal polynomial order for the trend component is selected using Bayes factor-based model comparison. During the periodogram search, the Signal Detection Efficiency metric is used to assess signal significance. The algorithm is accelerated by a three-step approximation and nonlinear parameter sampling tailored to SPLS. We compare the performance of SPLS with traditional detrending-detection approaches across different orbital periods, signal-to-noise ratios (SNR), planet radii and stellar noise levels on an injection-recovery test. When detecting signals with periods between 10 and 480 days and SNRs below 9, SPLS achieves at least a 22.6% higher true positive rate than other methods at the same 10% false positive rate. Using the threshold determined from the Receiver Operating Characteristic curve analysis, our method also recovers the most true signals while yielding the fewest false positives among all injected samples, and reaches a 97% recovery fraction in Kepler confirmed single-planet systems. The tests demonstrate that SPLS improves the detection of transiting planets, particularly for low-SNR, long-period signals. It offers the potential for finding Earth twins in future applications to data from Kepler, TESS, and upcoming PLATO and Earth 2.0 missions.
