A New Statistical Model of Star Speckles for Learning to Detect and Characterize Exoplanets in Direct Imaging Observations
Théo Bodrito, Olivier Flasseur, Julien Mairal, Jean Ponce, Maud Langlois, Anne-Marie Lagrange
TL;DR
This work tackles the challenge of detecting and characterizing exoplanets in direct imaging by modeling the nuisance speckle field with a multi-scale, convolutional statistical framework that exploits rotational and spectral symmetries. It couples this interpretable statistical model with an end-to-end learnable prior, enabling simultaneous detection and flux/position estimation while providing uncertainty quantification. The method demonstrates improved detection performance, especially in ASDI data, and robust flux/position estimates across varied data quality, validated on SPHERE/VLT datasets and synthetic injections, with calibration to control false-alarm rates. The approach is computationally efficient, scalable to large surveys, and extensible to higher spectral resolution and extended structures like circumstellar disks.
Abstract
The search for exoplanets is an active field in astronomy, with direct imaging as one of the most challenging methods due to faint exoplanet signals buried within stronger residual starlight. Successful detection requires advanced image processing to separate the exoplanet signal from this nuisance component. This paper presents a novel statistical model that captures nuisance fluctuations using a multi-scale approach, leveraging problem symmetries and a joint spectral channel representation grounded in physical principles. Our model integrates into an interpretable, end-to-end learnable framework for simultaneous exoplanet detection and flux estimation. The proposed algorithm is evaluated against the state of the art using datasets from the SPHERE instrument operating at the Very Large Telescope (VLT). It significantly improves the precision-recall trade-off, notably on challenging datasets that are otherwise unusable by astronomers. The proposed approach is computationally efficient, robust to varying data quality, and well suited for large-scale observational surveys.
