Understanding Exoplanet Habitability: A Bayesian ML Framework for Predicting Atmospheric Absorption Spectra
Vasuda Trehan, Kevin H. Knuth, M. J. Way
TL;DR
The paper tackles the challenge of predicting exoplanet atmospheric absorption spectra from roughly 30 planetary parameters to enable Bayesian parameter inference. It proposes a forward surrogate based on PCHIP splines, trained on real Earth spectra and ROCKE-3D synthetic data, and demonstrates a proof-of-concept in a 1D, 1-parameter, six-bin setting, where each bin height $h_b$ depends on the parameters via $h_b = F_b(p_1, \dots, p_{30})$. Nested sampling is used to infer spline knot positions and heights, and Bayesian Adaptive Exploration identifies informative sampling locations to reduce predictive uncertainty. The results show accurate recovery of the spectral shapes with quantified uncertainty, while highlighting scaling challenges to the full 30-parameter, 20-bin problem and the need for more scalable forward models.
Abstract
The evolution of space technology in recent years, fueled by advancements in computing such as Artificial Intelligence (AI) and machine learning (ML), has profoundly transformed our capacity to explore the cosmos. Missions like the James Webb Space Telescope (JWST) have made information about distant objects more easily accessible, resulting in extensive amounts of valuable data. As part of this work-in-progress study, we are working to create an atmospheric absorption spectrum prediction model for exoplanets. The eventual model will be based on both collected observational spectra and synthetic spectral data generated by the ROCKE-3D general circulation model (GCM) developed by the climate modeling program at NASA's Goddard Institute for Space Studies (GISS). In this initial study, spline curves are used to describe the bin heights of simulated atmospheric absorption spectra as a function of one of the values of the planetary parameters. Bayesian Adaptive Exploration is then employed to identify areas of the planetary parameter space for which more data are needed to improve the model. The resulting system will be used as a forward model so that planetary parameters can be inferred given a planet's atmospheric absorption spectrum. This work is expected to contribute to a better understanding of exoplanetary properties and general exoplanet climates and habitability.
