Knobs and dials of retrieving JWST transmission spectra. II. Impacts of pipeline-level differences on retrieval posteriors
Simon Schleich, Sudeshna Boro Saikia, Quentin Changeat, Manuel Güdel, Aiko Voigt, Ingo Waldmann
TL;DR
This study interrogates how JWST transmission spectra retrieved for WASP-39 b depend on data-reduction and input spectra. By generating random perturbations of SP-TW and comparing with two independently reduced spectra RU-23 and CA-24 using TauREx3, the authors identify three posterior archetypes: stable Gaussian posteriors for H$_2$O and CO$_2$, stable upper limits for CO and CH$_4$, and unstable heavy-tailed posteriors for SO$_2$, C$_2$H$_2$, and CH$_4$-related features, with the $p$-$T$ profile and Rp robust to perturbations. They demonstrate that independent reductions yield differing posteriors, challenging robust interpretation and underscoring the need for carefully chosen credible intervals (e.g., $CCI_{95}$) when reporting exoplanet atmospheric constraints. The results highlight the importance of accounting for pipeline- and data-reduction systematics in JWST-era atmospheric retrievals and provide guidance for transparent uncertainty reporting in future analyses.
Abstract
Since the launch of JWST, observations of exoplanetary atmospheres have seen a revolution in data quality. Given that atmospheric parameter inferences depend heavily on the underlying data, a re-evaluation of current methodologies is warranted to assess the reliability of these results. We investigate the impact of variations in input spectra on atmospheric retrievals for the hot Jupiter WASP-39 b using JWST transit data. Specifically, we analyse the reliability of parameter estimations from random perturbations of the underlying spectrum and their sensitivity to three transmission spectra derived from the same observational data. Using the NIRSpec PRISM observation from a single transit of WASP-39 b, we perform retrievals with the TauREx framework. As a baseline, we use a spectrum derived with the Eureka! data reduction pipeline. To evaluate retrieval reliability, we analyse posterior distributions under deviations from this spectrum. We simulate random noise by performing retrievals on scattered instances of this spectrum and compare them with retrievals based on existing spectra reduced from the same raw observation. Our analysis identifies three types of posterior distributions: (1) Stable, Gaussian distributions for species constrained across the entire spectrum (e.g., H2O, CO2); (2) Uniform posteriors with upper bounds for weakly constrained species (e.g., CO, CH4); and (3) Unstable, heavy-tailed posteriors for species constrained by minor spectrum features (e.g., SO2, C2H2). We find that other parameters, such as the planetary radius and p-T profile, are stable under spectral perturbations. Posterior distributions differ for retrievals on independently reduced transmission spectra from the same raw data, complicating interpretation, particularly for skewed distributions. Based on this, we advocate for careful assessment and selection of credible interval sizes to reflect this.
