A normalizing flow approach for the inference of star cluster properties from unresolved broadband photometry I: Comparison to spectral energy distribution fitting
Daniel Walter, Victor F. Ksoll, Ralf S. Klessen, Mederic Boquien, Aida Wofford, Francesco Belfiore, Daniel A. Dale, Kathryn Grasha, David A. Thilker, Leonardo Ubeda, Thomas G. Williams
TL;DR
This work tackles the challenge of inferring star-cluster properties from unresolved broadband photometry, where grid-based SED fitting becomes computationally prohibitive with nuisance parameters. It introduces a normalizing-flow framework using a conditional invertible neural network (cINN) trained on $5 imes 10^6$ CIGALE-generated photometries to learn $p(oldsymbol{ heta} obreak obreak| obreak obreak oldsymbol{x})$ for $(oldsymbol{ heta} = (\log_{10} t, \, \log_{10} m, \, E_{B-V}))$ conditioned on five-band photometry and ancillary inputs. The approach yields reasonable posterior approximations on both synthetic tests and PHANGS DR3 data, enabling efficient density estimation and sampling, with results broadly agreeing with PHANGS MLEs except where mode selection and noise-model choices drive differences. The method is particularly advantageous when forward-model nuisance parameters render the likelihood intractable or when large inference catalogs demand amortized, likelihood-free density estimation, and it sets the stage for extending to more parameters in follow-up work.
Abstract
Estimating properties of star clusters from unresolved broadband photometry is a challenging problem that is classically tackled by spectral energy distribution (SED) fitting methods that are based on simple stellar population models. However, because of their exponential scaling, grid-based methods suffer from computational limitations. In addition, nuisance parameters in the model can make the computation of the likelihood function intractable. These limitations can be overcome by modern generative deep learning methods that offer flexible and powerful tools for modeling high-dimensional posterior distributions and fast inference from learned data. We present a normalizing flow approach for the inference of cluster age, mass, and reddening from Hubble Space Telescope broadband photometry. In particular, we explore our network's behavior on an inference problem that has been analyzed in previous works. We used the SED modeling code CIGALE to create a dataset of synthetic photometric observations for $5 \times 10^6$ mock star clusters. Subsequently, this data set was used to train a coupling-based flow in the form of a conditional invertible neural network (cINN) to predict posterior probability distributions for cluster age, mass, and reddening from photometric observations. We predicted cluster parameters for the 'Physics at High Angular resolution in Nearby GalaxieS' (PHANGS) Data Release 3 catalog. To evaluate the capabilities of the network, we compared our results to the publicly available PHANGS estimates and found that the estimates agree reasonably well. We demonstrate that normalizing flow methods can be a viable tool for the inference of cluster parameters, and argue that this approach is especially useful when nuisance parameters make the computation of the likelihood intractable and in scenarios that require efficient density estimation.
