Flexible Simulation Based Inference for Galaxy Photometric Fitting with Synthesizer
Thomas Harvey, Christopher C. Lovell, Sophie Newman, Christopher J. Conselice, Duncan Austin, William J. Roper, Aswin P. Vijayan, Stephen M. Wilkins, Patricia Iglesias-Navarro, Vadim Rusakov, Qiong Li, Nathan Adams, Kai Magdwick, Caio M. Goolsby, Marc Huertas-Company, Matthew Ho
TL;DR
This work presents synference, a scalable SBI framework for galaxy SED fitting that couples the Synthesizer forward model with the LtU-ILI validation toolkit to produce full Bayesian posteriors for galaxy properties from photometry. By training on up to $10^6$ simulated galaxies with an 8-parameter physical model and 14-band JWST/HST data, the authors demonstrate excellent parameter recovery (e.g., $R^2=0.99$ for $\log M_*$) and well-calibrated posteriors, while achieving substantial speedups (roughly $10^3$--$10^5$) over nested sampling or MCMC. The framework supports flexible SFHs, emission components, and multiple SPS grids, enabling rapid model comparison (e.g., BPASS vs FSPS) and joint redshift-parameter inference (Model 2) on real data from the JADES GOODS-South field. The results underscore the potential of amortized SBI to transform analyses of next-generation surveys by delivering fast, fully Bayesian inferences across massive galaxy catalogs, with robust validation and extensible support for spectroscopy and complex forward models.
Abstract
We introduce Synference, a new, flexible Python framework for galaxy SED fitting using simulation-based inference (SBI). Synference leverages the Synthesizer package for flexible forward-modelling of galaxy SEDs and integrates the LtU-ILI package to ensure best practices in model training and validation. In this work we demonstrate Synference by training a neural posterior estimator on $10^6$ simulated galaxies, based on a flexible 8-parameter physical model, to infer galaxy properties from 14-band HST and JWST photometry. We validate this model, demonstrating excellent parameter recovery (e.g. R$^2>$0.99 for M$_\star$) and accurate posterior calibration against nested sampling results. We apply our trained model to 3,088 spectroscopically-confirmed galaxies in the JADES GOODS-South field. The amortized inference is exceptionally fast, having nearly fixed cost per posterior evaluation and processing the entire sample in $\sim$3 minutes on a single CPU (18 galaxies/CPU/sec), a $\sim$1700$\times$ speedup over traditional nested sampling or MCMC techniques. We demonstrate Synference's ability to simultaneously infer photometric redshifts and physical parameters, and highlight its utility for rapid Bayesian model comparison by demonstrating systematic stellar mass differences between two commonly used stellar population synthesis models. Synference is a powerful, scalable tool poised to maximise the scientific return of next-generation galaxy surveys.
