Table of Contents
Fetching ...

AGNBoost: A Machine Learning Approach to AGN Identification with JWST/NIRCam+MIRI Colors and Photometry

Kurt Hamblin, Allison Kirkpatrick, Bren E. Backhaus, Gregory Troiani, Jeyhan S. Kartaltepe, Dale D. Kocevski, Anton M. Koekemoer, Erini Lambrides, Casey Papovich, Kaila Ronayne, Guang Yang, Micaela B. Bagley, Mark Dickinson, Steven L. Finkelstein, Pablo Arrabal Haro, Fabio Pacucci, Jonathan R. Trump, Nor Pirzkal, Alexander de la Vega, Edgar Perez Vidal, L. Y. Aaron Yung

TL;DR

The flexible framework allows straightforward incorporation of additional photometric bands and re-training for other variables, and AGNBoost's computational efficiency makes it well-suited for wide-sky surveys requiring rapid AGN identification and redshift estimation.

Abstract

We present AGNBoost, a machine learning framework utilizing XGBoostLSS to identify AGN and estimate redshifts from JWST NIRCam and MIRI photometry. AGNBoost constructs 66 input features from 7 NIRCam and 4 MIRI bands to predict the fraction of mid-IR $3$--$30\,μ$m emission attributable to an AGN power law ($\text{frac}_{\text{AGN}}$) and photometric redshift. Each model is trained on $10^6$ simulated galaxies from CIGALE. Models are tested on mock CIGALE galaxies, an independent set of empirically-derived templates, and 748 observations from the JWST MIRI EGS Galaxy and AGN (MEGA) survey. On idealized noise-free mock CIGALE galaxies, AGNBoost achieves $15\%$ outlier fractions of $1.63\%$ ($\text{frac}_{\text{AGN}}$) and $0.15\%$ (redshift), with $σ_{\text{RMSE}} = 0.045$ for $\text{frac}_{\text{AGN}}$ and $σ_{\text{NMAD}} = 0.004$ for redshift. When realistic photometric uncertainties are introduced, performance remains robust with median predictions on the 1:1 relation, though outlier fractions increase to $4.38\%$ and $3.35\%$, respectively. On the independent template set, AGNBoost identifies $92.6\%$ of AGN candidates with $\text{frac}_{\text{AGN}} > 0.3$ and $100\%$ with $\text{frac}_{\text{AGN}} > 0.5$, demonstrating generalization beyond the training distribution. On MEGA galaxies with spectroscopic redshifts, AGNBoost achieves $σ_{\text{NMAD}} = 0.056$ and $19.79\%$ outliers. AGNBoost $\text{frac}_{\text{AGN}}$ estimates broadly agree with CIGALE fitting ($σ_{\text{RMSE}} = 0.178$, $11.96\%$ outliers). The flexible framework allows straightforward incorporation of additional photometric bands and re-training for other variables. AGNBoost's computational efficiency makes it well-suited for wide-sky surveys requiring rapid AGN identification and redshift estimation.

AGNBoost: A Machine Learning Approach to AGN Identification with JWST/NIRCam+MIRI Colors and Photometry

TL;DR

The flexible framework allows straightforward incorporation of additional photometric bands and re-training for other variables, and AGNBoost's computational efficiency makes it well-suited for wide-sky surveys requiring rapid AGN identification and redshift estimation.

Abstract

We present AGNBoost, a machine learning framework utilizing XGBoostLSS to identify AGN and estimate redshifts from JWST NIRCam and MIRI photometry. AGNBoost constructs 66 input features from 7 NIRCam and 4 MIRI bands to predict the fraction of mid-IR --m emission attributable to an AGN power law () and photometric redshift. Each model is trained on simulated galaxies from CIGALE. Models are tested on mock CIGALE galaxies, an independent set of empirically-derived templates, and 748 observations from the JWST MIRI EGS Galaxy and AGN (MEGA) survey. On idealized noise-free mock CIGALE galaxies, AGNBoost achieves outlier fractions of () and (redshift), with for and for redshift. When realistic photometric uncertainties are introduced, performance remains robust with median predictions on the 1:1 relation, though outlier fractions increase to and , respectively. On the independent template set, AGNBoost identifies of AGN candidates with and with , demonstrating generalization beyond the training distribution. On MEGA galaxies with spectroscopic redshifts, AGNBoost achieves and outliers. AGNBoost estimates broadly agree with CIGALE fitting (, outliers). The flexible framework allows straightforward incorporation of additional photometric bands and re-training for other variables. AGNBoost's computational efficiency makes it well-suited for wide-sky surveys requiring rapid AGN identification and redshift estimation.

Paper Structure

This paper contains 3 sections, 1 figure.

Figures (1)

  • Figure 1: A single frame of an animation illustrating the mid-IR color properties of SF+AGN composites, SFGs, and mid-IR weak galaxies. The AGN, Composite, and SFG templates are from kirkpatrick_2015. Here, M82 m82_paper serves as a proxy for a SFG with weak PAH features. Without prior redshift information, it is not possible to draw boundaries in this color space that robustly separate these classes of sources. The full animation can be viewed here: https://hamblin-ku.github.io/ColorAnimation/sed_color_animation.mp4.