Estimating cluster masses: a comparative study between machine learning and maximum likelihood
Raeed Mundow, Adi Nusser
TL;DR
This paper tackles estimating cluster virial masses $M_v$ from the surrounding galaxy distribution without member identification. It compares a physics-informed maximum-likelihood estimator (MLE) that relies on universal, scaled CAH profiles with a data-driven convolutional autoencoder CNN (AE-CNN) trained on MDPL2 mock catalogs. The AE-CNN achieves lower scatter in $\log M_v$ than the MLE, notably with redshift-space input ($0.10$ dex vs $0.16$ dex) and with velocity-based inputs ($0.12$ dex and $0.16$ dex), even in the presence of inhomogeneous Malmquist bias. The results illustrate a trade-off between interpretability and flexibility, showing that the AE-CNN effectively learns the posterior mean from data while the MLE remains transparent under its universal-profile assumptions.
Abstract
We compare an autoencoder convolutional neural network (AE-CNN) with a conventional maximum-likelihood estimator (MLE) for inferring cluster virial masses, $M_v$, directly from the galaxy distribution around clusters, without identifying members or interlopers. The AE-CNN is trained on mock galaxy catalogues, whereas the MLE assumes that clusters of similar mass share the same phase-space galaxy profile. Conceptually, the MLE returns an unbiased estimate of $\log M_v$ at fixed true mass, whereas the AE-CNN approximates the posterior mean, so the true $\log M_v$ is unbiased at fixed estimate. Using MDPL2 mock clusters with redshift space number density as input, the AE-CNN attains an rms scatter of $0.10\,\textrm{dex}$ between predicted and true $\log M_v$, compared with $0.16\,\textrm{dex}$ for the MLE. With inputs based on mean peculiar velocities, binned in redshift space or observed distance, the AE-CNN achieves scatters of $0.12\,\textrm{dex}$ and $0.16\,\textrm{dex}$, respectively, despite strong inhomogeneous Malmquist bias.
