Finding the boundary: Using galaxy membership to inform galaxy cluster extent through machine learning
Christine Hao, Stephanie O'Neil, Mark Vogelsberger, Vinh Tran, Lamiya Mowla, Joshua S. Speagle
TL;DR
This study uses the large-volume hydrodynamic simulation suite IllustrisTNG (specifically TNG300-1) to quantify where a galaxy's membership in the cluster versus the field transitions as a function of distance to the nearest cluster. A supervised deep neural network is trained on intrinsic galaxy properties (six primary features, with extensions to fifteen) to classify galaxies as cluster-like or field-like, yielding a probabilistic transition region characterized by a zero-point radius $r_0$ and a stacked probability profile $P(r)_{\mathrm{stack}}$ that are analyzed across cluster mass bins. The results show the transition is broad and intrinsically scattered, extending to $\sim 1{-}1.2\,R_{200,\mathrm{mean}}$ and increasing with cluster mass as $r_0 \propto M_{200,\mathrm{mean}}^{0.10}$, with dynamical properties probing deeper into the core while gas and stellar indicators vary with mass. These findings imply that conventional hard boundaries like $R_{200}$ or the splashback radius do not fully capture environmental preprocessing and highlight the value of a probabilistic, property-based boundary to study cluster environments in both simulations and observations. The work also demonstrates that categorizing galaxy properties by their underlying physics reveals distinct transition behaviours, offering a data-driven perspective on how ram-pressure stripping and related processes shape galaxy evolution in cluster outskirts.
Abstract
The spatial extent of the environment's impact on galaxies marks a transitional region between cluster and field galaxies. We present a data-driven method to identify this region in galaxy clusters with masses $M_{200\rm ,mean}>10^{13} M_{\odot}$ at $z = 0$. Using resolved galaxy samples from the largest simulation volume of IllustrisTNG (TNG300-1), we examine how galaxy properties vary as a function of distance to the closest cluster. We train neural networks to classify galaxies into cluster and field galaxies based on their intrinsic properties. Using this classifier, we present the first quantitative and probabilistic map of the transition region. It is represented as a broad and intrinsically scattered region near cluster outskirts, rather than a sharp physical boundary. This is the physical detection of a mixed population. In order to determine transition regions of different physical processes by training property-specific models, we categorise galaxy properties based on their underlying physics, i.e. gas, stellar, and dynamical. Changes to the dynamical properties dominate the innermost regions of the clusters of all masses. Stellar properties and gas properties, on the other hand, exhibit transitions at similar locations for low mass clusters, yet gas properties have transitions in the outermost regions for high mass clusters. These results have implications for cluster environmental studies in both simulations and observations, particularly in refining the definition of cluster boundaries while considering environmental preprocessing and how galaxies evolve under the effect of the cluster environment.
