On the sensitivity of different galaxy properties to warm dark matter
Belén Costanza, Bonny Y. Wang, Francisco Villaescusa-Navarro, Alex M. Garcia, Jonah C. Rose, Mark Vogelsberger, Paul Torrey, Arya Farahi, Xuejian Shen, Ilem Leisher
TL;DR
This paper investigates how sensitively galaxy properties respond to warm dark matter by analyzing 1,024 DREAMS cosmological hydrodynamical simulations. It uses a combination of machine learning methods—an MLP with normalizing flows on global galaxy-property statistics and a GNN with normalizing flows on halo substructure—to infer the WDM mass, complemented by symbolic regression for interpretable relations. The key finding is that the gas content of subhalos, encapsulated in the gas-mass distribution and its zeros, provides the strongest constraint on $m_{\rm WDM}$ (with $R^2$ up to ~0.94 when using global statistics), while halo-level information adds only marginal improvements beyond these global descriptors. The results suggest that large-scale galaxy-population statistics largely govern WDM sensitivity, with potential extensions to observational probes such as HI, and they demonstrate robust methodology for constraining dark matter physics from complex galaxy formation simulations.
Abstract
We study the impact of warm dark matter (WDM) particle mass on galaxy properties using 1,024 state-of-the-art cosmological hydrodynamical simulations from the DREAMS project. We begin by using a Multilayer Perceptron (MLP) coupled with a normalizing flow to explore global statistical descriptors of galaxy populations, such as the mean, standard deviation, and histograms of 14 galaxy properties. We find that subhalo gas mass is the most informative feature for constraining the WDM mass, achieving a determination coefficient of R^2 = 0.9. We employ symbolic regression to extract simple, interpretable relations with the WDM particle mass. Finally, we adopt a more localized approach by selecting individual dark matter halos and using a Graph Neural Network (GNN) with a normalizing flow to infer the WDM mass, incorporating subhalo properties as node features and global simulation statistics as graph-level features. The GNN approach yields only a residual improvement over MLP models based solely on global features, indicating that most of the predictive power resides in the global descriptors, with only marginal gains from halo-level information.
