How DREAMS are made: Emulating Satellite Galaxy and Subhalo Populations with Diffusion Models and Point Clouds
Tri Nguyen, Francisco Villaescusa-Navarro, Siddharth Mishra-Sharma, Carolina Cuesta-Lazaro, Paul Torrey, Arya Farahi, Alex M. Garcia, Jonah C. Rose, Stephanie O'Neil, Mark Vogelsberger, Xuejian Shen, Cian Roche, Daniel Anglés-Alcázar, Nitya Kallivayalil, Julian B. Muñoz, Francis-Yan Cyr-Racine, Sandip Roy, Lina Necib, Kassidy E. Kollmann
TL;DR
This work introduces NeHOD, a hybrid emulator that achieves hydrodynamic-like accuracy for painting galaxies and subhalos onto dark matter halos at a fraction of the computational cost of full hydrodynamic simulations. The framework combines conditional normalizing flows for halos and central galaxies with a Transformer-based variational diffusion model to generate satellite galaxies represented as a 3D point cloud, thereby preserving small-scale structure. Trained on the DREAMS TNG-WDM MW zoom-in suite, NeHOD jointly captures complex dependencies on DM properties and baryonic feedback, reproducing halo/central statistics and satellite statistics such as the SSMF, SHMR, and the concentration–mass relation across a wide parameter space. While NeHOD excels in field-level modeling and parameter exploration, it shows modest underprediction of small-scale clustering and velocity-space details, pointing to future enhancements in environmental conditioning, symmetry incorporation, and larger training sets. Overall, NeHOD offers a scalable, differentiable, and flexible tool for generating realistic mock catalogs for galaxy clustering, lensing, and beyond, with open-source code and broad applicability to DM and baryonic physics studies.
Abstract
The connection between galaxies and their host dark matter (DM) halos is critical to our understanding of cosmology, galaxy formation, and DM physics. To maximize the return of upcoming cosmological surveys, we need an accurate way to model this complex relationship. Many techniques have been developed to model this connection, from Halo Occupation Distribution (HOD) to empirical and semi-analytic models to hydrodynamic. Hydrodynamic simulations can incorporate more detailed astrophysical processes but are computationally expensive; HODs, on the other hand, are computationally cheap but have limited accuracy. In this work, we present NeHOD, a generative framework based on variational diffusion model and Transformer, for painting galaxies/subhalos on top of DM with an accuracy of hydrodynamic simulations but at a computational cost similar to HOD. By modeling galaxies/subhalos as point clouds, instead of binning or voxelization, we can resolve small spatial scales down to the resolution of the simulations. For each halo, NeHOD predicts the positions, velocities, masses, and concentrations of its central and satellite galaxies. We train NeHOD on the TNG-Warm DM suite of the DREAMS project, which consists of 1024 high-resolution zoom-in hydrodynamic simulations of Milky Way-mass halos with varying warm DM mass and astrophysical parameters. We show that our model captures the complex relationships between subhalo properties as a function of the simulation parameters, including the mass functions, stellar-halo mass relations, concentration-mass relations, and spatial clustering. Our method can be used for a large variety of downstream applications, from galaxy clustering to strong lensing studies.
