Geometric deep learning for galaxy-halo connection: a case study for galaxy intrinsic alignments
Yesukhei Jagvaral, Francois Lanusse, Rachel Mandelbaum
TL;DR
This work addresses intrinsic alignments as a major systematic in weak lensing by building a geometry-aware emulator for the galaxy-halo connection. It combines a conditional score-based diffusion model on mixed Euclidean and SO(3) data with an $E(3)$-equivariant graph neural network to jointly model galaxy scalar properties and 3D orientations within halos, conditioned on the tidal field. Trained and validated against IllustrisTNG-100, the generated samples reproduce the joint scalar distributions and IA statistics (ellipticity–direction correlations and projected $w_{g+}$) with good statistical fidelity across scales and subpopulations. This diffusion-geometric framework offers a scalable, physically informed approach to producing realistic mock catalogs for next-generation surveys like Rubin/LSST, aiding IA mitigation and pipeline validation.
Abstract
Forthcoming cosmological imaging surveys, such as the Rubin Observatory LSST, require large-scale simulations encompassing realistic galaxy populations for a variety of scientific applications. Of particular concern is the phenomenon of intrinsic alignments (IA), whereby galaxies orient themselves towards overdensities, potentially introducing significant systematic biases in weak gravitational lensing analyses if they are not properly modeled. Due to computational constraints, simulating the intricate details of galaxy formation and evolution relevant to IA across vast volumes is impractical. As an alternative, we propose a Deep Generative Model trained on the IllustrisTNG-100 simulation to sample 3D galaxy shapes and orientations to accurately reproduce intrinsic alignments along with correlated scalar features. We model the cosmic web as a set of graphs, each graph representing a halo with nodes representing the subhalos/galaxies. The architecture consists of a SO(3) $\times$ $\mathbb{R}^n$ diffusion generative model, for galaxy orientations and $n$ scalars, implemented with E(3) equivariant Graph Neural Networks that explicitly respect the Euclidean symmetries of our Universe. The model is able to learn and predict features such as galaxy orientations that are statistically consistent with the reference simulation. Notably, our model demonstrates the ability to jointly model Euclidean-valued scalars (galaxy sizes, shapes, and colors) along with non-Euclidean valued SO(3) quantities (galaxy orientations) that are governed by highly complex galactic physics at non-linear scales.
