Constraining dark matter halo profiles with symbolic regression
Alicia Martín, Tariq Yasin, Deaglan J. Bartlett, Harry Desmond, Pedro G. Ferreira
TL;DR
This work introduces Exhaustive Symbolic Regression (ESR) to constrain dark matter halo density profiles directly from observations, using MDL to trade off fit quality and complexity. By applying ESR to mock weak-lensing Excess Surface Density data, the authors demonstrate that the standard NFW profile can be recovered in low-noise regimes, while higher noise levels favor simpler, less parameter-rich functions; a generalized NFW with a global outer slope also closely matches NFW predictions. The method is designed to be simulation-independent and extends to combining data from multiple halos via local/global parameter strategies, offering a transparent, interpretable alternative to black-box ML approaches. Beyond lensing, ESR can be extended to galactic rotation curves, enabling data-driven constraints on three-dimensional halo profiles while accounting for baryonic contributions. This framework thus provides a flexible, principled path to test halo-model assumptions and guide interpretation of upcoming high-precision surveys.
Abstract
Dark matter haloes are typically characterised by radial density profiles with fixed forms motivated by simulations (e.g. NFW). However, simulation predictions depend on uncertain dark matter physics and baryonic modelling. Here, we present a method to constrain halo density profiles directly from observations using Exhaustive Symbolic Regression (ESR), a technique that searches the space of analytic expressions for the function that best balances accuracy and simplicity for a given dataset. We test the approach on mock weak lensing excess surface density (ESD) data of synthetic clusters with NFW profiles. Motivated by real data, we assign each ESD data point a constant fractional uncertainty and vary this uncertainty and the number of clusters to probe how data precision and sample size affect model selection. For fractional errors around 5%, ESR recovers the NFW profile even from samples as small as 20 clusters. At higher uncertainties representative of current surveys, simpler functions are favoured over NFW, though it remains competitive. This preference arises because weak lensing errors are smallest in the outskirts, causing the fits to be dominated by the outer profile. ESR therefore provides a robust, simulation-independent framework both for testing mass models and determining which features of a halo's density profile are genuinely constrained by the data.
