Table of Contents
Fetching ...

Model-agnostic basis functions for the 2-point correlation function of dark matter in linear theory

Aseem Paranjape, Ravi K. Sheth

TL;DR

This work addresses model-agnostic BAO analyses by seeking a minimal, theory-driven basis for the linear 2pcf, ξ_{ m lin}(r;\boldsymbol{\theta}). It introduces BiSequential, a neural-network architecture that factorizes the problem into a radial basis b_i(r) and cosmology-dependent weights w_i(\boldsymbol{\theta}), yielding a 9-function basis that describes ξ_{ m lin}(r) with ~0.6% accuracy across curved wCDM models near a fiducial ΛCDM. The authors demonstrate sub-percent residuals across elementary, basic, and stringent tests and show robust recovery of BAO features (peak, linear point, zero-crossing), significantly outperforming monomial bases. The approach provides a flexible, model-agnostic compression framework for BAO analyses, with potential extension to modified gravity and massive neutrino scenarios, and offers concrete pathways to integrate with data covariances via orthogonalization if needed. Overall, the paper presents a practical ML-driven method to compress and generalize the linear 2pcf across a broad cosmological landscape, enabling more powerful, model-independent BAO inferences.

Abstract

We consider approximating the linearly evolved 2-point correlation function (2pcf) of dark matter $ξ_{\rm lin}(r;\boldsymbolθ)$ in a cosmological model with parameters $\boldsymbolθ$ as the linear combination $ξ_{\rm lin}(r;\boldsymbolθ)\approx\sum_i\,b_i(r)\,w_i(\boldsymbolθ)$, where the functions $\mathcal{B}=\{b_i(r)\}$ form a $\textit{model-agnostic basis}$ for the linear 2pcf. This decomposition is important for model-agnostic analyses of the baryon acoustic oscillation (BAO) feature in the nonlinear 2pcf of galaxies that fix $\mathcal{B}$ and leave the coefficients $\{w_i\}$ free. To date, such analyses have made simple but sub-optimal choices for $\mathcal{B}$, such as monomials. We develop a machine learning framework for systematically discovering a $\textit{minimal}$ basis $\mathcal{B}$ that describes $ξ_{\rm lin}(r)$ near the BAO feature in a wide class of cosmological models. We use a custom architecture, denoted $\texttt{BiSequential}$, for a neural network (NN) that explicitly realizes the separation between $r$ and $\boldsymbolθ$ above. The optimal NN trained on data in which only $\{Ω_{\rm m},h\}$ are varied in a $\textit{flat}$ $Λ$CDM model produces a basis $\mathcal{B}$ comprising $9$ functions capable of describing $ξ_{\rm lin}(r)$ to $\sim0.6\%$ accuracy in $\textit{curved}$ $w$CDM models varying 7 parameters within $\sim5\%$ of their fiducial, flat $Λ$CDM values. Scales such as the peak, linear point and zero-crossing of $ξ_{\rm lin}(r)$ are also recovered with very high accuracy. We compare our approach to other compression schemes in the literature, and speculate that $\mathcal{B}$ may also encompass $ξ_{\rm lin}(r)$ in modified gravity models near our fiducial $Λ$CDM model. Using our basis functions in model-agnostic BAO analyses can potentially lead to significant statistical gains.

Model-agnostic basis functions for the 2-point correlation function of dark matter in linear theory

TL;DR

This work addresses model-agnostic BAO analyses by seeking a minimal, theory-driven basis for the linear 2pcf, ξ_{ m lin}(r;\boldsymbol{\theta}). It introduces BiSequential, a neural-network architecture that factorizes the problem into a radial basis b_i(r) and cosmology-dependent weights w_i(\boldsymbol{\theta}), yielding a 9-function basis that describes ξ_{ m lin}(r) with ~0.6% accuracy across curved wCDM models near a fiducial ΛCDM. The authors demonstrate sub-percent residuals across elementary, basic, and stringent tests and show robust recovery of BAO features (peak, linear point, zero-crossing), significantly outperforming monomial bases. The approach provides a flexible, model-agnostic compression framework for BAO analyses, with potential extension to modified gravity and massive neutrino scenarios, and offers concrete pathways to integrate with data covariances via orthogonalization if needed. Overall, the paper presents a practical ML-driven method to compress and generalize the linear 2pcf across a broad cosmological landscape, enabling more powerful, model-independent BAO inferences.

Abstract

We consider approximating the linearly evolved 2-point correlation function (2pcf) of dark matter in a cosmological model with parameters as the linear combination , where the functions form a for the linear 2pcf. This decomposition is important for model-agnostic analyses of the baryon acoustic oscillation (BAO) feature in the nonlinear 2pcf of galaxies that fix and leave the coefficients free. To date, such analyses have made simple but sub-optimal choices for , such as monomials. We develop a machine learning framework for systematically discovering a basis that describes near the BAO feature in a wide class of cosmological models. We use a custom architecture, denoted , for a neural network (NN) that explicitly realizes the separation between and above. The optimal NN trained on data in which only are varied in a CDM model produces a basis comprising functions capable of describing to accuracy in CDM models varying 7 parameters within of their fiducial, flat CDM values. Scales such as the peak, linear point and zero-crossing of are also recovered with very high accuracy. We compare our approach to other compression schemes in the literature, and speculate that may also encompass in modified gravity models near our fiducial CDM model. Using our basis functions in model-agnostic BAO analyses can potentially lead to significant statistical gains.

Paper Structure

This paper contains 20 sections, 7 equations, 11 figures.

Figures (11)

  • Figure 1: Effect of truncation of integral over $r$ to the range $r\in[30h^{-1}{\rm Mpc},150h^{-1}{\rm Mpc}]$ when evaluating the Gaussian convolution $\xi_0(s|\sigma)$ of $\xi_{\rm lin}(r)$ (equation \ref{['eq:gaussconv']}) for various values of $\sigma$. We see that the truncated integrals achieve better than $1\%$ accuracy over the range $65\leq s/(h^{-1}{\rm Mpc})\leq120$, suitable for model-agnostic BAO analyses.
  • Figure 2: 2-dimensional (projections of the) $\boldsymbol{\theta}$ vectors used for the basic (black) and stringent (red) tests of our primary analysis, in the $(\Omega_{\rm m},h)$ plane. The horizontal and vertical dotted lines mark the boundary used for testing the edge accuracy of the model in section \ref{['sec:results']}.
  • Figure 3: Illustration of the BiSequential architecture for approximating $\xi_{\rm lin}(r;\boldsymbol{\theta})\approx\sum_{i=1}^{n_\ast}b_i(r)\,w_i(\boldsymbol{\theta})$. See text for details. The same setup generalizes to approximation problems of the type $f(\mathbf{x};\boldsymbol{\theta})\approx\sum_{i=1}^{n_\ast}B_i(\mathbf{x})\,W_i(\boldsymbol{\theta})$ for arbitrary vector input $\mathbf{x}$.
  • Figure 4: (Left panel): Basis functions discovered by applying the training and validation procedure described in section \ref{['subsec:train-val-exc']}, where the training data were obtained by varying the 2 parameters $\{\Omega_{\rm m},h\}$, with $\xi_{\rm lin}(r)$ evaluated on the full range $30\leq r/(h^{-1}{\rm Mpc}) \leq 150$, as described in section \ref{['subsec:train-val-data']}. The colour bar indicates integer labels assigned to each function. (Right panel): Distributions of residuals obtained for the 'elementary' (blue dotted), 'basic' (black dashed) and 'stringent' (red solid) tests described in section \ref{['subsec:tests']}, using the corresponding data sets described in section \ref{['subsec:test-data']}. See section \ref{['sec:results']} for a discussion of the results.
  • Figure 5: (Upper panels): Comparison of the best-fit (solid) and true (dashed) functions $\xi_{\rm lin}(r;(\Omega_{\rm m},h))$ obtained in the 'basic' (left panel) and 'stringent' tests (right panel), using the basis functions shown in Fig. \ref{['fig:biNN2p-basis-residuals']}. We display results for 10 (25) randomly chosen parameter combinations out of 50 (250) in the left (right) panels. The horizontal dotted line in each panel marks the value zero. (Lower panels): Median (thick lines) and central 68% interval (bands) of the distribution of residuals $\delta$ calculated at each scale $r$. The solid brown line and brown band for each test shows the residual distribution when using all the $\boldsymbol{\theta}$ vectors (i.e., 50 and 250, respectively, for the basic and stringent tests). The dashed blue line and blue band shows residuals when restricting to $\boldsymbol{\theta}$ vectors in the boundary of the 2-dimensional projection shown in Fig. \ref{['fig:test-data']}, which comprise $\sim11\%$ of the respective samples.
  • ...and 6 more figures