Multiscale Flow for Robust and Optimal Cosmological Analysis
Biwei Dai, Uros Seljak
TL;DR
This work tackles the challenge of extracting maximal, robust cosmological information from nonlinear 2D weak-lensing fields by learning the full field-level likelihood $p(x|y)$ with a Haar wavelet–based multiscale normalizing flow. The method factorizes the likelihood across scales via multiresolution analysis, modeling each scale’s term with dedicated conditional NF blocks and combining them to recover the full density. A two-stage training procedure—generative likelihood optimization followed by discriminative calibration using $p(y|x)$ samples—yields calibrated posteriors and enables robust detection of distribution shifts, such as baryonic effects, while preserving strong cosmological constraints. The results show substantial improvements in figure-of-merit over traditional summaries (power spectrum, peak counts, scattering transform, CNN), effective baryon marginalization, and the ability to generate realistic mock weak-lensing maps, with potential applicability to other intensity maps and 3D fields.
Abstract
We propose Multiscale Flow, a generative Normalizing Flow that creates samples and models the field-level likelihood of two-dimensional cosmological data such as weak lensing. Multiscale Flow uses hierarchical decomposition of cosmological fields via a wavelet basis, and then models different wavelet components separately as Normalizing Flows. The log-likelihood of the original cosmological field can be recovered by summing over the log-likelihood of each wavelet term. This decomposition allows us to separate the information from different scales and identify distribution shifts in the data such as unknown scale-dependent systematics. The resulting likelihood analysis can not only identify these types of systematics, but can also be made optimal, in the sense that the Multiscale Flow can learn the full likelihood at the field without any dimensionality reduction. We apply Multiscale Flow to weak lensing mock datasets for cosmological inference, and show that it significantly outperforms traditional summary statistics such as power spectrum and peak counts, as well as novel Machine Learning based summary statistics such as scattering transform and convolutional neural networks. We further show that Multiscale Flow is able to identify distribution shifts not in the training data such as baryonic effects. Finally, we demonstrate that Multiscale Flow can be used to generate realistic samples of weak lensing data.
