Denoising weak lensing mass maps with diffusion model and generative adversarial network
Shohei D. Aoyama, Ken Osato, Masato Shirasaki
TL;DR
The paper addresses the challenge of shape-noise contamination in weak-lensing mass maps by evaluating two denoising approaches: a diffusion-model (DM) and a pix2pix-based conditional GAN, using a large set of $\kappa$TNG mock WL maps. DM provides more stable training and enables multiple denoised realizations per input, yielding higher fidelity in both pixel-level reconstructions and cosmological statistics, notably recovering the angular power spectrum $C(\ell)$ up to $\ell \lesssim 6000$ and the one-point PDF more accurately than GAN. Across multiple statistics, including the bispectrum and scattering transform, DM samples show lower variance and robustness compared with GAN-derived realizations. The results imply that DM-based denoising can sharpen WL signals and support more reliable small-scale cosmological inferences from mass maps.
Abstract
The matter distribution of the Universe can be mapped through the weak gravitational lensing (WL) effect: small distortions of the shapes of distant galaxies, which reflects the inhomogeneity of the cosmic density field. The most dominant contaminant in the WL effect is the shape noise; the signal is diluted due to the finite number of source galaxies. In order to explore the full potential of WL measurements, sharpening the signal by removing the shape noise from the observational data, i.e., WL denoising, is a pressing issue. Machine learning approaches, in particular, deep generative models, have proven effective at the WL denoising task. We implement a denoising model based on the diffusion model (DM) and conduct systematic in-depth comparisons with generative adversarial networks (GANs), which have been applied in previous works for WL denoising. Utilizing the large suite of mock simulations of WL observations, we demonstrate that DM surpasses GAN in the WL denosing task in multiple aspects: (1) the training process is more stable, (2) taking the average of multiple samples from DM can robustly reproduce the true signal, and (3) DM can recover various statistics with higher accuracy.
