Hybrid summary statistics: neural weak lensing inference beyond the power spectrum
T. Lucas Makinen, Alan Heavens, Natalia Porqueres, Tom Charnock, Axel Lapel, Benjamin D. Wandelt
TL;DR
This work addresses the information loss inherent in relying solely on two-point weak lensing statistics by introducing a hybrid approach that augments physics-based summaries with neural summaries optimized to add information beyond the power spectrum. Building on IMNN, the method learns neural compressions that complement an existing statistic, here the tomographic angular power spectrum, using an information-update Fisher framework. The authors implement a lightweight, physically-informed neural network with multipole kernel embeddings to produce additional summaries, and validate the approach with tomographic convergence maps across multiple resolutions and noise levels. Across both low- and high-noise regimes, the hybrid statistics achieve substantial gains in Fisher information (ranging from roughly 3× to over 8×) and yield tighter cosmological constraints, particularly on $oldsymbol{ heta}=(oldsymbol{ m \Omega_m}, S_8)$. The framework is simulation-based, scalable, and broadly applicable to other datasets, offering a path toward more efficient, interpretable implicit inferences for large-scale structure surveys.
Abstract
In inference problems, we often have domain knowledge which allows us to define summary statistics that capture most of the information content in a dataset. In this paper, we present a hybrid approach, where such physics-based summaries are augmented by a set of compressed neural summary statistics that are optimised to extract the extra information that is not captured by the predefined summaries. The resulting statistics are very powerful inputs to simulation-based or implicit inference of model parameters. We apply this generalisation of Information Maximising Neural Networks (IMNNs) to parameter constraints from tomographic weak gravitational lensing convergence maps to find summary statistics that are explicitly optimised to complement angular power spectrum estimates. We study several dark matter simulation resolutions in low- and high-noise regimes. We show that i) the information-update formalism extracts at least $3\times$ and up to $8\times$ as much information as the angular power spectrum in all noise regimes, ii) the network summaries are highly complementary to existing 2-point summaries, and iii) our formalism allows for networks with smaller, physically-informed architectures to match much larger regression networks with far fewer simulations needed to obtain asymptotically optimal inference.
