Stabilizing simulation-based cosmological Fisher forecasts: a case study using the Voronoi volume function
Saee Dhawalikar, Aseem Paranjape, Shadab Alam
TL;DR
This work tackles the instability of derivative estimates in Fisher forecasts for halo-based cosmological statistics by introducing a two-step framework: random sub-sampling to stabilize noisy statistics and an optimization to select a stable, information-rich subset of data points. The authors demonstrate the method on the halo mass function and the Voronoi volume function across two N-body suites (Sinhagad and Sahyadri), showing up to a factor of ~4 improvement in constraining power and substantially better forecast stability across realizations. By defining quantitative metrics for derivative accuracy, information content, and cross-realization stability, and by using KL divergence to evaluate subset performance, the approach yields robust forecasts even with limited realizations. The framework is general and applicable to any statistic with noisy derivatives, offering a practical path to reliable, next-generation cosmological inferences for surveys like Euclid, DESI, and LSST.
Abstract
Forecasting cosmological constraints from halo-based statistics often suffers from instability in derivative estimates, especially when the number of simulations is limited. This instability reduces the reliability of Fisher forecasts and machine learning based approaches that use derivatives. We introduce a general framework that addresses this challenge by stabilizing the input statistic and then systematically identifying the optimal subset of summary statistics that maximizes cosmological information while simultaneously minimizing the instability of predicted constraints. We demonstrate this framework using the halo mass function as well as the Voronoi volume function (VVF), a summary statistic that captures beyond two-point clustering information. Applying our two-step procedure -- random sub-sampling followed by optimization -- improves the constraining power by up to a factor of 4, while also enhancing the stability of the forecasts across realizations. As surveys like Euclid, DESI, and LSST push toward tighter constraints, the ability to produce stable and accurate theoretical predictions is essential. Our results suggest that new summary statistics such as the VVF, combined with careful data curation and stabilization strategies, can play a key role in next-generation precision cosmology.
