On distance and velocity estimation in cosmology
Adi Nusser
TL;DR
The paper separates two Malmquist biases arising when reconstructing peculiar velocity fields from distance indicators: distance Malmquist bias ($dMB$) for individual distances and velocity Malmquist bias ($vMB$) that corrupts continuous velocity fields when using inferred positions. Using the Tully–Fisher relation as a concrete example, it shows that unbiased per-object velocities do not guarantee unbiased velocity fields and that even bias-corrected distances leave $vMB$ intact; the authors advocate placing galaxies at redshift coordinates ($s=cz/H_0$) for robust large-scale reconstructions, with a practical residual bias only on small scales set by $\sigma_v/H_0$. A modified Wiener filter is developed to marginalize over $P(r|d)$, but while it removes systematic bias, it reduces amplitude, and machine learning approaches converge to the Wiener filter in the Gaussian limit, suggesting limited gains. Overall, redshift-space placement emerges as the most reliable strategy for velocity-field reconstruction in typical surveys, with implications for velocity–gravity comparisons and growth-rate measurements, and the FP framework is discussed as an analogous distance-indicator context.
Abstract
Scatter in distance indicators introduces two conceptually distinct systematic biases when reconstructing peculiar velocity fields from redshifts and distances. The first is distance Malmquist bias (dMB) that affects individual distance estimates and can in principle be approximately corrected. The second is velocity Malmquist bias (vMB) that arises when constructing continuous velocity fields from scattered distance measurements: random scatter places galaxies at noisy spatial positions, introducing spurious velocity gradients that persist even when distances are corrected for dMB. Considering the Tully-Fisher relation as a concrete example, both inverse and forward formulations yield unbiased individual peculiar velocities for galaxies with the same true distance (the forward relation requires a selection-dependent correction), but neither eliminates vMB when galaxies are placed at their inferred distances. We develop a modified Wiener filter that properly encodes correlations between directly observed distance $d$ and true distance $r$ through the conditional probability $P(r|d)$, accounting for the distribution of true distances sampled by galaxies at observed distance $d$. Nonetheless, this modified filter yields suppressed amplitude estimates. Since machine learning autoencoders converge to the Wiener filter for Gaussian fields, they are unlikely to significantly improve velocity field estimation. We therefore argue that optimal reconstruction places galaxies at their observed redshifts rather than inferred distances; an approach effective when distance errors exceed $σ_v/H_0$, a condition satisfied for most galaxies in typical surveys beyond the nearby volume.
