Cosmo3DFlow: Wavelet Flow Matching for Spatial-to-Spectral Compression in Reconstructing the Early Universe
Md. Khairul Islam, Zeyu Xia, Ryan Goudjil, Jialu Wang, Arya Farahi, Judy Fox
TL;DR
Cosmo3DFlow tackles the high-dimensional cosmological initial-condition inference problem by marrying a 3D Discrete Wavelet Transform with flow matching in a multi-resolution 3D U-Net. By performing velocity-field learning in wavelet space and optionally enforcing a physically informed power spectrum, the method achieves deterministic, fast sampling with substantially reduced computation compared to diffusion baselines. It demonstrates up to 50x faster sampling at $128^3$ resolution while maintaining or improving fidelity across VRMSE, cross-correlation, and power-spectrum metrics on Quijote-derived datasets. The approach concentrates computation on information-dense structures (filaments and halos) through wavelet sparsity, enabling scalable inference for larger cosmological volumes.
Abstract
Reconstructing the early Universe from the evolved present-day Universe is a challenging and computationally demanding problem in modern astrophysics. We devise a novel generative framework, Cosmo3DFlow, designed to address dimensionality and sparsity, the critical bottlenecks inherent in current state-of-the-art methods for cosmological inference. By integrating 3D Discrete Wavelet Transform (DWT) with flow matching, we effectively represent high-dimensional cosmological structures. The Wavelet Transform addresses the ``void problem'' by translating spatial emptiness into spectral sparsity. It decouples high-frequency details from low-frequency structures through spatial compression, and wavelet-space velocity fields facilitate stable ordinary differential equation (ODE) solvers with large step sizes. Using large-scale cosmological $N$-body simulations, at $128^3$ resolution, we achieve up to $50\times$ faster sampling than diffusion models, combining a $10\times$ reduction in integration steps with lower per-step computational cost from wavelet compression. Our results enable initial conditions to be sampled in seconds, compared to minutes for previous methods.
