Coherent Source Subsampling: A Data-Driven Strategy for Restoring Causal-Acausal Symmetry in Ambient Seismic Wavefield Correlations
Sanket Narayan Bajad, Pawan Bharadwaj
TL;DR
Ambient-noise cross-correlations often suffer from biased Green's-function estimates due to nonuniform and time-varying source distributions, breaking causal–acausal symmetry. The authors introduce Coherent Source Subsampling (CSS), a data-driven conditioning framework that uses a discrete latent state $\lambda$ learned by a variational autoencoder to identify stationary-zone sources and compute conditional averages over coherent windows. By averaging only windows with high probability of belonging to the stationary state ($\lambda=1$) and branch-specific coherence, CSS substantially restores symmetry between causal and acausal components and yields more stable Rayleigh-wave dispersion curves, aligning with earthquake-based references for several station pairs. The approach reduces data requirements (typically 10–50% of windows) and is scalable to dense networks and DAS, offering a physically interpretable link between source statistics and noise-based Green's-function retrieval with broad potential for robust ambient-noise tomography in challenging environments.
Abstract
Ambient noise tomography relies on the assumption that the seismic wavefield is equipartitioned, meaning that energy is uniformly distributed among all directions. However, in practice, ambient noise sources are highly non-uniform in both spatial and temporal dimensions, resulting in biased estimation of the Green's function between stations. We introduce a data-driven method, Coherent Source Subsampling (CSS), which selects and averages only those cross-correlation time windows that are associated with the excitation of sources in the stationary-zone. By confining the ensemble average to these coherent subsets, CSS effectively mitigates the influence of anisotropic or intermittent sources and restores causal-acausal symmetry in the retrieved Green's functions. Applications to regional-scale ambient noise datasets demonstrate that CSS boosts inter-station coherence and enhances the reliability of surface-wave dispersion measurements, providing a physically interpretable bridge between source statistics and noise correlation theory.
