Fast nonparametric spectral density estimation from irregularly sampled data
Christopher J. Geoga, Paul G. Beckman
TL;DR
The paper tackles nonparametric spectral density estimation for processes observed at fully irregular locations by introducing a weighted nonuniform Fourier estimator hat{S}(ξ) whose weights are designed so that H_{α}(ω) closely matches the Fourier transform G(ω) of a window g. This quadrature-based design reduces aliasing, avoids ill-conditioning inherent in nonuniform inverse problems, and remains scalable via NUFFT-based computations; the authors provide a rigorous bias analysis and demonstrate effectiveness on large-scale 1D and 2D data, including up to a million observation points. A key contribution is the explicit framework for window selection (favoring highly concentrated prolate/Slepian functions with Kaiser as a practical alternative), together with a method to compute weights α by solving a linear system with Chebyshev-based sampling, supported by stability and rate guarantees under oversampling. The approach yields substantial improvements over Lomb-Scargle and related methods, as evidenced by theoretical aliasing bounds and numerical demonstrations, highlighting its potential for accurate, scalable spectral analysis in irregularly sampled, high-dimensional settings.
Abstract
We introduce a nonparametric spectral density estimator for continuous-time and continuous-space processes measured at fully irregular locations. Our estimator is constructed using a weighted nonuniform Fourier sum whose weights yield a high-accuracy quadrature rule with respect to a user-specified window function. The resulting estimator significantly reduces the aliasing seen in periodogram approaches and least squares spectral analysis, sidesteps the dangers of ill-conditioning of the nonuniform Fourier inverse problem, and can be adapted to a wide variety of irregular sampling settings. We describe methods for rapidly computing the necessary weights in various settings, making the estimator scalable to large datasets. We then provide a theoretical analysis of sources of bias, and close with demonstrations of the method's efficacy, including for processes that exhibit very slow spectral decay and are observed at up to a million locations in multiple dimensions.
