Inferring manifolds using Gaussian processes
David B Dunson, Nan Wu
TL;DR
Inferring manifolds from high-dimensional noisy data is addressed by MrGap, a local regression framework that replaces global manifold reconstruction with Gaussian process regression guided by the local covariance structure. The method builds local charts from the leading eigenvectors of a local covariance matrix, denoise points via GP regression, and interpolate new points using local parameterizations of the manifold, yielding probabilistic manifold reconstructions. Theoretical analysis provides bias/variance bounds for the local covariance under Gaussian noise, chart construction guarantees, and convergence results for the interpolation process, while numerical experiments demonstrate accurate denoising and smooth interpolation on Cassini Oval, a torus, and real bird vocalization data. The approach relaxes restrictive distributional assumptions and offers a practical pipeline for denoising and interpolating data near unknown manifolds with potential ecological and spectrogram applications.
Abstract
It is often of interest to infer lower-dimensional structure underlying complex data. As a flexible class of non-linear structures, it is common to focus on Riemannian manifolds. Most existing manifold learning algorithms replace the original data with lower-dimensional coordinates without providing an estimate of the manifold or using the manifold to denoise the original data. This article proposes a new methodology to address these problems, allowing interpolation of the estimated manifold between the fitted data points. The proposed approach is motivated by the novel theoretical properties of local covariance matrices constructed from samples near a manifold. Our results enable us to turn a global manifold reconstruction problem into a local regression problem, allowing for the application of Gaussian processes for probabilistic manifold reconstruction. In addition to the theory justifying our methodology, we provide simulated and real data examples to illustrate the performance.
