Manifold Diffusion Geometry: Curvature, Tangent Spaces, and Dimension
Iolo Jones
TL;DR
This work addresses robust geometric inference from finite, noisy samples that lie near manifolds. It develops diffusion-geometry–based estimators for the Laplacian, carré du champ, tangent spaces, pointwise and global dimension, and curvature tensors (Riemann, Ricci, scalar), using diffusion maps with variable bandwidth kernels. The approach yields parameter-free, noise-robust estimators that perform comparably to state-of-the-art on clean data but significantly outperform competitors under noise or sparsity, including improved dimension estimates and feasible curvature estimation. By grounding geometric quantities in diffusion geometry and the second fundamental form, the method enables reliable geometric ML on real-world, imperfect data and suggests directions for extending these tools to non-manifold data and higher-dimensional settings.
Abstract
We introduce novel estimators for computing the curvature, tangent spaces, and dimension of data from manifolds, using tools from diffusion geometry. Although classical Riemannian geometry is a rich source of inspiration for geometric data analysis and machine learning, it has historically been hard to implement these methods in a way that performs well statistically. Diffusion geometry lets us develop Riemannian geometry methods that are accurate and, crucially, also extremely robust to noise and low-density data. The methods we introduce here are comparable to the existing state-of-the-art on ideal dense, noise-free data, but significantly outperform them in the presence of noise or sparsity. In particular, our dimension estimate improves on the existing methods on a challenging benchmark test when even a small amount of noise is added. Our tangent space and scalar curvature estimates do not require parameter selection and substantially improve on existing techniques.
