Persistent homology detects curvature
Peter Bubenik, Michael Hull, Dhruv Patel, Benjamin Whittle
TL;DR
This work investigates how persistent homology, traditionally viewed as a topological proxy, encodes geometry such as curvature. It develops a framework that uses short persistence intervals and the average persistence landscape to solve inverse problems, notably estimating curvature from samples on disks of constant curvature $K\in[-2,2]$. Theoretical results show that equilateral triangles maximize persistence and that the persistence for equilateral triangles yields invertible dependence on $K$, while a continuous map from metric-measure spaces to $L^2(\mathbb{N}\times\mathbb{R})$ enables learning curvature from data. Computational demonstrations with Euclidean, spherical, and hyperbolic disks show accurate curvature estimation using supervised and unsupervised methods, and robustness to monotone-distance transforms via ordinal data. Overall, the paper provides a principled pipeline marrying persistent homology with statistical learning to extract geometric information from topological summaries and to address inverse problems in geometry.
Abstract
In topological data analysis, persistent homology is used to study the "shape of data". Persistent homology computations are completely characterized by a set of intervals called a bar code. It is often said that the long intervals represent the "topological signal" and the short intervals represent "noise". We give evidence to dispute this thesis, showing that the short intervals encode geometric information. Specifically, we prove that persistent homology detects the curvature of disks from which points have been sampled. We describe a general computational framework for solving inverse problems using the average persistence landscape, a continuous mapping from metric spaces with a probability measure to a Hilbert space. In the present application, the average persistence landscapes of points sampled from disks of constant curvature results in a path in this Hilbert space which may be learned using standard tools from statistical and machine learning.
