Noncommutative Model Selection and the Data-Driven Estimation of Real Cohomology Groups
Araceli Guzmán-Tristán, Antonio Rieser, Eduardo Velázquez-Richards
TL;DR
This work tackles the challenge of estimating the real cohomology groups $H^q(X;\mathbb{R})$ of a compact metric-measure space from finite, uniformly sampled points by introducing three data-driven methods that translate topology estimation into the analytic problem of fitting heat semigroups generated by combinatorial Hodge-Laplacians on weighted Vietoris-Rips models. The authors leverage model-selection-inspired criteria, notably relative von Neumann entropy, plus two alternative distance measures, to identify a scale $\hat{r}$ at which the $q$-dimensional spectral information best reveals the invariant, then recover $H^q(X;\mathbb{R})$ from the kernel of $\Delta_{q,\hat{r}}$. Empirical tests on datasets including points on a circle, two circles, and nonuniform two-circle configurations show strong performance for the entropy- and trace-based criteria, providing the first fully data-driven Betti-number estimations in a uniform-sampling setting and signaling directions for extending to broader distributions and convergence analysis. The approach offers a novel, nonpersistent, operator-theoretic pathway for topology-aware data analysis with potential computational advantages and opens avenues in noncommutative statistics for topological inference from finite samples.
Abstract
We propose three completely data-driven methods for estimating the real cohomology groups $H^k (X ; \mathbb{R})$ of a compact metric-measure space $(X, d_X, μ_X)$ embedded in a metric-measure space $(Y,d_Y,μ_Y)$, given a finite set of points $S$ sampled from a uniform distrbution $μ_X$ on $X$, possibly corrupted with noise from $Y$. We present the results of several computational experiments in the case that $X$ is embedded in $\mathbb{R}^n$, where two of the three algorithms performed well.
