Estimating the persistent homology of $\mathbb{R}^n$-valued functions using function-geometric multifiltrations
Ethan André, Jingyi Li, David Loiseaux, Steve Oudot
TL;DR
This work extends the scalar persistent-homology estimation framework to vector-valued functions by developing function-geometric multifiltrations and proving that the main estimator $H_*(\mathcal{R}^{\delta\to 2\delta}(\mathscr{f}|_P))$ remains an $\omega(2\delta)$-approximation of the target $H_*(\mathscr{f})$ under regularity assumptions. It introduces a fixed-radius, a varying-radius, and a Kan-extension-based multi-parameter approach to estimate the full $n$-parameter persistence of vector-valued functions, with robust noise tolerances and statistical convergence guarantees. The authors provide an algorithm to compute presentations of the image of morphisms between persistence modules, enabling practical computation of the estimators and their invariants (e.g., multigraded Betti numbers), and implement it in the multipers library. They establish consistency and (quasi-)minimax convergence rates under standard sampling models, including both known and unknown regularity of the sampling measure, and demonstrate the methods on synthetic and real biological data. The results offer a principled, scalable path for reliable multi-parameter persistent-homology estimation in high-dimensional, noisy settings, with direct applicability to biological and geometric data analysis.
Abstract
Given an unknown $\mathbb{R}^n$-valued function $f$ on a metric space $X$, can we approximate the persistent homology of $f$ from a finite sampling of $X$ with known pairwise distances and function values? This question has been answered in the case $n=1$, assuming $f$ is Lipschitz continuous and $X$ is a sufficiently regular geodesic metric space, and using filtered geometric complexes with fixed scale parameter for the approximation. In this paper we answer the question for arbitrary $n$, under similar assumptions and using function-geometric multifiltrations. Our analysis offers a different view on these multifiltrations by focusing on their approximation properties rather than on their stability properties. We also leverage the multiparameter setting to provide insight into the influence of the scale parameter, whose choice is central to this type of approach. From a practical standpoint, we show that our approximation results are robust to input noise, and that function-geometric multifiltrations have good statistical convergence properties. We also provide an algorithm to compute our estimators, and we use its implementation to conduct extensive experiments, on both synthetic and real biological data, in order to validate our theoretical results and to assess the practicality of our approach.
