$δ$-core subsampling, strong collapses and TDA
Elias Gabriel Minian
TL;DR
Persistent homology on large Vietoris-Rips filtrations is computationally demanding. The paper introduces the $δ$-core subsampling, a topology-preserving reduction based on strong collapses that removes dominated points to dramatically reduce the number of simplices while keeping global and local features. It provides formal development, uniqueness up to $δ$-equivalence, and complexity bounds, plus extensive experiments showing improved persistence diagram fidelity and substantial speedups over FPS and witness-based subsampling, with public code. This approach makes topology-aware PH scalable to large and heterogeneous datasets, enhancing the practical impact of topological data analysis.
Abstract
We introduce a subsampling method for topological data analysis based on strong collapses of simplicial complexes. Given a point cloud and a scale parameter $δ$, we construct a subsampling that preserves both global and local topological features while significantly reducing computational complexity of persistent homology calculations. We illustrate the effectiveness of our approach through experiments on synthetic and real datasets, showing improved persistence approximations compared to other subsampling techniques.
