Persistent Homology with Path-Representable Distances on Graph Data
Eunwoo Heo, Byeongchan Choi, Jae-Hun Jung
TL;DR
This work investigates how distance definitions on graphs influence persistent homology in topological data analysis. By introducing path-representable distances and focusing on cost-dominated cases, it proves a 1D persistence barcode injection between such distances, extending beyond the classical $d_{ ext{weight}}$ vs $d_{ ext{edge}}$ comparison. The authors connect distance choices to a structured framework using path choices, domination notions, and minimum spanning trees, and show the injection holds for 1D PH but not in higher dimensions through counterexamples and experiments. The results illuminate how alternative graph distances reveal or hide topological features, with potential implications for graph-based data analysis and applications requiring robust topological summaries at multiple scales.
Abstract
Topological data analysis over graph has been actively studied to understand the underlying topological structure of data. However, limited research has been conducted on how different distance definitions impact persistent homology and the corresponding topological inference. To address this, we introduce the concept of path-representable distance in a general form and prove the main theorem for the case of cost-dominated distances. We found that a particular injection exists among the $1$-dimensional persistence barcodes of these distances with a certain condition. We prove that such an injection relation exists for $0$- and $1$-dimensional homology. For higher dimensions, we provide the counterexamples that show such a relation does not exist.
