Persistent reachability homology in machine learning applications
Luigi Caputi, Nicholas Meadows, Henri Riihimäki
TL;DR
The paper investigates persistent reachability homology (PRH) as a condensation-based, topology-driven feature extractor for directed graphs and applies it to epileptic seizure detection from EEG-derived networks. PRH computes homology on the reachability poset obtained after condensing strongly connected components, linking to Hochschild cohomology via $\mathrm{HH}^i(k\mathcal{R}(G))$ and enabling faster computations than the directed flag complex. Through a pipeline using Betti curves and their integrals as features in support vector machines, PRH generally outperforms the directed flag complex approach in 7 of 8 model comparisons, with best accuracies around $82\%$ on a dataset of 100 recordings from 16 patients. The results highlight that PRH captures complementary structural information and that combining multiple homology theories can enhance TDA-based machine learning on digraph-structured data.
Abstract
We explore the recently introduced persistent reachability homology (PRH) of digraph data, i.e. data in the form of directed graphs. In particular, we study the effectiveness of PRH in network classification task in a key neuroscience problem: epilepsy detection. PRH is a variation of the persistent homology of digraphs, more traditionally based on the directed flag complex (DPH). A main advantage of PRH is that it considers the condensations of the digraphs appearing in the persistent filtration and thus is computed from smaller digraphs. We compare the effectiveness of PRH to that of DPH and we show that PRH outperforms DPH in the classification task. We use the Betti curves and their integrals as topological features and implement our pipeline on support vector machine.
