Explaining the Power of Topological Data Analysis in Graph Machine Learning
Funmilola Mary Taiwo, Umar Islambekov, Cuneyt Gurcan Akcora
TL;DR
This work rigorously evaluates the claims of Topological Data Analysis for graph machine learning by benchmarking persistence-based methods against Weisfeiler-Leman kernels and traditional graph features across nine graph-classification datasets. It finds that while TDA delivers robustness and interpretable insights, it does not consistently improve predictive accuracy and incurs substantial computational costs, especially for Vietoris-Rips filtrations. Betti-function vectorizations emerge as a relatively strong TDA signal, with most informative topology occurring at early filtration ranges, and a surrogate model plus Mapper analyses provide explainable avenues to harness TDA signals. The paper suggests using TDA as a complementary, interpretable layer within graph pipelines and emphasizes strategies to identify informative filtration ranges to enhance scalability and practical deployment.
Abstract
Topological Data Analysis (TDA) has been praised by researchers for its ability to capture intricate shapes and structures within data. TDA is considered robust in handling noisy and high-dimensional datasets, and its interpretability is believed to promote an intuitive understanding of model behavior. However, claims regarding the power and usefulness of TDA have only been partially tested in application domains where TDA-based models are compared to other graph machine learning approaches, such as graph neural networks. We meticulously test claims on TDA through a comprehensive set of experiments and validate their merits. Our results affirm TDA's robustness against outliers and its interpretability, aligning with proponents' arguments. However, we find that TDA does not significantly enhance the predictive power of existing methods in our specific experiments, while incurring significant computational costs. We investigate phenomena related to graph characteristics, such as small diameters and high clustering coefficients, to mitigate the computational expenses of TDA computations. Our results offer valuable perspectives on integrating TDA into graph machine learning tasks.
