Topological Blindspots: Understanding and Extending Topological Deep Learning Through the Lens of Expressivity
Yam Eitan, Yoav Gelberg, Guy Bar-Shalom, Fabrizio Frasca, Michael Bronstein, Haggai Maron
TL;DR
This work analyzes the expressivity of Higher-Order Message-Passing (HOMP) in Topological Deep Learning (TDL) from a topological perspective, proving that HOMP cannot distinguish CCs by fundamental invariants such as diameter, orientability, planarity, or homology. It introduces Multi-Cellular Networks (MCN) to achieve full expressivity and Scalable MCN (SMCN) to mitigate computational costs, showing that SMCN can outperform HOMP and expressive graph methods on benchmarks designed to test topological learning. The authors provide a topological indistinguishability criterion based on covering spaces, and demonstrate lifting/pooling can create CCs indistinguishable by HOMP yet differing in topology. Empirically, SMCN achieves superior performance on synthetic torus-based benchmarks and real-world lifted Zinc graphs, with improved capabilities in learning topological properties and diameters, while incurring modest runtime overhead. The results underscore the value of explicitly leveraging topological information in higher-order architectures for robust topological and metric reasoning in graphs and complexes.
Abstract
Topological deep learning (TDL) is a rapidly growing field that seeks to leverage topological structure in data and facilitate learning from data supported on topological objects, ranging from molecules to 3D shapes. Most TDL architectures can be unified under the framework of higher-order message-passing (HOMP), which generalizes graph message-passing to higher-order domains. In the first part of the paper, we explore HOMP's expressive power from a topological perspective, demonstrating the framework's inability to capture fundamental topological and metric invariants such as diameter, orientability, planarity, and homology. In addition, we demonstrate HOMP's limitations in fully leveraging lifting and pooling methods on graphs. To the best of our knowledge, this is the first work to study the expressivity of TDL from a \emph{topological} perspective. In the second part of the paper, we develop two new classes of architectures -- multi-cellular networks (MCN) and scalable MCN (SMCN) -- which draw inspiration from expressive GNNs. MCN can reach full expressivity, but scaling it to large data objects can be computationally expansive. Designed as a more scalable alternative, SMCN still mitigates many of HOMP's expressivity limitations. Finally, we create new benchmarks for evaluating models based on their ability to learn topological properties of complexes. We then evaluate SMCN on these benchmarks and on real-world graph datasets, demonstrating improvements over both HOMP baselines and expressive graph methods, highlighting the value of expressively leveraging topological information. Code and data are available at https://github.com/yoavgelberg/SMCN.
