Weisfeiler and Lehman Go Categorical
Seongjin Choi, Gahee Kim, Se-Young Yun
TL;DR
This work introduces the CatWL framework to unify higher-order lifting across data types by lifting objects into graded posets via functors, thereby enabling isomorphism-aware learning beyond standard WL. It formalizes a graded-WL refinement and shows how to instantiate neural architectures, termed F-CatMPN, through two principal functors for hypergraphs: the Incidence Poset Functor $I$ and the Symmetric Simplicial Complex Functor $S$. The authors prove that both $I$-CatWL and $S$-CatWL are not less powerful than HWL, and they validate these claims with extensive experiments on six real-world hypergraph benchmarks, finding that $I$-HIN excels on dense topologies while $S$-HIN shines on structure-rich, sparse hypergraphs. The results demonstrate the practical impact of category-theoretic lifting as a design principle for expressive, geometry-aware hypergraph learning, providing a scalable pathway to unify higher-order graph representations across modalities.
Abstract
While lifting map has significantly enhanced the expressivity of graph neural networks, extending this paradigm to hypergraphs remains fragmented. To address this, we introduce the categorical Weisfeiler-Lehman framework, which formalizes lifting as a functorial mapping from an arbitrary data category to the unifying category of graded posets. When applied to hypergraphs, this perspective allows us to systematically derive Hypergraph Isomorphism Networks, a family of neural architectures where the message passing topology is strictly determined by the choice of functor. We introduce two distinct functors from the category of hypergraphs: an incidence functor and a symmetric simplicial complex functor. While the incidence architecture structurally mirrors standard bipartite schemes, our functorial derivation enforces a richer information flow over the resulting poset, capturing complex intersection geometries often missed by existing methods. We theoretically characterize the expressivity of these models, proving that both the incidence-based and symmetric simplicial approaches subsume the expressive power of the standard Hypergraph Weisfeiler-Lehman test. Extensive experiments on real-world benchmarks validate these theoretical findings.
