Graph as Point Set
Xiyuan Wang, Pan Li, Muhan Zhang
TL;DR
This work reframes graph representation learning by mapping graphs to point sets via Symmetric Rank Decomposition of the augmented adjacency $L=D+A$, enabling permutation- and orthogonal-invariant set encoders to operate on graphs. The core contributions include PSRD coordinates and the Point Set Transformer (PST), which together achieve strong long-range and short-range expressivity, surpassing several GNNs and graph transformers on substructure counting and molecular-property benchmarks, while unifying various structural encodings under a principled, lossless framework. An alternative DeepSet-based encoder (PSDS) demonstrates the versatility of the approach. While offering substantial gains, the method retains scalability limitations inherent to Transformer-style attention on large graphs, suggesting future refinements such as sparse or linear attention to broaden applicability.
Abstract
Graph is a fundamental data structure to model interconnections between entities. Set, on the contrary, stores independent elements. To learn graph representations, current Graph Neural Networks (GNNs) primarily use message passing to encode the interconnections. In contrast, this paper introduces a novel graph-to-set conversion method that bijectively transforms interconnected nodes into a set of independent points and then uses a set encoder to learn the graph representation. This conversion method holds dual significance. Firstly, it enables using set encoders to learn from graphs, thereby significantly expanding the design space of GNNs. Secondly, for Transformer, a specific set encoder, we provide a novel and principled approach to inject graph information losslessly, different from all the heuristic structural/positional encoding methods adopted in previous graph transformers. To demonstrate the effectiveness of our approach, we introduce Point Set Transformer (PST), a transformer architecture that accepts a point set converted from a graph as input. Theoretically, PST exhibits superior expressivity for both short-range substructure counting and long-range shortest path distance tasks compared to existing GNNs. Extensive experiments further validate PST's outstanding real-world performance. Besides Transformer, we also devise a Deepset-based set encoder, which achieves performance comparable to representative GNNs, affirming the versatility of our graph-to-set method.
