Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification

Zijie Zhou; Zhaoqi Lu; Xuekai Wei; Rongqin Chen; Shenghui Zhang; Pak Lon Ip; Leong Hou U

Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification

Zijie Zhou, Zhaoqi Lu, Xuekai Wei, Rongqin Chen, Shenghui Zhang, Pak Lon Ip, Leong Hou U

TL;DR

Tokenphormer tackles the limitations of traditional GNNs and graph Transformers by introducing a structure-aware, multi-token representation for nodes. It constructs diverse tokens—walk-token (four walk types), SGPM-token, and hop-token—through graph serialization and a pre-training phase (SGPM) to cover local and global structure, then jointly learns them with a Transformer and attention-based readout. The authors provide theoretical analysis showing graph documents can distinguish non-isomorphic graphs and that token coverage improves with more tokens, while experiments on six homogeneous and heterogeneous benchmarks demonstrate state-of-the-art performance on node classification. The approach offers a scalable, flexible framework for structure-aware graph learning with robust generalization across graph types.

Abstract

Graph Neural Networks (GNNs) are widely used in graph data mining tasks. Traditional GNNs follow a message passing scheme that can effectively utilize local and structural information. However, the phenomena of over-smoothing and over-squashing limit the receptive field in message passing processes. Graph Transformers were introduced to address these issues, achieving a global receptive field but suffering from the noise of irrelevant nodes and loss of structural information. Therefore, drawing inspiration from fine-grained token-based representation learning in Natural Language Processing (NLP), we propose the Structure-aware Multi-token Graph Transformer (Tokenphormer), which generates multiple tokens to effectively capture local and structural information and explore global information at different levels of granularity. Specifically, we first introduce the walk-token generated by mixed walks consisting of four walk types to explore the graph and capture structure and contextual information flexibly. To ensure local and global information coverage, we also introduce the SGPM-token (obtained through the Self-supervised Graph Pre-train Model, SGPM) and the hop-token, extending the length and density limit of the walk-token, respectively. Finally, these expressive tokens are fed into the Transformer model to learn node representations collaboratively. Experimental results demonstrate that the capability of the proposed Tokenphormer can achieve state-of-the-art performance on node classification tasks.

Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification

TL;DR

Abstract

Tokenphormer: Structure-aware Multi-token Graph Transformer for Node Classification

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (7)