Fully-inductive Node Classification on Arbitrary Graphs
Jianan Zhao, Zhaocheng Zhu, Mikhail Galkin, Hesham Mostafa, Michael Bronstein, Jian Tang
TL;DR
The paper tackles the challenge of fully-inductive node classification, where test graphs can have new structures and entirely new feature and label spaces. It introduces GraphAny, combining analytical LinearGNNs with a learnable inductive attention mechanism based on entropy-normalized distance features to generalize across arbitrary graphs. The approach yields strong cross-graph generalization, training only the attention module while performing inference with non-parametric LinearGNNs, achieving a 67.26% average accuracy on 30 unseen graphs and substantial speedups over per-dataset transductive baselines. This work lays a foundation for cross-domain graph generalization and could inform future graph foundation-model developments by emphasizing permutation-invariance and robust dimension generalization.
Abstract
One fundamental challenge in graph machine learning is generalizing to new graphs. Many existing methods following the inductive setup can generalize to test graphs with new structures, but assuming the feature and label spaces remain the same as the training ones. This paper introduces a fully-inductive setup, where models should perform inference on arbitrary test graphs with new structures, feature and label spaces. We propose GraphAny as the first attempt at this challenging setup. GraphAny models inference on a new graph as an analytical solution to a LinearGNN, which can be naturally applied to graphs with any feature and label spaces. To further build a stronger model with learning capacity, we fuse multiple LinearGNN predictions with learned inductive attention scores. Specifically, the attention module is carefully parameterized as a function of the entropy-normalized distance features between pairs of LinearGNN predictions to ensure generalization to new graphs. Empirically, GraphAny trained on a single Wisconsin dataset with only 120 labeled nodes can generalize to 30 new graphs with an average accuracy of 67.26%, surpassing not only all inductive baselines, but also strong transductive methods trained separately on each of the 30 test graphs.
