Preference-driven Knowledge Distillation for Few-shot Node Classification

Xing Wei; Chunchun Chen; Rui Fan; Xiaofeng Cao; Sourav Medya; Wei Ye

Preference-driven Knowledge Distillation for Few-shot Node Classification

Xing Wei, Chunchun Chen, Rui Fan, Xiaofeng Cao, Sourav Medya, Wei Ye

TL;DR

This work tackles the challenge of few-shot node classification on text-attributed graphs by uniting large language models (LLMs) with diverse graph neural networks (GNNs) through a novel preference-driven knowledge distillation (PKD) framework. PKD introduces two specialized selectors: GNN-preference-driven Node Selector (GNS) to identify LLM-annotated nodes that most benefit teacher GNNs, and Node-preference-driven GNN Selector (NGS) to tailor per-node teacher choices via reinforcement learning. A Graph Topology Aware (GTA) prompting strategy enhances LLM understanding of graph structure, while a Distance-based Neighbor Selection (DNS) enriches prompts with robust neighborhood information. Through extensive experiments on nine real-world TAGs and multiple LLM variants, PKD achieves state-of-the-art or competitive performance in zero-/few-shot settings, highlighting the benefits of combining LLM reasoning with targeted, node-specific distillation from multiple GNN teachers. The approach advances scalable, label-efficient learning for complex graph-structured text data and points to future work on efficiency and broader applicability beyond TAGs.

Abstract

Graph neural networks (GNNs) can efficiently process text-attributed graphs (TAGs) due to their message-passing mechanisms, but their training heavily relies on the human-annotated labels. Moreover, the complex and diverse local topologies of nodes of real-world TAGs make it challenging for a single mechanism to handle. Large language models (LLMs) perform well in zero-/few-shot learning on TAGs but suffer from a scalability challenge. Therefore, we propose a preference-driven knowledge distillation (PKD) framework to synergize the complementary strengths of LLMs and various GNNs for few-shot node classification. Specifically, we develop a GNN-preference-driven node selector that effectively promotes prediction distillation from LLMs to teacher GNNs. To further tackle nodes' intricate local topologies, we develop a node-preference-driven GNN selector that identifies the most suitable teacher GNN for each node, thereby facilitating tailored knowledge distillation from teacher GNNs to the student GNN. Extensive experiments validate the efficacy of our proposed framework in few-shot node classification on real-world TAGs. Our code is be available.

Preference-driven Knowledge Distillation for Few-shot Node Classification

TL;DR

Abstract

Preference-driven Knowledge Distillation for Few-shot Node Classification

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (7)

Theorems & Definitions (3)