Table of Contents
Fetching ...

Dual-level Mixup for Graph Few-shot Learning with Fewer Tasks

Yonghao Liu, Mengyu Li, Fausto Giunchiglia, Lan Huang, Ximing Li, Xiaoyue Feng, Renchu Guan

TL;DR

This work tackles graph few-shot learning under the constraint of few meta-training tasks. It introduces SMILE, which combines degree-aware node representations with a dual-level mixup strategy (within-task and across-task) to enrich data and task distributions, backed by theoretical bounds that indicate reduced generalization error. Empirically, SMILE consistently outperforms baselines on in-domain and cross-domain tasks across multiple datasets, with ablations highlighting the importance of degree priors and task/sample augmentation. The approach is simple yet effective, offering practical improvements for graph-based learning when labeled data and task diversity are limited, and it provides a foundation for robust meta-learning in scarce-resource graph settings.

Abstract

Graph neural networks have been demonstrated as a powerful paradigm for effectively learning graph-structured data on the web and mining content from it.Current leading graph models require a large number of labeled samples for training, which unavoidably leads to overfitting in few-shot scenarios. Recent research has sought to alleviate this issue by simultaneously leveraging graph learning and meta-learning paradigms. However, these graph meta-learning models assume the availability of numerous meta-training tasks to learn transferable meta-knowledge. Such assumption may not be feasible in the real world due to the difficulty of constructing tasks and the substantial costs involved. Therefore, we propose a SiMple yet effectIve approach for graph few-shot Learning with fEwer tasks, named SMILE. We introduce a dual-level mixup strategy, encompassing both within-task and across-task mixup, to simultaneously enrich the available nodes and tasks in meta-learning. Moreover, we explicitly leverage the prior information provided by the node degrees in the graph to encode expressive node representations. Theoretically, we demonstrate that SMILE can enhance the model generalization ability. Empirically, SMILE consistently outperforms other competitive models by a large margin across all evaluated datasets with in-domain and cross-domain settings. Our anonymous code can be found here.

Dual-level Mixup for Graph Few-shot Learning with Fewer Tasks

TL;DR

This work tackles graph few-shot learning under the constraint of few meta-training tasks. It introduces SMILE, which combines degree-aware node representations with a dual-level mixup strategy (within-task and across-task) to enrich data and task distributions, backed by theoretical bounds that indicate reduced generalization error. Empirically, SMILE consistently outperforms baselines on in-domain and cross-domain tasks across multiple datasets, with ablations highlighting the importance of degree priors and task/sample augmentation. The approach is simple yet effective, offering practical improvements for graph-based learning when labeled data and task diversity are limited, and it provides a foundation for robust meta-learning in scarce-resource graph settings.

Abstract

Graph neural networks have been demonstrated as a powerful paradigm for effectively learning graph-structured data on the web and mining content from it.Current leading graph models require a large number of labeled samples for training, which unavoidably leads to overfitting in few-shot scenarios. Recent research has sought to alleviate this issue by simultaneously leveraging graph learning and meta-learning paradigms. However, these graph meta-learning models assume the availability of numerous meta-training tasks to learn transferable meta-knowledge. Such assumption may not be feasible in the real world due to the difficulty of constructing tasks and the substantial costs involved. Therefore, we propose a SiMple yet effectIve approach for graph few-shot Learning with fEwer tasks, named SMILE. We introduce a dual-level mixup strategy, encompassing both within-task and across-task mixup, to simultaneously enrich the available nodes and tasks in meta-learning. Moreover, we explicitly leverage the prior information provided by the node degrees in the graph to encode expressive node representations. Theoretically, we demonstrate that SMILE can enhance the model generalization ability. Empirically, SMILE consistently outperforms other competitive models by a large margin across all evaluated datasets with in-domain and cross-domain settings. Our anonymous code can be found here.

Paper Structure

This paper contains 23 sections, 3 theorems, 15 equations, 3 figures, 5 tables, 1 algorithm.

Key Result

Theorem 4.1

Assume that $\mathrm{X}, \mathrm{Y}$ and $\theta$ are bounded. For all $f\!\in\! \mathcal{F}_\nu$, where $\theta$ satisfies $\theta^\top\Sigma_\mathrm{X}\theta\!\leq\! \nu$, we have the following generalization gap bound, with probability at least $(1-\epsilon)$ over the training samples, where $m$ and $\mathrm{T}$ denote the number of nodes in the query set and the number of meta-training tasks.

Figures (3)

  • Figure 1: Model performance varies with the number of meta-training tasks across different datasets.
  • Figure 2: The overall architecture of SMILE.
  • Figure 3: Results vary with hyperparameters.

Theorems & Definitions (3)

  • Theorem 4.1
  • Corollary 4.2
  • Theorem 4.3