Automated Loss function Search for Class-imbalanced Node Classification
Xinyu Guo, Kai Wu, Xiaoyu Zhang, Jing Liu
TL;DR
AutoLINC introduces an automated loss-function search framework for class-imbalanced node classification, leveraging Monte Carlo Tree Search to explore a CFG-based loss expression space and a loss-check strategy to prune ineffective candidates. By encoding the loss as a parse tree with inputs $\hat{y}$, $y$, and class counts $N$, and evaluating candidates via a lightweight proxy task, AutoLINC discovers losses that outperform state-of-the-art hand-crafted losses across multiple GNN backbones and datasets. The approach demonstrates strong performance, transferability among homogeneous graph types, and notable speedups from its pruning strategies, while remaining adaptable to different graph domains and imbalance levels. The work highlights the importance of homophily for transferability and shows that combining loss-search with graph-aware baselines like GraphSHA yields scalable gains in imbalanced node classification.
Abstract
Class-imbalanced node classification tasks are prevalent in real-world scenarios. Due to the uneven distribution of nodes across different classes, learning high-quality node representations remains a challenging endeavor. The engineering of loss functions has shown promising potential in addressing this issue. It involves the meticulous design of loss functions, utilizing information about the quantities of nodes in different categories and the network's topology to learn unbiased node representations. However, the design of these loss functions heavily relies on human expert knowledge and exhibits limited adaptability to specific target tasks. In this paper, we introduce a high-performance, flexible, and generalizable automated loss function search framework to tackle this challenge. Across 15 combinations of graph neural networks and datasets, our framework achieves a significant improvement in performance compared to state-of-the-art methods. Additionally, we observe that homophily in graph-structured data significantly contributes to the transferability of the proposed framework.
