Class-Imbalanced Graph Learning without Class Rebalancing
Zhining Liu, Ruizhong Qiu, Zhichen Zeng, Hyunsik Yoo, David Zhou, Zhe Xu, Yada Zhu, Kommy Weldemariam, Jingrui He, Hanghang Tong
TL;DR
This work tackles class-imbalanced graph learning by revealing topological causes of minority bias: ambivalent message passing (AMP) and distant message passing (DMP). It introduces BAT, a lightweight, model-agnostic topological augmentation that identifies high-risk nodes through uncertainty and posterior likelihoods and expands their context with virtual class nodes, independent of class rebalancing. The authors provide theoretical results showing the minority class is more susceptible to AMP/DMP, with biases that grow with the imbalance ratio $\rho$, and demonstrate that BAT can markedly improve performance and reduce bias across diverse graphs and GNN backbones. Empirical results show BAT delivers consistent gains (up to 46.27\% in accuracy and up to 72.74\% in bias reduction) while maintaining efficiency, validating its practical utility as a complementary tool to CR techniques.
Abstract
Class imbalance is prevalent in real-world node classification tasks and poses great challenges for graph learning models. Most existing studies are rooted in a class-rebalancing (CR) perspective and address class imbalance with class-wise reweighting or resampling. In this work, we approach the root cause of class-imbalance bias from an topological paradigm. Specifically, we theoretically reveal two fundamental phenomena in the graph topology that greatly exacerbate the predictive bias stemming from class imbalance. On this basis, we devise a lightweight topological augmentation framework BAT to mitigate the class-imbalance bias without class rebalancing. Being orthogonal to CR, BAT can function as an efficient plug-and-play module that can be seamlessly combined with and significantly boost existing CR techniques. Systematic experiments on real-world imbalanced graph learning tasks show that BAT can deliver up to 46.27% performance gain and up to 72.74% bias reduction over existing techniques. Code, examples, and documentations are available at https://github.com/ZhiningLiu1998/BAT.
