R-GAT: Cancer Document Classification Leveraging Graph-Based Residual Network for Scenarios with Limited Data
Elias Hossain, Tasfia Nuzhat, Shamsul Masum, Shahram Rahimi, Noorbakhsh Amiri Golilarz
TL;DR
This work tackles cancer-abstract classification under data and compute constraints. It introduces R-GAT, a residual graph attention network that leverages multi-head graph attention to model semantic and relational dependencies without relying on large-scale pretraining. Through systematic benchmarking against traditional ML, deep learning, and transformer baselines, the study demonstrates that R-GAT achieves competitive accuracy with reduced variance and computational cost, while also providing a publicly released dataset of 1,875 PubMed abstracts to support reproducibility. The results suggest that lightweight graph-based architectures can be robust and practical alternatives to transformers in data-limited biomedical NLP settings, broadening the methodological toolkit for cancer informatics.
Abstract
Accurate classification of cancer-related biomedical abstracts is critical for advancing cancer informatics and supporting decision-making in healthcare research. Yet progress in this domain is often constrained by limited availability of labeled corpora and the high computational demands of transformer-based approaches. To address these challenges, we propose a Residual Graph Attention Network (R-GAT) that integrates multi-head attention with residual connections to capture semantic and relational dependencies in biomedical texts. Evaluated on a curated dataset of 1,875 PubMed abstracts spanning thyroid, colon, lung, and generic cancer topics, R-GAT achieves stable and competitive performance, comparable to transformer-based models such as BioBERT and BioClinicalBERT and strong classical baselines like Logistic Regression, while requiring significantly fewer computational resources. Ablation studies confirm the importance of attention and residual connections in ensuring robustness under limited-data conditions. To support reproducibility and facilitate future research, we also release the curated dataset. Together, these contributions demonstrate the value of lightweight graph-based architectures as reliable and resource-efficient alternatives to computationally intensive transformers in biomedical NLP.
