Implicit degree bias in the link prediction task
Rachith Aiyappa, Xin Wang, Munjung Kim, Ozgur Can Seckin, Jisung Yoon, Yong-Yeol Ahn, Sadamori Kojaku
TL;DR
The paper reveals a strong degree bias in the standard link prediction benchmark caused by edge sampling, where positive edges overrepresent high-degree nodes (with $p_{ ext{pos}}(k)=\frac{1}{\langle k \rangle} k p(k)$) while negatives remain degree-balanced as $p_{ ext{neg}}(k)=p(k)$. It introduces a degree-corrected benchmark that matches the degree distributions of positives and negatives, showing that degree-only methods lose their advantage and that alignment with recommendation tasks improves, including better training outcomes for GNNs on community-detection benchmarks. The work demonstrates that current evaluations can overfit to node degrees and that the corrected benchmark provides more faithful assessments and learning signals, ultimately improving structure discovery in graphs. These findings have practical implications for fairer, more robust graph ML evaluations and training, with code and data made available to promote reproducibility and broader adoption.
Abstract
Link prediction -- a task of distinguishing actual hidden edges from random unconnected node pairs -- is one of the quintessential tasks in graph machine learning. Despite being widely accepted as a universal benchmark and a downstream task for representation learning, the validity of the link prediction benchmark itself has been rarely questioned. Here, we show that the common edge sampling procedure in the link prediction task has an implicit bias toward high-degree nodes and produces a highly skewed evaluation that favors methods overly dependent on node degree, to the extent that a ``null'' link prediction method based solely on node degree can yield nearly optimal performance. We propose a degree-corrected link prediction task that offers a more reasonable assessment that aligns better with the performance in the recommendation task. Finally, we demonstrate that the degree-corrected benchmark can more effectively train graph machine-learning models by reducing overfitting to node degrees and facilitating the learning of relevant structures in graphs.
