Rethinking Regularization Methods for Knowledge Graph Completion
Linyu Li, Zhi Jin, Yuanpeng He, Dongming Jin, Haoran Duan, Zhengwei Tao, Xuan Zhang, Jiandong Li
TL;DR
The paper addresses pervasive overfitting in knowledge graph completion by rethinking regularization, introducing SPR, a sparse-regularization method that selectively penalizes dominant embedding coordinates while discarding less informative ones. SPR is designed to be universally applicable across CP/ComplEx-like tensor models, GNN-based approaches, and temporal KGC, and is supported by theoretical sparsification bounds and empirical gains across diverse datasets. Across benchmarks such as WN18RR, FB15K-237, YAGO3-10, and ICEWS, SPR reduces variance and enhances generalization, often raising MRR and Hits@k beyond baseline methods. The work also provides comparisons to dropout and discusses implications for optimizing model complexity, with plans to release code and data for reproducibility and future extensions to adaptive delta and NLP-based KGC systems.
Abstract
Knowledge graph completion (KGC) has attracted considerable attention in recent years because it is critical to improving the quality of knowledge graphs. Researchers have continuously explored various models. However, most previous efforts have neglected to take advantage of regularization from a deeper perspective and therefore have not been used to their full potential. This paper rethinks the application of regularization methods in KGC. Through extensive empirical studies on various KGC models, we find that carefully designed regularization not only alleviates overfitting and reduces variance but also enables these models to break through the upper bounds of their original performance. Furthermore, we introduce a novel sparse-regularization method that embeds the concept of rank-based selective sparsity into the KGC regularizer. The core idea is to selectively penalize those components with significant features in the embedding vector, thus effectively ignoring many components that contribute little and may only represent noise. Various comparative experiments on multiple datasets and multiple models show that the SPR regularization method is better than other regularization methods and can enable the KGC model to further break through the performance margin.
