Table of Contents
Fetching ...

DSparsE: Dynamic Sparse Embedding for Knowledge Graph Completion

Chuhong Yang, Bin Li, Nan Wu

TL;DR

DSparsE tackles knowledge graph completion by addressing overfitting and limited feature interaction through a dynamic sparse encoder (dynamic layer + relation-aware layer) and a deep residual decoder. All dense layers are replaced with sparse MLPs to preserve expressivity while reducing parameter count, with a gating mechanism enabling expert-style, input-dependent fusion. Empirical results on FB15k-237, WN18RR, and YAGO3-10 show state-of-the-art or competitive performance, and ablations confirm the critical roles of the encoder components, residual decoding, and sparsity settings. The approach also reveals semantic clustering in gating outputs, suggesting meaningful organization of entity-relation patterns and robust scalability to deeper architectures.

Abstract

Addressing the incompleteness problem in knowledge graph remains a significant challenge. Current knowledge graph completion methods have their limitations. For example, ComDensE is prone to overfitting and suffers from the degradation with the increase of network depth while InteractE has the limitations in feature interaction and interpretability. To this end, we propose a new method called dynamic sparse embedding (DSparsE) for knowledge graph completion. The proposed model embeds the input entity-relation pairs by a shallow encoder composed of a dynamic layer and a relation-aware layer. Subsequently, the concatenated output of the dynamic layer and relation-aware layer is passed through a projection layer and a deep decoder with residual connection structure. This model ensures the network robustness and maintains the capability of feature extraction. Furthermore, the conventional dense layers are replaced by randomly initialized sparse connection layers in the proposed method, which can mitigate the model overfitting. Finally, comprehensive experiments are conducted on the datasets of FB15k-237, WN18RR and YAGO3-10. It was demonstrated that the proposed method achieves the state-of-the-art performance in terms of Hits@1 compared to the existing baseline approaches. An ablation study is performed to examine the effects of the dynamic layer and relation-aware layer, where the combined model achieves the best performance.

DSparsE: Dynamic Sparse Embedding for Knowledge Graph Completion

TL;DR

DSparsE tackles knowledge graph completion by addressing overfitting and limited feature interaction through a dynamic sparse encoder (dynamic layer + relation-aware layer) and a deep residual decoder. All dense layers are replaced with sparse MLPs to preserve expressivity while reducing parameter count, with a gating mechanism enabling expert-style, input-dependent fusion. Empirical results on FB15k-237, WN18RR, and YAGO3-10 show state-of-the-art or competitive performance, and ablations confirm the critical roles of the encoder components, residual decoding, and sparsity settings. The approach also reveals semantic clustering in gating outputs, suggesting meaningful organization of entity-relation patterns and robust scalability to deeper architectures.

Abstract

Addressing the incompleteness problem in knowledge graph remains a significant challenge. Current knowledge graph completion methods have their limitations. For example, ComDensE is prone to overfitting and suffers from the degradation with the increase of network depth while InteractE has the limitations in feature interaction and interpretability. To this end, we propose a new method called dynamic sparse embedding (DSparsE) for knowledge graph completion. The proposed model embeds the input entity-relation pairs by a shallow encoder composed of a dynamic layer and a relation-aware layer. Subsequently, the concatenated output of the dynamic layer and relation-aware layer is passed through a projection layer and a deep decoder with residual connection structure. This model ensures the network robustness and maintains the capability of feature extraction. Furthermore, the conventional dense layers are replaced by randomly initialized sparse connection layers in the proposed method, which can mitigate the model overfitting. Finally, comprehensive experiments are conducted on the datasets of FB15k-237, WN18RR and YAGO3-10. It was demonstrated that the proposed method achieves the state-of-the-art performance in terms of Hits@1 compared to the existing baseline approaches. An ablation study is performed to examine the effects of the dynamic layer and relation-aware layer, where the combined model achieves the best performance.

Paper Structure

This paper contains 17 sections, 8 equations, 6 figures, 3 tables.

Figures (6)

  • Figure 1: The architecture of DSparsE. The encoding end is composed of a dynamic layer and a relation-aware layer. The decoding end is composed of several residual MLP layers. Note that all the dense layers are replaced by sparse layers with certain sparsity degrees.
  • Figure 2: Hits@1 of InteractE, ComDensE, and DSparsE under different sparsity degrees on FB15k-237.
  • Figure 3: The effects of downscale and dropout.Sparse represents the proposed sparse structure, Downscale means cutting off part of the output dimension of the network, and Dropout means adding extra dropout based on the original dropout layer. The experiment is conducted on FB15k-237.
  • Figure 4: Hits@1 of DSparsE under different numbers of experts and temperatures on FB15k-237.$t$ denotes the temperature and Pure MLP denotes an MLP layer which has the same number of parameters as the dynamic layer.
  • Figure 5: The output of gated layer for each entity-relation pair. Each point represents an entity-relation pair in latent space after t-SNE reduction. The color of a point represents the relation type.
  • ...and 1 more figures