SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations

Md Saidul Hoque Anik; Ariful Azad

TL;DR

This work tackles the slow training of translation-based knowledge graph embeddings by identifying embedding gradient computation as a major bottleneck. It introduces SparseTransX, a sparse-matrix framework that replaces dense embedding gathering with high-performance SpMM operations, unifying forward and backward passes through a sparse incidence matrix $A$ and enabling efficient training of models such as TransE, TransR, TransH, and TorusE. The approach yields substantial speedups on both CPU (up to 5.3×) and GPU (up to 4.2×) while reducing GPU memory usage, with accuracy remaining on par with established frameworks across seven datasets. The system includes a PyTorch-based library, scalable data loading, streaming embeddings, and configurable sparse backends, offering a path to large-batch training and broader applicability to other KGEs. Overall, SparseTransX demonstrates that leveraging sparse linear algebra for KG embedding training can significantly improve performance and scalability without sacrificing predictive quality.

Abstract

Knowledge graph (KG) learning offers a powerful framework for generating new knowledge and making inferences. Training KG embeddings can take a long time, especially for larger datasets. Our analysis shows that gradient computation for the embeddings is one of the dominant costs in the translation-based KG embedding training loop. We address this issue by replacing the core embedding computation with SpMM (Sparse-Dense Matrix Multiplication) kernels. This allows us to unify multiple scatter (and gather) operations into a single operation, reducing training time and memory usage. We create a general framework for training KG models using sparse kernels and implement four models, namely TransE, TransR, TransH, and TorusE. Our sparse implementations exhibit up to 5.3x speedup on the CPU and up to 4.2x speedup on the GPU with a significantly lower GPU memory footprint. The speedups are consistent across large and small datasets for a given model. Our proposed sparse approach can also be extended to accelerate other translation-based models (such as TransC and TransM) and non-translational models (such as DistMult, ComplEx, and RotatE). An implementation of the SpTransX framework is publicly available as a Python package at https://github.com/HipGraph/SpTransX.
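
To make the idea of unifying gather and scatter operations concrete, here is a minimal sketch for a TransE-style expression $h + r - t$. It is an illustration under stated assumptions, not the SpTransX implementation: the incidence-matrix layout (entity rows stacked above relation rows, with +1, +1, -1 entries per triple), the function names, and the toy data are all hypothetical, and the pure-Python `spmm` loop stands in for an optimized SpMM kernel from a sparse library.

```python
# Toy illustration only: names and matrix layout are assumptions,
# not the SpTransX API. Real code would call an optimized SpMM kernel.

def build_incidence(triples, num_entities):
    """Return COO entries (row, col, value) of a sparse incidence matrix A.

    Row i encodes triple i = (head, relation, tail): +1 at the head column,
    +1 at the relation column (offset by num_entities), -1 at the tail column.
    """
    entries = []
    for i, (h, r, t) in enumerate(triples):
        entries.append((i, h, 1.0))
        entries.append((i, num_entities + r, 1.0))
        entries.append((i, t, -1.0))
    return entries

def spmm(entries, num_rows, dense):
    """Multiply a COO sparse matrix by a dense matrix (list of rows)."""
    dim = len(dense[0])
    out = [[0.0] * dim for _ in range(num_rows)]
    for row, col, val in entries:
        for k in range(dim):
            out[row][k] += val * dense[col][k]
    return out

def transpose(entries):
    """Transpose a COO matrix by swapping row and column indices."""
    return [(col, row, val) for row, col, val in entries]

def gather_forward(triples, ent, rel):
    """Reference forward pass: three separate gathers per triple."""
    dim = len(ent[0])
    return [[ent[h][k] + rel[r][k] - ent[t][k] for k in range(dim)]
            for h, r, t in triples]

def scatter_backward(triples, grad_out, num_entities, num_relations):
    """Reference backward pass: three separate scatter-adds per triple."""
    dim = len(grad_out[0])
    grad = [[0.0] * dim for _ in range(num_entities + num_relations)]
    for i, (h, r, t) in enumerate(triples):
        for k in range(dim):
            grad[h][k] += grad_out[i][k]
            grad[num_entities + r][k] += grad_out[i][k]
            grad[t][k] -= grad_out[i][k]
    return grad

# Toy data: 3 entities, 2 relations, embedding dimension 2.
ent = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
rel = [[0.5, 0.5], [1.0, -1.0]]
triples = [(0, 0, 1), (2, 1, 0), (1, 0, 2)]  # (head, relation, tail)

E = ent + rel                       # stacked embedding matrix, 5 x 2
A = build_incidence(triples, len(ent))

# Forward: one sparse-dense product replaces three gathers.
hrt = spmm(A, len(triples), E)

# Backward: the gradient w.r.t. E is A^T times the upstream gradient,
# so one product also replaces the three scatter-adds.
grad_out = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]
grad_E = spmm(transpose(A), len(E), grad_out)
```

In this sketch, `hrt` matches `gather_forward(triples, ent, rel)` and `grad_E` matches `scatter_backward(triples, grad_out, 3, 2)`, which is the sense in which a single sparse matrix can serve both the forward and the backward pass.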
