Table of Contents
Fetching ...

Knowledge Graph Pruning for Recommendation

Fake Lin, Xi Zhu, Ziwei Zhao, Deqiang Huang, Yu Yu, Xueying Li, Zhi Zheng, Tong Xu, Enhong Chen

TL;DR

This work tackles the knowledge explosion problem in KG-based recommender systems by introducing KGTrimmer, a dual-view pruning framework that learns what KG components are valueless for recommendation. It combines a collective-view evaluator, based on collaborative signals from user interactions, with a holistic-view learnable mask, and fuses them into an importance-aware graph neural network that propagates through a pruned KG. Across three public datasets, KGTrimmer consistently maintains or improves top-K recommendation metrics while significantly reducing KG size and training time; ablation confirms both views contribute meaningfully, with the collective view often driving larger gains. The approach enables scalable, efficient KG-enhanced recommendations and offers practical pruning strategies with clear thresholds and aggregation schemes for real-world systems.

Abstract

Recent years have witnessed the prosperity of knowledge graph based recommendation system (KGRS), which enriches the representation of users, items, and entities by structural knowledge with striking improvement. Nevertheless, its unaffordable computational cost still limits researchers from exploring more sophisticated models. We observe that the bottleneck for training efficiency arises from the knowledge graph, which is plagued by the well-known issue of knowledge explosion. Recently, some works have attempted to slim the inflated KG via summarization techniques. However, these summarized nodes may ignore the collaborative signals and deviate from the facts that nodes in knowledge graph represent symbolic abstractions of entities from the real-world. To this end, in this paper, we propose a novel approach called KGTrimmer for knowledge graph pruning tailored for recommendation, to remove the unessential nodes while minimizing performance degradation. Specifically, we design an importance evaluator from a dual-view perspective. For the collective view, we embrace the idea of collective intelligence by extracting community consensus based on abundant collaborative signals, i.e. nodes are considered important if they attract attention of numerous users. For the holistic view, we learn a global mask to identify the valueless nodes from their inherent properties or overall popularity. Next, we build an end-to-end importance-aware graph neural network, which injects filtered knowledge to enhance the distillation of valuable user-item collaborative signals. Ultimately, we generate a pruned knowledge graph with lightweight, stable, and robust properties to facilitate the following-up recommendation task. Extensive experiments are conducted on three publicly available datasets to prove the effectiveness and generalization ability of KGTrimmer.

Knowledge Graph Pruning for Recommendation

TL;DR

This work tackles the knowledge explosion problem in KG-based recommender systems by introducing KGTrimmer, a dual-view pruning framework that learns what KG components are valueless for recommendation. It combines a collective-view evaluator, based on collaborative signals from user interactions, with a holistic-view learnable mask, and fuses them into an importance-aware graph neural network that propagates through a pruned KG. Across three public datasets, KGTrimmer consistently maintains or improves top-K recommendation metrics while significantly reducing KG size and training time; ablation confirms both views contribute meaningfully, with the collective view often driving larger gains. The approach enables scalable, efficient KG-enhanced recommendations and offers practical pruning strategies with clear thresholds and aggregation schemes for real-world systems.

Abstract

Recent years have witnessed the prosperity of knowledge graph based recommendation system (KGRS), which enriches the representation of users, items, and entities by structural knowledge with striking improvement. Nevertheless, its unaffordable computational cost still limits researchers from exploring more sophisticated models. We observe that the bottleneck for training efficiency arises from the knowledge graph, which is plagued by the well-known issue of knowledge explosion. Recently, some works have attempted to slim the inflated KG via summarization techniques. However, these summarized nodes may ignore the collaborative signals and deviate from the facts that nodes in knowledge graph represent symbolic abstractions of entities from the real-world. To this end, in this paper, we propose a novel approach called KGTrimmer for knowledge graph pruning tailored for recommendation, to remove the unessential nodes while minimizing performance degradation. Specifically, we design an importance evaluator from a dual-view perspective. For the collective view, we embrace the idea of collective intelligence by extracting community consensus based on abundant collaborative signals, i.e. nodes are considered important if they attract attention of numerous users. For the holistic view, we learn a global mask to identify the valueless nodes from their inherent properties or overall popularity. Next, we build an end-to-end importance-aware graph neural network, which injects filtered knowledge to enhance the distillation of valuable user-item collaborative signals. Ultimately, we generate a pruned knowledge graph with lightweight, stable, and robust properties to facilitate the following-up recommendation task. Extensive experiments are conducted on three publicly available datasets to prove the effectiveness and generalization ability of KGTrimmer.
Paper Structure (38 sections, 20 equations, 4 figures, 6 tables, 1 algorithm)

This paper contains 38 sections, 20 equations, 4 figures, 6 tables, 1 algorithm.

Figures (4)

  • Figure 1: The motivation example from an online a movie platform, where (a) presents a real-world collaborative knowledge graph and (b) indicates the corresponding collective importance of each entity for user modeling.
  • Figure 2: The overall framework of the proposed KGTrimmer framework, which consists three components: (a)Dual-View Importance Evaluator, (b) Importance-Aware Graph Neural Network, and (c) Pruned Knowledge Graph Generation.
  • Figure 3: The final importance scores' distribution in three datasets. The blue line and red line represent the distribution of entity scores and triplet scores, respectively.
  • Figure 4: Parameters sensitivity to show the how the $\gamma$ affect the performance.

Theorems & Definitions (3)

  • definition 1
  • definition 2
  • definition 3