Table of Contents
Fetching ...

SimRec: Mitigating the Cold-Start Problem in Sequential Recommendation by Integrating Item Similarity

Shaked Brody, Shoval Lagziel

TL;DR

This work proposes SimRec - a new approach to mitigate the cold-start problem in sequential recommendation systems by leveraging the inherent similarity among items, incorporating item similarities into the training process through a customized loss function, resulting in a robust contextual sequential recommendation model capable of effectively handling rare items.

Abstract

Sequential recommendation systems often struggle to make predictions or take action when dealing with cold-start items that have limited amount of interactions. In this work, we propose SimRec - a new approach to mitigate the cold-start problem in sequential recommendation systems. SimRec addresses this challenge by leveraging the inherent similarity among items, incorporating item similarities into the training process through a customized loss function. Importantly, this enhancement is attained with identical model architecture and the same amount of trainable parameters, resulting in the same inference time and requiring minimal additional effort. This novel approach results in a robust contextual sequential recommendation model capable of effectively handling rare items, including those that were not explicitly seen during training, thereby enhancing overall recommendation performance. Rigorous evaluations against multiple baselines on diverse datasets showcase SimRec's superiority, particularly in scenarios involving items occurring less than 10 times in the training data. The experiments reveal an impressive improvement, with SimRec achieving up to 78% higher HR@10 compared to SASRec. Notably, SimRec outperforms strong baselines on sparse datasets while delivering on-par performance on dense datasets. Our code is available at https://github.com/amazon-science/sequential-recommendation-using-similarity.

SimRec: Mitigating the Cold-Start Problem in Sequential Recommendation by Integrating Item Similarity

TL;DR

This work proposes SimRec - a new approach to mitigate the cold-start problem in sequential recommendation systems by leveraging the inherent similarity among items, incorporating item similarities into the training process through a customized loss function, resulting in a robust contextual sequential recommendation model capable of effectively handling rare items.

Abstract

Sequential recommendation systems often struggle to make predictions or take action when dealing with cold-start items that have limited amount of interactions. In this work, we propose SimRec - a new approach to mitigate the cold-start problem in sequential recommendation systems. SimRec addresses this challenge by leveraging the inherent similarity among items, incorporating item similarities into the training process through a customized loss function. Importantly, this enhancement is attained with identical model architecture and the same amount of trainable parameters, resulting in the same inference time and requiring minimal additional effort. This novel approach results in a robust contextual sequential recommendation model capable of effectively handling rare items, including those that were not explicitly seen during training, thereby enhancing overall recommendation performance. Rigorous evaluations against multiple baselines on diverse datasets showcase SimRec's superiority, particularly in scenarios involving items occurring less than 10 times in the training data. The experiments reveal an impressive improvement, with SimRec achieving up to 78% higher HR@10 compared to SASRec. Notably, SimRec outperforms strong baselines on sparse datasets while delivering on-par performance on dense datasets. Our code is available at https://github.com/amazon-science/sequential-recommendation-using-similarity.

Paper Structure

This paper contains 17 sections, 4 equations, 9 figures, 8 tables.

Figures (9)

  • Figure 1: The Cold-Start Problem: The item frequency of the Amazon Beauty training set. 91% of the items appear less than 10 times.
  • Figure 2: Cold-start performance analysis reveals that SimRec excels on rare items, showcasing improvements attributed to enhanced performance in such scenarios, particularly evident in the Beauty dataset. SimRec achieves a remarkable 78% higher HR@10 and a 101% higher NDCG@10 compared to SASRec kang2018self for items that appear less than 10 times in the training set (accounting for 61% of the test set).
  • Figure 3: Relative performance gain across various dataset densities: SimRec exhibits a higher relative gain over SASRec kang2018self with the sparser variants of the Beauty dataset, which decreases as dataset density increases.
  • Figure 4: Cold-start performance on the Beauty dataset. Datapoints with item frequencies less than 10 collectively account for 61% of the test set.
  • Figure 5: Cold-start performance on the Tools dataset. Datapoints with item frequencies less than 10 collectively account for 64% of the test set.
  • ...and 4 more figures