Effects of Using Synthetic Data on Deep Recommender Models' Performance

Fatih Cihan Taskin; Ilknur Akcay; Muhammed Pesen; Said Aldemir; Ipek Iraz Esin; Furkan Durmus

Effects of Using Synthetic Data on Deep Recommender Models' Performance

Fatih Cihan Taskin, Ilknur Akcay, Muhammed Pesen, Said Aldemir, Ipek Iraz Esin, Furkan Durmus

TL;DR

The results show that the inclusion of generated negative samples consistently improves the Area Under the Curve (AUC) scores, highlighting the potential of data augmentation strategies to address issues of data sparsity and imbalance, ultimately leading to improved performance of recommender systems.

Abstract

Recommender systems are essential for enhancing user experiences by suggesting items based on individual preferences. However, these systems frequently face the challenge of data imbalance, characterized by a predominance of negative interactions over positive ones. This imbalance can result in biased recommendations favoring popular items. This study investigates the effectiveness of synthetic data generation in addressing data imbalances within recommender systems. Six different methods were used to generate synthetic data. Our experimental approach involved generating synthetic data using these methods and integrating the generated samples into the original dataset. Our results show that the inclusion of generated negative samples consistently improves the Area Under the Curve (AUC) scores. The significant impact of synthetic negative samples highlights the potential of data augmentation strategies to address issues of data sparsity and imbalance, ultimately leading to improved performance of recommender systems.

Effects of Using Synthetic Data on Deep Recommender Models' Performance

TL;DR

Abstract

Effects of Using Synthetic Data on Deep Recommender Models' Performance

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (1)