SSDRec: Self-Augmented Sequence Denoising for Sequential Recommendation

Chi Zhang; Qilong Han; Rui Chen; Xiangyu Zhao; Peng Tang; Hongtao Song

SSDRec: Self-Augmented Sequence Denoising for Sequential Recommendation

Chi Zhang, Qilong Han, Rui Chen, Xiangyu Zhao, Peng Tang, Hongtao Song

TL;DR

SSDRec tackles noise in user sequences for sequential recommendation by introducing a three-stage framework that augments sequences before denoising. The method first builds a multi-relational graph and learns inter-sequence priors via a global relation encoder, then uses a self-augmentation module to insert two informative items at a carefully chosen position, and finally applies a hierarchical denoising module to produce reliable noiseless subsequences for downstream recommenders. Empirically, SSDRec improves performance across five real-world datasets and consistently outperforms state-of-the-art denoising methods and various backbone models, while maintaining practical efficiency. The approach offers a plug-in, data-driven way to mitigate over-denoising and under-denoising (OUPs) and enhances the robustness of sequential recommendations in noisy, real-world data.

Abstract

Traditional sequential recommendation methods assume that users' sequence data is clean enough to learn accurate sequence representations to reflect user preferences. In practice, users' sequences inevitably contain noise (e.g., accidental interactions), leading to incorrect reflections of user preferences. Consequently, some pioneer studies have explored modeling sequentiality and correlations in sequences to implicitly or explicitly reduce noise's influence. However, relying on only available intra-sequence information (i.e., sequentiality and correlations in a sequence) is insufficient and may result in over-denoising and under-denoising problems (OUPs), especially for short sequences. To improve reliability, we propose to augment sequences by inserting items before denoising. However, due to the data sparsity issue and computational costs, it is challenging to select proper items from the entire item universe to insert into proper positions in a target sequence. Motivated by the above observation, we propose a novel framework--Self-augmented Sequence Denoising for sequential Recommendation (SSDRec) with a three-stage learning paradigm to solve the above challenges. In the first stage, we empower SSDRec by a global relation encoder to learn multi-faceted inter-sequence relations in a data-driven manner. These relations serve as prior knowledge to guide subsequent stages. In the second stage, we devise a self-augmentation module to augment sequences to alleviate OUPs. Finally, we employ a hierarchical denoising module in the third stage to reduce the risk of false augmentations and pinpoint all noise in raw sequences. Extensive experiments on five real-world datasets demonstrate the superiority of \model over state-of-the-art denoising methods and its flexible applications to mainstream sequential recommendation models. The source code is available at https://github.com/zc-97/SSDRec.

SSDRec: Self-Augmented Sequence Denoising for Sequential Recommendation

TL;DR

Abstract

Paper Structure (33 sections, 15 equations, 5 figures, 6 tables)

This paper contains 33 sections, 15 equations, 5 figures, 6 tables.

Introduction
Preliminaries
Methodology
Multi-Relation Graph Construction
Item-Relation Sub-Graphs
User-Relation Sub-Graphs
Embedding Layer
Global Relation Encoder
Item-Transitional Relation Encoding Layer
Item-Incompatible Relation Encoding Layer
User-Item Interactional Relation Encoding Layer
User-Similar Relation Encoding Layer
User-Dissimilar Relation Encoding Layer
Self-Augmentation Module
Position Selector
...and 18 more sections

Figures (5)

Figure 1: The OUPs of different sequence denoising methods on ML-100K.
Figure 2: The architecture of the proposed SSDRec framework. (a) The first stage of SSDRec takes the multi-relation graph $\mathcal{G}$ as input and outputs the item representation sequence for subsequent denoising stages; (b) The second stage of SSDRec selects items and positions to augment sequences; (c) The third stage of SSDRec adopts a hierarchical denoising module to ensure augmentation reliability and generate noiseless sequences for making recommendations.
Figure 3: A toy example of the construction of a multi-relation graph.
Figure 4: A case study to show how the three-stage learning paradigm of SSDRec affects next-item recommendation on ML-100K.
Figure 5: Hyperparameter study for SSDRec in terms of HR@20, N@20, and MRR on ML-100K, ML-1M, Beauty, Sports, and Yelp datasets.

SSDRec: Self-Augmented Sequence Denoising for Sequential Recommendation

TL;DR

Abstract

SSDRec: Self-Augmented Sequence Denoising for Sequential Recommendation

Authors

TL;DR

Abstract

Table of Contents

Figures (5)