Enhancing Recommendation with Denoising Auxiliary Task
Pengsheng Liu, Linan Zheng, Jiale Chen, Guangfa Zhang, Yang Xu, Jinyun Fang
TL;DR
Noise in historical user interaction sequences hinders accurate next-action prediction in recommender systems. The authors propose Auxiliary Task Joint Training (ATJT), a self-supervised approach that jointly trains a noise-recognition auxiliary task to reweight sequences, using artificially generated noise to learn robust weights $w_i$. By combining a noise-recognition objective with the main recommender objective, ATJT achieves adaptive sequence weighting during training. Experiments across MovieLens20M, Amazon Electro, and Yelp on six base models demonstrate consistent improvements in AUC, HR@5, and NDCG@5, highlighting ATJT’s robustness and broad applicability in denoising sequential data for recommendation.
Abstract
The historical interaction sequences of users plays a crucial role in training recommender systems that can accurately predict user preferences. However, due to the arbitrariness of user behavior, the presence of noise in these sequences poses a challenge to predicting their next actions in recommender systems. To address this issue, our motivation is based on the observation that training noisy sequences and clean sequences (sequences without noise) with equal weights can impact the performance of the model. We propose a novel self-supervised Auxiliary Task Joint Training (ATJT) method aimed at more accurately reweighting noisy sequences in recommender systems. Specifically, we strategically select subsets from users' original sequences and perform random replacements to generate artificially replaced noisy sequences. Subsequently, we perform joint training on these artificially replaced noisy sequences and the original sequences. Through effective reweighting, we incorporate the training results of the noise recognition model into the recommender model. We evaluate our method on three datasets using a consistent base model. Experimental results demonstrate the effectiveness of introducing self-supervised auxiliary task to enhance the base model's performance.
