InfoDCL: Informative Noise Enhanced Diffusion Based Contrastive Learning
Xufeng Liang, Zhida Qin, Chong Zhang, Tianyu Huang, Gangyi Ding
TL;DR
This work tackles the sparsity of user-item interactions in recommender systems by moving beyond random perturbations in contrastive learning. It introduces InfoDCL, a diffusion-based framework that generates informative noise through a single-step diffusion process guided by auxiliary metadata via PSNet, including spectral rectification and contextual re-encoding. A collaborative training objective aligns generation, contrastive learning, and ranking signals, while keeping the diffusion process off during training and using inference-time LightGCN to inject higher-order structure efficiently. Across five real-world datasets, InfoDCL consistently outperforms state-of-the-art baselines, demonstrating the value of semantically informed diffusion for learning authentic user preferences in sparse settings.
Abstract
Contrastive learning has demonstrated promising potential in recommender systems. Existing methods typically construct sparser views by randomly perturbing the original interaction graph, as they have no idea about the authentic user preferences. Owing to the sparse nature of recommendation data, this paradigm can only capture insufficient semantic information. To address the issue, we propose InfoDCL, a novel diffusion-based contrastive learning framework for recommendation. Rather than injecting randomly sampled Gaussian noise, we employ a single-step diffusion process that integrates noise with auxiliary semantic information to generate signals and feed them to the standard diffusion process to generate authentic user preferences as contrastive views. Besides, based on a comprehensive analysis of the mutual influence between generation and preference learning in InfoDCL, we build a collaborative training objective strategy to transform the interference between them into mutual collaboration. Additionally, we employ multiple GCN layers only during inference stage to incorporate higher-order co-occurrence information while maintaining training efficiency. Extensive experiments on five real-world datasets demonstrate that InfoDCL significantly outperforms state-of-the-art methods. Our InfoDCL offers an effective solution for enhancing recommendation performance and suggests a novel paradigm for applying diffusion method in contrastive learning frameworks.
