Bridging User Dynamics: Transforming Sequential Recommendations with Schrödinger Bridge and Diffusion Models
Wenjia Xie, Rui Zhou, Hao Wang, Tingjia Shen, Enhong Chen
TL;DR
This work tackles the challenge of personalization in sequential recommendation when diffusion-based methods are constrained by Gaussian priors that limit per-user information. By introducing the Schrödinger Bridge, the authors replace the Gaussian prior with the user’s current state $x_1$ and connect it to the target item embedding $x_0$, enabling direct modeling of the user-to-item transition. The proposed SdifRec architecture, with a connectivity model $f_\theta$, trains to reconstruct $x_0$ from intermediate SB states using a cross-entropy objective, and infers by iteratively sampling toward $x_0$ for ranking. An extended variant, con-SdifRec, leverages clustering-guided conditioning from LightGCN to incorporate collaborative information through a controllable guidance mechanism $w$, yielding further improvements. Experiments on three benchmark SR datasets show that SdifRec and con-SdifRec outperform strong baselines, with notable gains in efficiency (fewer sampling steps) and robustness across sequence lengths and item popularity, suggesting a practical path to personalized diffusion-based SR with flexible priors and multimodal guidance.
Abstract
Sequential recommendation has attracted increasing attention due to its ability to accurately capture the dynamic changes in user interests. We have noticed that generative models, especially diffusion models, which have achieved significant results in fields like image and audio, hold considerable promise in the field of sequential recommendation. However, existing sequential recommendation methods based on diffusion models are constrained by a prior distribution limited to Gaussian distribution, hindering the possibility of introducing user-specific information for each recommendation and leading to information loss. To address these issues, we introduce the Schrödinger Bridge into diffusion-based sequential recommendation models, creating the SdifRec model. This allows us to replace the Gaussian prior of the diffusion model with the user's current state, directly modeling the process from a user's current state to the target recommendation. Additionally, to better utilize collaborative information in recommendations, we propose an extended version of SdifRec called con-SdifRec, which utilizes user clustering information as a guiding condition to further enhance the posterior distribution. Finally, extensive experiments on multiple public benchmark datasets have demonstrated the effectiveness of SdifRec and con-SdifRec through comparison with several state-of-the-art methods. Further in-depth analysis has validated their efficiency and robustness.
