Intelligent Model Update Strategy for Sequential Recommendation
Zheqi Lv, Wenqiao Zhang, Zhengyu Chen, Shengyu Zhang, Kun Kuang
TL;DR
The paper addresses the inefficiency of frequent cloud-to-edge parameter updates in edge-cloud collaborative sequential recommendations. It introduces IntellectReq, a lightweight edge-side controller comprising MRD and DM that decides when cloud-generated dynamic parameters are truly beneficial, and maps real-time user behavior to a Gaussian latent space to estimate uncertainty. Building on EC-CDR, IntellectReq enables revenue-aware, budget-guided edge updates by predicting mis-recommendations and modeling data-shift uncertainty, with an MRD-driven threshold governing updates. Empirical evaluation on CDs, Electronic, and Douban Books benchmarks shows that IntellectReq delivers competitive accuracy with reduced communication and computing costs, achieving substantial resource savings while maintaining generalization and adaptability in dynamic user environments.
Abstract
Modern online platforms are increasingly employing recommendation systems to address information overload and improve user engagement. There is an evolving paradigm in this research field that recommendation network learning occurs both on the cloud and on edges with knowledge transfer in between (i.e., edge-cloud collaboration). Recent works push this field further by enabling edge-specific context-aware adaptivity, where model parameters are updated in real-time based on incoming on-edge data. However, we argue that frequent data exchanges between the cloud and edges often lead to inefficiency and waste of communication/computation resources, as considerable parameter updates might be redundant. To investigate this problem, we introduce Intelligent Edge-Cloud Parameter Request Model, abbreviated as IntellectReq. IntellectReq is designed to operate on edge, evaluating the cost-benefit landscape of parameter requests with minimal computation and communication overhead. We formulate this as a novel learning task, aimed at the detection of out-of-distribution data, thereby fine-tuning adaptive communication strategies. Further, we employ statistical mapping techniques to convert real-time user behavior into a normal distribution, thereby employing multi-sample outputs to quantify the model's uncertainty and thus its generalization capabilities. Rigorous empirical validation on four widely-adopted benchmarks evaluates our approach, evidencing a marked improvement in the efficiency and generalizability of edge-cloud collaborative and dynamic recommendation systems.
