A Privacy-Preserving Framework with Multi-Modal Data for Cross-Domain Recommendation
Li Wang, Lei Sang, Quangui Zhang, Qiang Wu, Min Xu
TL;DR
This work tackles data sparsity and privacy leakage in cross-domain recommendation by introducing P2M2-CDR, a framework that fuses multi-modal data with a disentangled encoder to learn domain-common and domain-specific user representations. A privacy-preserving decoder employs local differential privacy and contrastive learning (domain-intra and domain-inter) to obfuscate and align embeddings during inter-domain transfer. Extensive experiments on four Amazon-based CD scenarios show that P2M2-CDR outperforms state-of-the-art baselines while maintaining a controllable privacy-utility trade-off, with ablations validating the contribution of each component. The approach advances privacy-aware cross-domain recommendations by exploiting rich modalities (texts, visuals, reviews) and robust contrastive objectives, with potential extensions to federated settings for even stronger privacy guarantees.
Abstract
Cross-domain recommendation (CDR) aims to enhance recommendation accuracy in a target domain with sparse data by leveraging rich information in a source domain, thereby addressing the data-sparsity problem. Some existing CDR methods highlight the advantages of extracting domain-common and domain-specific features to learn comprehensive user and item representations. However, these methods can't effectively disentangle these components as they often rely on simple user-item historical interaction information (such as ratings, clicks, and browsing), neglecting the rich multi-modal features. Additionally, they don't protect user-sensitive data from potential leakage during knowledge transfer between domains. To address these challenges, we propose a Privacy-Preserving Framework with Multi-Modal Data for Cross-Domain Recommendation, called P2M2-CDR. Specifically, we first design a multi-modal disentangled encoder that utilizes multi-modal information to disentangle more informative domain-common and domain-specific embeddings. Furthermore, we introduce a privacy-preserving decoder to mitigate user privacy leakage during knowledge transfer. Local differential privacy (LDP) is utilized to obfuscate the disentangled embeddings before inter-domain exchange, thereby enhancing privacy protection. To ensure both consistency and differentiation among these obfuscated disentangled embeddings, we incorporate contrastive learning-based domain-inter and domain-intra losses. Extensive Experiments conducted on four real-world datasets demonstrate that P2M2-CDR outperforms other state-of-the-art single-domain and cross-domain baselines.
