Table of Contents
Fetching ...

MultiCBR: Multi-view Contrastive Learning for Bundle Recommendation

Yunshan Ma, Yingzhi He, Xiang Wang, Yinwei Wei, Xiaoyu Du, Yuyangzi Fu, Tat-Seng Chua

TL;DR

MultiCBR advances bundle recommendation by introducing a three-view (UB, UI, BI) representation learning framework and an early fusion–late contrast strategy. By fusing the three view-specific embeddings before applying self-supervised contrast, it captures both cross-view and ego-view user preferences while using only two contrastive losses, improving efficiency. Empirical results on three benchmark datasets show consistent gains over state-of-the-art methods, with particularly large improvements on highly BI-sparse data like iFashion. The work also provides extensive ablations and model studies that validate the BI view's role in addressing BI sparsity and the effectiveness of the fusion design for multi-view collaboration.

Abstract

Bundle recommendation seeks to recommend a bundle of related items to users to improve both user experience and the profits of platform. Existing bundle recommendation models have progressed from capturing only user-bundle interactions to the modeling of multiple relations among users, bundles and items. CrossCBR, in particular, incorporates cross-view contrastive learning into a two-view preference learning framework, significantly improving SOTA performance. It does, however, have two limitations: 1) the two-view formulation does not fully exploit all the heterogeneous relations among users, bundles and items; and 2) the "early contrast and late fusion" framework is less effective in capturing user preference and difficult to generalize to multiple views. In this paper, we present MultiCBR, a novel Multi-view Contrastive learning framework for Bundle Recommendation. First, we devise a multi-view representation learning framework capable of capturing all the user-bundle, user-item and bundle-item relations, especially better utilizing the bundle-item affiliations to enhance sparse bundles' representations. Second, we innovatively adopt an "early fusion and late contrast" design that first fuses the multi-view representations before performing self-supervised contrastive learning. In comparison to existing approaches, our framework reverses the order of fusion and contrast, introducing the following advantages: 1)our framework is capable of modeling both cross-view and ego-view preferences, allowing us to achieve enhanced user preference modeling; and 2) instead of requiring quadratic number of cross-view contrastive losses, we only require two self-supervised contrastive losses, resulting in minimal extra costs. Experimental results on three public datasets indicate that our method outperforms SOTA methods.

MultiCBR: Multi-view Contrastive Learning for Bundle Recommendation

TL;DR

MultiCBR advances bundle recommendation by introducing a three-view (UB, UI, BI) representation learning framework and an early fusion–late contrast strategy. By fusing the three view-specific embeddings before applying self-supervised contrast, it captures both cross-view and ego-view user preferences while using only two contrastive losses, improving efficiency. Empirical results on three benchmark datasets show consistent gains over state-of-the-art methods, with particularly large improvements on highly BI-sparse data like iFashion. The work also provides extensive ablations and model studies that validate the BI view's role in addressing BI sparsity and the effectiveness of the fusion design for multi-view collaboration.

Abstract

Bundle recommendation seeks to recommend a bundle of related items to users to improve both user experience and the profits of platform. Existing bundle recommendation models have progressed from capturing only user-bundle interactions to the modeling of multiple relations among users, bundles and items. CrossCBR, in particular, incorporates cross-view contrastive learning into a two-view preference learning framework, significantly improving SOTA performance. It does, however, have two limitations: 1) the two-view formulation does not fully exploit all the heterogeneous relations among users, bundles and items; and 2) the "early contrast and late fusion" framework is less effective in capturing user preference and difficult to generalize to multiple views. In this paper, we present MultiCBR, a novel Multi-view Contrastive learning framework for Bundle Recommendation. First, we devise a multi-view representation learning framework capable of capturing all the user-bundle, user-item and bundle-item relations, especially better utilizing the bundle-item affiliations to enhance sparse bundles' representations. Second, we innovatively adopt an "early fusion and late contrast" design that first fuses the multi-view representations before performing self-supervised contrastive learning. In comparison to existing approaches, our framework reverses the order of fusion and contrast, introducing the following advantages: 1)our framework is capable of modeling both cross-view and ego-view preferences, allowing us to achieve enhanced user preference modeling; and 2) instead of requiring quadratic number of cross-view contrastive losses, we only require two self-supervised contrastive losses, resulting in minimal extra costs. Experimental results on three public datasets indicate that our method outperforms SOTA methods.
Paper Structure (31 sections, 13 equations, 7 figures, 9 tables)

This paper contains 31 sections, 13 equations, 7 figures, 9 tables.

Figures (7)

  • Figure 1: Brief comparison between MultiCBR and CrossCBR: 1) MultiCBR extends two views to three views, where the additional BI view aims to capture the bundle composition patterns implied by BI graph; and 2) MultiCBR adopts an "early fusion and late contrast" framework, which can better model user preference while introducing few extra expenses.
  • Figure 2: The overall framework of MultiCBR consists of three parts: 1) multi-view representation learning, including user-bundle interaction view (UB Int. View), user-item interaction view (UI Int. View), and bundle-item affiliation view (BI Aff. View); 2) multi-view fusion and prediction; and 3) joint optimization of BPR loss and self-supervised contrastive loss.
  • Figure 3: The performance comparison among bundle groups with different B-I-U sparsity rates.
  • Figure 4: The cross-view and overall performance comparison for several ablated models.
  • Figure 5: BI view coefficients analysis. The horizontal axis of all the sub-figures is the view coefficient on the BI view, and the vertial axis corresponds to the NDCG@20 score of the model.
  • ...and 2 more figures