Multimodal-enhanced Federated Recommendation: A Group-wise Fusion Approach

Chunxu Zhang; Weipeng Zhang; Guodong Long; Zhiheng Xue; Riting Xia; Bo Yang

Multimodal-enhanced Federated Recommendation: A Group-wise Fusion Approach

Chunxu Zhang, Weipeng Zhang, Guodong Long, Zhiheng Xue, Riting Xia, Bo Yang

TL;DR

This work proposes a novel multimodal fusion mechanism in federated recommendation settings (GFMFR), which offloads multimodal representation learning to the server, which stores item content and employs a high-capacity encoder to generate expressive representations, alleviating client-side overhead.

Abstract

Federated Recommendation (FR) is a new learning paradigm to tackle the learn-to-rank problem in a privacy-preservation manner. How to integrate multi-modality features into federated recommendation is still an open challenge in terms of efficiency, distribution heterogeneity, and fine-grained alignment. To address these challenges, we propose a novel multimodal fusion mechanism in federated recommendation settings (GFMFR). Specifically, it offloads multimodal representation learning to the server, which stores item content and employs a high-capacity encoder to generate expressive representations, alleviating client-side overhead. Moreover, a group-aware item representation fusion approach enables fine-grained knowledge sharing among similar users while retaining individual preferences. The proposed fusion loss could be simply plugged into any existing federated recommender systems empowering their capability by adding multi-modality features. Extensive experiments on five public benchmark datasets demonstrate that GFMFR consistently outperforms state-of-the-art multimodal FR baselines.

Multimodal-enhanced Federated Recommendation: A Group-wise Fusion Approach

TL;DR

Abstract

Multimodal-enhanced Federated Recommendation: A Group-wise Fusion Approach

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (9)