Not All Clients Are Equal: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients

Minhyuk Seo; Taeheon Kim; Hankook Lee; Jonghyun Choi; Tinne Tuytelaars

Not All Clients Are Equal: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients

Minhyuk Seo, Taeheon Kim, Hankook Lee, Jonghyun Choi, Tinne Tuytelaars

TL;DR

FedMosaic tackles data and model heterogeneity in personalized federated learning by coupling RELA, a relevance-guided, gradient-based aggregation scheme, with PQ-LoRA, a shareable, dimension-invariant adapter mechanism that enables knowledge transfer across heterogeneous architectures. It introduces DRAKE, a comprehensive multi-modal FL benchmark with task heterogeneity and distribution shifts to reflect real-world conditions. Empirically, FedMosaic achieves superior personalization and generalization across diverse heterogeneous setups, including cross-family model sharing and large-scale LLM scenarios, while maintaining manageable computation and communication costs through gradient sanitization and compressed PQ-LoRA updates. The work offers a practical pathway for deploying personalized, privacy-preserving multi-modal models in real-world, heterogeneous environments and provides a rich benchmark for future research in federated learning with foundation models.

Abstract

As AI becomes more personal, e.g., Agentic AI, there is an increasing need for personalizing models for various use cases. Personalized federated learning (PFL) enables each client to collaboratively leverage other clients' knowledge for better adaptation to the task of interest, without privacy risks. Despite its potential, existing PFL methods remain confined to rather simplified scenarios where data and models are the same across clients. To move towards realistic scenarios, we propose FedMosaic, a method that jointly addresses data and model heterogeneity with a task-relevance-aware model aggregation strategy to reduce parameter interference, and a dimension-invariant module that enables knowledge sharing across heterogeneous architectures without huge computational cost. To mimic the real-world task diversity, we propose a multi-modal PFL benchmark spanning 40 distinct tasks with distribution shifts over time. The empirical study shows that FedMosaic outperforms the state-of-the-art PFL methods, excelling in both personalization and generalization capabilities under challenging, realistic scenarios.

Not All Clients Are Equal: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients

TL;DR

Abstract

Not All Clients Are Equal: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (23)

Theorems & Definitions (4)