Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data

Pei-Yau Weng; Minh Hoang; Lam M. Nguyen; My T. Thai; Tsui-Wei Weng; Trong Nghia Hoang

Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data

Pei-Yau Weng, Minh Hoang, Lam M. Nguyen, My T. Thai, Tsui-Wei Weng, Trong Nghia Hoang

TL;DR

This work tackles federated learning with severely non-IID and imbalanced data by replacing costly full-model fine-tuning with a compact, probabilistic prompt-tuning strategy. By modeling local prompts as samples from a generative process anchored by global summarizing prompts, PFPT aligns diverse client contexts through a bipartite-matching-based aggregation that preserves privacy and reduces communication. The method achieves consistent gains over adapted FL baselines across multiple vision benchmarks, including long-tailed and globally skewed distributions, and demonstrates convergence and diversity in the learned prompts. Overall, PFPT offers a scalable, communication-efficient path to robust model adaptation in highly heterogeneous federated environments.

Abstract

Fine-tuning pre-trained models is a popular approach in machine learning for solving complex tasks with moderate data. However, fine-tuning the entire pre-trained model is ineffective in federated data scenarios where local data distributions are diversely skewed. To address this, we explore integrating federated learning with a more effective prompt-tuning method, optimizing for a small set of input prefixes to reprogram the pre-trained model's behavior. Our approach transforms federated learning into a distributed set modeling task, aggregating diverse sets of prompts to globally fine-tune the pre-trained model. We benchmark various baselines based on direct adaptations of existing federated model aggregation techniques and introduce a new probabilistic prompt aggregation method that substantially outperforms these baselines. Our reported results on a variety of computer vision datasets confirm that the proposed method is most effective to combat extreme data heterogeneity in federated learning.

Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data

TL;DR

Abstract

Probabilistic Federated Prompt-Tuning with Non-IID and Imbalanced Data

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (5)

Theorems & Definitions (3)