Unified Alignment Protocol: Making Sense of the Unlabeled Data in New Domains
Sabbir Ahmed, Mamshad Nayeem Rizve, Abdullah Al Arafat, Jacqueline Liu, Rahim Hossain, Mohaiminul Al Nahian, Adnan Siraj Rakin
TL;DR
The paper tackles S-FDG, where clients hold unlabeled data and the server has limited labels, under practical domain shifts. It introduces the Unified Alignment Protocol (UAP), a two-stage alternating training framework that first aligns the server feature distribution to a known parametric form and then guides client features to align with that distribution, all without additional communication. By enforcing a Gaussian-like server feature model with a class-conditioned CDD loss and a covariance regularizer, UAP yields domain-invariant representations and improved unseen-domain generalization. Experiments on five domain-generalization benchmarks demonstrate state-of-the-art performance in the S-FDG setting across diverse architectures, highlighting a practical, privacy-preserving path to robust cross-domain learning in real-world federated systems.
Abstract
Semi-Supervised Federated Learning (SSFL) is gaining popularity over conventional Federated Learning in many real-world applications. Due to the practical limitation of limited labeled data on the client side, SSFL considers that participating clients train with unlabeled data, and only the central server has the necessary resources to access limited labeled data, making it an ideal fit for real-world applications (e.g., healthcare). However, traditional SSFL assumes that the data distributions in the training phase and testing phase are the same. In practice, however, domain shifts frequently occur, making it essential for SSFL to incorporate generalization capabilities and enhance their practicality. The core challenge is improving model generalization to new, unseen domains while the client participate in SSFL. However, the decentralized setup of SSFL and unsupervised client training necessitates innovation to achieve improved generalization across domains. To achieve this, we propose a novel framework called the Unified Alignment Protocol (UAP), which consists of an alternating two-stage training process. The first stage involves training the server model to learn and align the features with a parametric distribution, which is subsequently communicated to clients without additional communication overhead. The second stage proposes a novel training algorithm that utilizes the server feature distribution to align client features accordingly. Our extensive experiments on standard domain generalization benchmark datasets across multiple model architectures reveal that proposed UAP successfully achieves SOTA generalization performance in SSFL setting.
