Table of Contents
Fetching ...

MAP: Model Aggregation and Personalization in Federated Learning with Incomplete Classes

Xin-Chun Li, Shaoming Song, Yinchuan Li, Bingshuai Li, Yunfeng Shao, Yang Yang, De-Chuan Zhan

TL;DR

MAP addresses federated learning under incomplete class distributions by unifying two components: Restricted Softmax (RS) to stabilize aggregation against missing classes and Inherited Private Model (HPM) to preserve and leverage historical personalization. The two-stage personalization in MAP—RS-driven initialization followed by standard training with knowledge transfer from HPM—enables simultaneous improvements in both global generalization and local client performance. Theoretical analysis and extensive experiments on multiple datasets demonstrate MAP’s ability to outperform standard FL methods and approach or exceed specialized aggregation or personalization baselines, while also offering scalability to large-scale settings like FEMNIST. This work provides a practical, principled approach to non-I.I.D. FL where clients observe only subsets of classes, with potential implications for fairness and robustness in real-world deployments.

Abstract

In some real-world applications, data samples are usually distributed on local devices, where federated learning (FL) techniques are proposed to coordinate decentralized clients without directly sharing users' private data. FL commonly follows the parameter server architecture and contains multiple personalization and aggregation procedures. The natural data heterogeneity across clients, i.e., Non-I.I.D. data, challenges both the aggregation and personalization goals in FL. In this paper, we focus on a special kind of Non-I.I.D. scene where clients own incomplete classes, i.e., each client can only access a partial set of the whole class set. The server aims to aggregate a complete classification model that could generalize to all classes, while the clients are inclined to improve the performance of distinguishing their observed classes. For better model aggregation, we point out that the standard softmax will encounter several problems caused by missing classes and propose "restricted softmax" as an alternative. For better model personalization, we point out that the hard-won personalized models are not well exploited and propose "inherited private model" to store the personalization experience. Our proposed algorithm named MAP could simultaneously achieve the aggregation and personalization goals in FL. Abundant experimental studies verify the superiorities of our algorithm.

MAP: Model Aggregation and Personalization in Federated Learning with Incomplete Classes

TL;DR

MAP addresses federated learning under incomplete class distributions by unifying two components: Restricted Softmax (RS) to stabilize aggregation against missing classes and Inherited Private Model (HPM) to preserve and leverage historical personalization. The two-stage personalization in MAP—RS-driven initialization followed by standard training with knowledge transfer from HPM—enables simultaneous improvements in both global generalization and local client performance. Theoretical analysis and extensive experiments on multiple datasets demonstrate MAP’s ability to outperform standard FL methods and approach or exceed specialized aggregation or personalization baselines, while also offering scalability to large-scale settings like FEMNIST. This work provides a practical, principled approach to non-I.I.D. FL where clients observe only subsets of classes, with potential implications for fairness and robustness in real-world deployments.

Abstract

In some real-world applications, data samples are usually distributed on local devices, where federated learning (FL) techniques are proposed to coordinate decentralized clients without directly sharing users' private data. FL commonly follows the parameter server architecture and contains multiple personalization and aggregation procedures. The natural data heterogeneity across clients, i.e., Non-I.I.D. data, challenges both the aggregation and personalization goals in FL. In this paper, we focus on a special kind of Non-I.I.D. scene where clients own incomplete classes, i.e., each client can only access a partial set of the whole class set. The server aims to aggregate a complete classification model that could generalize to all classes, while the clients are inclined to improve the performance of distinguishing their observed classes. For better model aggregation, we point out that the standard softmax will encounter several problems caused by missing classes and propose "restricted softmax" as an alternative. For better model personalization, we point out that the hard-won personalized models are not well exploited and propose "inherited private model" to store the personalization experience. Our proposed algorithm named MAP could simultaneously achieve the aggregation and personalization goals in FL. Abundant experimental studies verify the superiorities of our algorithm.
Paper Structure (30 sections, 13 equations, 12 figures, 3 tables, 1 algorithm)

This paper contains 30 sections, 13 equations, 12 figures, 3 tables, 1 algorithm.

Figures (12)

  • Figure 1: The considered FL scene with incomplete classes.
  • Figure 2: The properties of softmax, i.e., Eq. \ref{['eq:update-w']}, Eq. \ref{['eq:update-miss-w']} and Eq. \ref{['eq:update-obser-w']}.
  • Figure 3: Model aggregation and personalization challenges caused by incomplete classes.
  • Figure 4: The architecture and procedure of the proposed MAP.
  • Figure 5: Visualization of learned features and proxies with 8, 5, and 2 observed classes, respectively.
  • ...and 7 more figures

Theorems & Definitions (2)

  • Definition 1: Goal of Aggregation in FL
  • Definition 2: Goal of Personalization in FL