Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy

Feng Wang; M. Cenk Gursoy; Senem Velipasalar

Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy

Feng Wang, M. Cenk Gursoy, Senem Velipasalar

TL;DR

This work tackles the communication bottlenecks of federated learning by introducing feature-based federated transfer learning (FbFTL), which uploads extracted features and outputs instead of gradients to drastically reduce uplink traffic. FbFTL partitions a pre-trained source model, fixes the feature extractor, and trains only a task-specific sub-model, enabling order-of-magnitude reductions in uplink and downlink payloads while shifting computation toward the server. The paper provides rigorous payload, time-complexity, robustness, and privacy analyses, including new privacy notions for label privacy leakage and feature privacy leakage, and validates the approach on image (VGG-16 CIFAR-10) and NLP (FLAN-T5-small) tasks with differential privacy considerations. The results demonstrate that FbFTL achieves comparable accuracy to gradient-based baselines under much smaller communication budgets, improved resilience to packet loss, and practical privacy protections, highlighting its potential for deployment in bandwidth-constrained, privacy-sensitive federated settings.

Abstract

In this paper, we propose feature-based federated transfer learning as a novel approach to improve communication efficiency by reducing the uplink payload by multiple orders of magnitude compared to that of existing approaches in federated learning and federated transfer learning. Specifically, in the proposed feature-based federated learning, we design the extracted features and outputs to be uploaded instead of parameter updates. For this distributed learning model, we determine the required payload and provide comparisons with the existing schemes. Subsequently, we analyze the robustness of feature-based federated transfer learning against packet loss, data insufficiency, and quantization. Finally, we address privacy considerations by defining and analyzing label privacy leakage and feature privacy leakage, and investigating mitigating approaches. For all aforementioned analyses, we evaluate the performance of the proposed learning scheme via experiments on an image classification task and a natural language processing task to demonstrate its effectiveness.

Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy

TL;DR

Abstract

Paper Structure (24 sections, 2 theorems, 37 equations, 12 figures, 5 tables, 1 algorithm)

This paper contains 24 sections, 2 theorems, 37 equations, 12 figures, 5 tables, 1 algorithm.

Introduction
Contributions
Organization and Notations
Preliminaries
Federated Learning
Federated Transfer Learning
Other Recent Transfer and Federated Learning Schemes
Feature-based Federated Transfer Learning
Learning Structure
Payload Analysis
Time Complexity Analysis
Experimental Results
Robustness Analysis
Packet Loss
Data Insufficiency
...and 9 more sections

Key Result

Lemma 1

Assume that the true sample distribution is known. Once $U$ batches are revealed to the adversary, the distribution of the batches (from the adversary's perspective) depends only on the frequency/count of each batch type in the unveiled $U$ batches and not on the sample distribution, i.e.,

Figures (12)

Figure 1: Diagram of the iterative training process of model-based federated transfer learning.
Figure 2: Diagram of the training process of feature-based federated transfer learning, which reduces to one-time communication of the output and the intermediate output (i.e., extracted features).
Figure 3: ImageNet samples with labels.
Figure 4: CIFAR-10 samples with labels.
Figure 5: Diagram of the VGG-16 model for training on CIFAR-10 dataset. "Conv $\{$receptive field size$\}$ - $\{$number of output channels$\}$" depicts the convolutional layers. "FC $\{$output size$\}$" depicts the fully connected layers.
...and 7 more figures

Theorems & Definitions (6)

Definition 1
Lemma 1
proof
Definition 2
Lemma 2
proof

Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy

TL;DR

Abstract

Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (6)