Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy
Feng Wang, M. Cenk Gursoy, Senem Velipasalar
TL;DR
This work tackles the communication bottlenecks of federated learning by introducing feature-based federated transfer learning (FbFTL), which uploads extracted features and outputs instead of gradients to drastically reduce uplink traffic. FbFTL partitions a pre-trained source model, fixes the feature extractor, and trains only a task-specific sub-model, enabling order-of-magnitude reductions in uplink and downlink payloads while shifting computation toward the server. The paper provides rigorous payload, time-complexity, robustness, and privacy analyses, including new privacy notions for label privacy leakage and feature privacy leakage, and validates the approach on image (VGG-16 CIFAR-10) and NLP (FLAN-T5-small) tasks with differential privacy considerations. The results demonstrate that FbFTL achieves comparable accuracy to gradient-based baselines under much smaller communication budgets, improved resilience to packet loss, and practical privacy protections, highlighting its potential for deployment in bandwidth-constrained, privacy-sensitive federated settings.
Abstract
In this paper, we propose feature-based federated transfer learning as a novel approach to improve communication efficiency by reducing the uplink payload by multiple orders of magnitude compared to that of existing approaches in federated learning and federated transfer learning. Specifically, in the proposed feature-based federated learning, we design the extracted features and outputs to be uploaded instead of parameter updates. For this distributed learning model, we determine the required payload and provide comparisons with the existing schemes. Subsequently, we analyze the robustness of feature-based federated transfer learning against packet loss, data insufficiency, and quantization. Finally, we address privacy considerations by defining and analyzing label privacy leakage and feature privacy leakage, and investigating mitigating approaches. For all aforementioned analyses, we evaluate the performance of the proposed learning scheme via experiments on an image classification task and a natural language processing task to demonstrate its effectiveness.
