FedMID: A Data-Free Method for Using Intermediate Outputs as a Defense Mechanism Against Poisoning Attacks in Federated Learning
Sungwon Han, Hyeonho Song, Sungwon Park, Meeyoung Cha
TL;DR
This work tackles poisoning attacks in federated learning by challenging the reliance on model parameters for defense. It introduces FedMID, a data-free method that evaluates and compares the models’ functional mappings through their intermediate outputs obtained from synthetic data, guarded by normalization and distance-based analysis across layers. FedMID demonstrates superior robustness across non-IID settings, multiple model architectures, and adaptive attack scenarios, outperforming traditional parameter-based defenses in accuracy and security metrics. The approach preserves privacy by avoiding real data usage and provides a practical, scalable defense mechanism for safe cross-client collaboration in FL.
Abstract
Federated learning combines local updates from clients to produce a global model, which is susceptible to poisoning attacks. Most previous defense strategies relied on vectors derived from projections of local updates on a Euclidean space; however, these methods fail to accurately represent the functionality and structure of local models, resulting in inconsistent performance. Here, we present a new paradigm to defend against poisoning attacks in federated learning using functional mappings of local models based on intermediate outputs. Experiments show that our mechanism is robust under a broad range of computing conditions and advanced attack scenarios, enabling safer collaboration among data-sensitive participants via federated learning.
