DEFEND: Poisoned Model Detection and Malicious Client Exclusion Mechanism for Secure Federated Learning-based Road Condition Classification

Sheng Liu; Panos Papadimitratos

DEFEND: Poisoned Model Detection and Malicious Client Exclusion Mechanism for Secure Federated Learning-based Road Condition Classification

Sheng Liu, Panos Papadimitratos

TL;DR

This work tackles the vulnerability of federated learning for road condition classification (RCC) to targeted label-flipping attacks by introducing DEFEND, a defense that detects poisoned updates through neuron-wise magnitude analysis of the output layer and clusters the affected parameter changes with a Gaussian Mixture Model. It then validates the global model using Source Recall and Attack Success Rate thresholds and enforces adaptive client exclusion based on per-client ratings. Extensive experiments across multiple RCC tasks and neural architectures demonstrate that DEFEND not only thwarts TLFA but also matches or surpasses attack-free performance, outperforming seven baselines by notable margins. The approach offers a practical, scalable security primitive for ITS deployments with privacy-preserving federated learning Cooperative RCC systems.

Abstract

Federated Learning (FL) has drawn the attention of the Intelligent Transportation Systems (ITS) community. FL can train various models for ITS tasks, notably camera-based Road Condition Classification (RCC), in a privacy-preserving collaborative way. However, opening up to collaboration also opens FL-based RCC systems to adversaries, i.e., misbehaving participants that can launch Targeted Label-Flipping Attacks (TLFAs) and threaten transportation safety. Adversaries mounting TLFAs poison training data to misguide model predictions, from an actual source class (e.g., wet road) to a wrongly perceived target class (e.g., dry road). Existing countermeasures against poisoning attacks cannot maintain model performance under TLFAs close to the performance level in attack-free scenarios, because they lack specific model misbehavior detection for TLFAs and neglect client exclusion after the detection. To close this research gap, we propose DEFEND, which includes a poisoned model detection strategy that leverages neuron-wise magnitude analysis for attack goal identification and Gaussian Mixture Model (GMM)-based clustering. DEFEND discards poisoned model contributions in each round and adapts accordingly client ratings, eventually excluding malicious clients. Extensive evaluation involving various FL-RCC models and tasks shows that DEFEND can thwart TLFAs and outperform seven baseline countermeasures, with at least 15.78% improvement, with DEFEND remarkably achieving under attack the same performance as in attack-free scenarios.

DEFEND: Poisoned Model Detection and Malicious Client Exclusion Mechanism for Secure Federated Learning-based Road Condition Classification

TL;DR

Abstract

DEFEND: Poisoned Model Detection and Malicious Client Exclusion Mechanism for Secure Federated Learning-based Road Condition Classification

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)