SI-ChainFL: Shapley-Incentivized Secure Federated Learning for High-Speed Rail Data Sharing

Mingjie Zhao; Cheng Dai; Fei Chen; Xin Chen; Kaoru Ota; Mianxiong Dong; Bing Guo

SI-ChainFL: Shapley-Incentivized Secure Federated Learning for High-Speed Rail Data Sharing

Mingjie Zhao, Cheng Dai, Fei Chen, Xin Chen, Kaoru Ota, Mianxiong Dong, Bing Guo

TL;DR

This work designs a blockchain-based consensus protocol for decentralized aggregation, where aggregation eligibility is tied to Shapley incentives, and quantifies client contributions using a Shapley value metric that jointly considers rare-event utility, data diversity, data quality, and timeliness.

Abstract

In high-speed rail (HSR) systems, federated learning (FL) enables cross-departmental flow prediction without sharing raw data. However, existing schemes suffer from two key limitations: (1) insufficient incentives, leading to free-riding and model poisoning; and (2) centralized aggregation, which introduces a single point of failure. We propose a secure and efficient framework SI-ChainFL that addresses these issues by combining contribution-aware incentives with decentralized aggregation. First, we quantify client contributions using a Shapley value metric that jointly considers rare-event utility, data diversity, data quality, and timeliness. To reduce computational overhead, we further develop a rare positive driven client clustering strategy to accelerate Shapley estimation. Moreover, we design a blockchain-based consensus protocol for decentralized aggregation, where aggregation eligibility is tied to Shapley incentives. This design motivates clients to submit high-quality updates and enables efficient and secure global aggregation. Experiments on MNIST, CIFAR 10 and CIFAR 100, and a HSR flow dataset show that SI ChainFL remains effective under 90% malicious clients in PA attacks, achieving 14.12% higher accuracy than RAGA. Theoretical analysis further guarantees an upper bound on performance

SI-ChainFL: Shapley-Incentivized Secure Federated Learning for High-Speed Rail Data Sharing

TL;DR

Abstract

Paper Structure (32 sections, 46 equations, 12 figures, 3 tables, 3 algorithms)

This paper contains 32 sections, 46 equations, 12 figures, 3 tables, 3 algorithms.

Introduction
Background
Motivation
Contributions
RELATED WORK
Incentive mechanisms for FL
Blockchain-enabled secure aggregation in FL
PRELIMINARIES
Federated Learning
Shapley Value
PROBLEM STATEMENT
System Model
Threat Model
Design Goals
Proposed Approach
...and 17 more sections

Figures (12)

Figure 1: High-speed rail data sharing application scenarios
Figure 2: System model
Figure 3: System workflow
Figure 4: Comparison of convergence performance between SI-ChainFL and FedAvg on MNIST, CIFAR-10, CIFAR-100 and HSR.
Figure 5: The impact of number of clients on model accuracy.
...and 7 more figures

SI-ChainFL: Shapley-Incentivized Secure Federated Learning for High-Speed Rail Data Sharing

TL;DR

Abstract

SI-ChainFL: Shapley-Incentivized Secure Federated Learning for High-Speed Rail Data Sharing

Authors

TL;DR

Abstract

Table of Contents

Figures (12)