TrustChain: A Blockchain Framework for Auditing and Verifying Aggregators in Decentralized Federated Learning
Ehsan Hallaji, Roozbeh Razavi-Far, Mehrdad Saif
TL;DR
TrustChain addresses the risk of rogue aggregators in decentralized federated learning (DFL) by introducing a two-stage, blockchain-based verification: a Pre-Selection Evaluation that scores candidates using concept-drift-inspired cosine similarity over a sliding window, and a Post-Aggregation Auditing that uses the Hilbert-Schmidt Independence Criterion (HSIC) to test the independence between the aggregated model and the current client updates. The approach combines a lightweight, topology-agnostic trust mechanism with auditable on-chain records, supported by a simple permissioned blockchain and smart contracts that trigger the pre- and post-checks. Empirical results across multiple datasets and attack scenarios show that TrustChain effectively mitigates malicious aggregators, maintaining accuracy close to benign conditions and outperforming non-audited baselines. The work contributes a practical, low-overhead framework for continuous aggregator verification in DFL, with potential for integration into broader secure, privacy-preserving distributed learning systems.
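As a rough illustration of the pre-selection idea, the sketch below scores a candidate by the mean cosine similarity between its newest update and the updates held in a sliding window. The class name `WindowedTrustScore`, the window length, and the choice to return 1.0 when no history exists are all assumptions made for this example; the summary above does not specify how TrustChain computes its score.

```python
from collections import deque

import numpy as np


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either vector is all zeros."""
    na, nb = np.linalg.norm(a), np.linalg.norm(b)
    if na == 0.0 or nb == 0.0:
        return 0.0
    return float(a @ b / (na * nb))


class WindowedTrustScore:
    """Hypothetical drift-style score over a candidate's recent updates."""

    def __init__(self, window=5):
        # Only the last `window` updates are kept (assumed window length).
        self.history = deque(maxlen=window)

    def update(self, vec):
        vec = np.asarray(vec, dtype=float)
        if self.history:
            # Mean similarity to every update still inside the window.
            score = float(np.mean([cosine(vec, h) for h in self.history]))
        else:
            # Assumption: with no history there is no evidence of drift.
            score = 1.0
        self.history.append(vec)
        return score


tracker = WindowedTrustScore(window=3)
steady_first = tracker.update([1.0, 0.0])
steady_second = tracker.update([1.0, 0.0])
flipped = tracker.update([-1.0, 0.0])  # abrupt change of direction
```

A candidate whose updates keep pointing in a similar direction scores near 1, while an abrupt reversal drives the score toward -1, which is the kind of signal a drift-inspired check would flag.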
Abstract
The server-less nature of Decentralized Federated Learning (DFL) requires allocating the aggregation role to specific participants in each federated round. Current DFL architectures verify the trustworthiness of the aggregator node at selection time, but most overlook the possibility that the node may turn rogue and act maliciously after being nominated. To address this problem, this paper proposes a DFL structure, called TrustChain, that scores candidate aggregators before selection based on their past behavior and additionally audits them after aggregation. For the audit, the statistical independence between the client updates and the aggregated model is continuously monitored using the Hilbert-Schmidt Independence Criterion (HSIC). The proposed method draws on several principles, including blockchain, anomaly detection, and concept drift analysis. The design is evaluated on several federated datasets and attack scenarios with different numbers of Byzantine nodes.
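To make the HSIC-based audit more concrete, the following minimal sketch computes the standard biased empirical HSIC estimate, trace(KHLH)/(n-1)^2, with Gaussian kernels. Several choices here are assumptions for illustration and are not taken from the paper: each model coordinate is treated as one paired sample, the kernel bandwidth comes from the median heuristic, and the "rogue" aggregate is simply random noise.

```python
import numpy as np


def rbf_gram(x):
    """Gaussian Gram matrix with a median-heuristic bandwidth (assumed choice)."""
    x = np.asarray(x, dtype=float)
    if x.ndim == 1:
        x = x[:, None]
    sq = np.sum(x ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * (x @ x.T), 0.0)
    med = np.median(d2[np.triu_indices_from(d2, k=1)])
    sigma2 = 0.5 * med if med > 0 else 1.0
    return np.exp(-d2 / (2.0 * sigma2))


def hsic(x, y):
    """Biased empirical HSIC: trace(K H L H) / (n - 1)^2."""
    K, L = rbf_gram(x), rbf_gram(y)
    n = K.shape[0]
    H = np.eye(n) - np.ones((n, n)) / n  # centering matrix
    return float(np.trace(K @ H @ L @ H)) / (n - 1) ** 2


rng = np.random.default_rng(0)
# Hypothetical round: 5 clients, 300 flattened parameters each.
updates = rng.normal(size=(5, 300))
honest_aggregate = updates.mean(axis=0)   # plain averaging of the updates
rogue_aggregate = rng.normal(size=300)    # aggregate unrelated to the updates

# Rows of updates.T are per-coordinate samples across clients.
hsic_honest = hsic(updates.T, honest_aggregate)
hsic_rogue = hsic(updates.T, rogue_aggregate)
```

Under these assumptions, an aggregate that genuinely derives from the client updates yields a noticeably larger HSIC value than one that is unrelated to them, so a low value can serve as an audit signal. How TrustChain forms the samples and sets its decision threshold is not stated in the abstract.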
