A Differentially Private Blockchain-Based Approach for Vertical Federated Learning
Linh Tran, Sanjay Chari, Md. Saikat Islam Khan, Aaron Zachariah, Stacy Patterson, Oshani Seneviratne
TL;DR
This work tackles data privacy and trust in vertical federated learning by introducing DP-BBVFL, a serverless framework that uses a private blockchain as the aggregation layer and applies local differential privacy via the Poisson Binomial Mechanism to protect embeddings exposed on-chain. It provides provable privacy guarantees in the form of $$(\alpha,\epsilon)$$-RDP and achieves verifiability through smart contracts and on-chain aggregation, while reducing reliance on a central trusted party. Empirical evaluation on Breast Cancer and MIMIC-III demonstrates competitive accuracy with a clear privacy-utility trade-off, albeit with increased training time due to on-chain operations. The approach broadens the applicability of privacy-preserving distributed learning to sensitive domains like healthcare by enabling transparent, trustworthy collaboration across decentralized institutions.
Abstract
We present the Differentially Private Blockchain-Based Vertical Federal Learning (DP-BBVFL) algorithm that provides verifiability and privacy guarantees for decentralized applications. DP-BBVFL uses a smart contract to aggregate the feature representations, i.e., the embeddings, from clients transparently. We apply local differential privacy to provide privacy for embeddings stored on a blockchain, hence protecting the original data. We provide the first prototype application of differential privacy with blockchain for vertical federated learning. Our experiments with medical data show that DP-BBVFL achieves high accuracy with a tradeoff in training time due to on-chain aggregation. This innovative fusion of differential privacy and blockchain technology in DP-BBVFL could herald a new era of collaborative and trustworthy machine learning applications across several decentralized application domains.
