Decentralized Collaborative Learning Framework with External Privacy Leakage Analysis
Tsuyoshi Idé, Dzung T. Phan, Rudy Raymond
TL;DR
The paper tackles privacy-preserving decentralized multi-task density estimation for unsupervised anomaly detection in blockchain-like networks. It extends the CollabDict framework to support deep models via a multi-task VAE, enabling expressive anomaly detection while maintaining decentralization. It provides a theoretical external privacy leakage guarantee using Rényi differential privacy for Gaussian mixture trials and introduces a practical entropy-based monitor for internal privacy breaches. The work demonstrates how to combine dynamical consensus, random data chunking, and Bayesian DL to enable privacy-aware collaborative learning with potential applications in next-generation blockchain platforms.
Abstract
This paper presents two methodological advancements in decentralized multi-task learning under privacy constraints, aiming to pave the way for future developments in next-generation Blockchain platforms. First, we expand the existing framework for collaborative dictionary learning (CollabDict), which has previously been limited to Gaussian mixture models, by incorporating deep variational autoencoders (VAEs) into the framework, with a particular focus on anomaly detection. We demonstrate that the VAE-based anomaly score function shares the same mathematical structure as the non-deep model, and provide comprehensive qualitative comparison. Second, considering the widespread use of "pre-trained models," we provide a mathematical analysis on data privacy leakage when models trained with CollabDict are shared externally. We show that the CollabDict approach, when applied to Gaussian mixtures, adheres to a Renyi differential privacy criterion. Additionally, we propose a practical metric for monitoring internal privacy breaches during the learning process.
