MPCPA: Multi-Center Privacy Computing with Predictions Aggregation based on Denoising Diffusion Probabilistic Model
Guibo Luo, Hanwen Zhang, Xiuling Wang, Mingzhi Chen, Yuesheng Zhu
TL;DR
The paper addresses privacy-preserving multi-center learning under Non-IID data and stringent communication constraints. It introduces MPCPA, a framework that combines conditional diffusion model training, DDPM-based data generation, per-client classifiers, and ensemble aggregation of predictions to avoid sharing raw data. Empirical results show MPCPA matches or surpasses centralized learning on several natural and medical datasets and consistently outperforms Federated Learning, while requiring far fewer communications ($3n$ transmissions versus $2n\cdot\text{iters}$). The approach also demonstrates robustness to image memorization and membership inference attacks, suggesting practical applicability for privacy-preserving collaboration in distributed settings.
Abstract
Privacy-preserving computing is crucial for multi-center machine learning in many applications such as healthcare and finance. In this paper a Multi-center Privacy Computing framework with Predictions Aggregation (MPCPA) based on denoising diffusion probabilistic model (DDPM) is proposed, in which conditional diffusion model training, DDPM data generation, a classifier, and strategy of prediction aggregation are included. Compared to federated learning, this framework necessitates fewer communications and leverages high-quality generated data to support robust privacy computing. Experimental validation across multiple datasets demonstrates that the proposed framework outperforms classic federated learning and approaches the performance of centralized learning with original data. Moreover, our approach demonstrates robust security, effectively addressing challenges such as image memorization and membership inference attacks. Our experiments underscore the efficacy of the proposed framework in the realm of privacy computing, with the code set to be released soon.
