Federated Prediction-Powered Inference from Decentralized Data
Ping Luo, Xiaoge Deng, Ziqing Wen, Tao Sun, Dongsheng Li
TL;DR
Fed-PPI unites federated learning with Prediction-Powered Inference to enable statistically valid conclusions from decentralized, private data without data sharing. The framework defines aggregation rules, imputed gradients, and empirical rectifiers to produce prediction-powered confidence intervals for convex and nonconvex estimands, with concrete algorithms for mean, quantile, logistic, and linear regression. Theoretical guarantees (finite-sample and asymptotic) and extensive experiments across real tasks and simulations demonstrate that Fed-PPI delivers intervals with valid coverage close to centralized analyses, while accommodating data heterogeneity and unlabeled data. This approach addresses data silos and privacy concerns, with practical impact for diverse scientific domains relying on collaborative yet privacy-preserving inference.
Abstract
In various domains, the increasing application of machine learning allows researchers to access inexpensive predictive data, which can be utilized as auxiliary data for statistical inference. Although such data are often unreliable compared to gold-standard datasets, Prediction-Powered Inference (PPI) has been proposed to ensure statistical validity despite the unreliability. However, the challenge of `data silos' arises when the private gold-standard datasets are non-shareable for model training, leading to less accurate predictive models and invalid inferences. In this paper, we introduces the Federated Prediction-Powered Inference (Fed-PPI) framework, which addresses this challenge by enabling decentralized experimental data to contribute to statistically valid conclusions without sharing private information. The Fed-PPI framework involves training local models on private data, aggregating them through Federated Learning (FL), and deriving confidence intervals using PPI computation. The proposed framework is evaluated through experiments, demonstrating its effectiveness in producing valid confidence intervals.
