Secure multiparty computations in floating-point arithmetic
Chuan Guo, Awni Hannun, Brian Knott, Laurens van der Maaten, Mark Tygert, Ruiyu Zhu
TL;DR
This work presents a practical framework for secure multiparty computation in floating-point arithmetic, enabling privacy-preserving machine learning without resorting to modular arithmetic. It combines additive sharing and Beaver multiplication, with rigorous information-leakage bounds and numerical stability analyses, to operate on standard double-precision hardware. The authors develop polynomial approximation techniques (Newton iterations, Chebyshev series, and softmax scaling) to securely compute common ML functions, and validate the approach on synthetic data and real datasets (MNIST, covtype, and horsekicks) using CrypTen on PyTorch. The results show near-plaintext accuracy and generalization, while quantifying controlled leakage and demonstrating feasible performance on commodity hardware, highlighting the method’s practical potential for privacy-preserving analytics.
Abstract
Secure multiparty computations enable the distribution of so-called shares of sensitive data to multiple parties such that the multiple parties can effectively process the data while being unable to glean much information about the data (at least not without collusion among all parties to put back together all the shares). Thus, the parties may conspire to send all their processed results to a trusted third party (perhaps the data provider) at the conclusion of the computations, with only the trusted third party being able to view the final results. Secure multiparty computations for privacy-preserving machine-learning turn out to be possible using solely standard floating-point arithmetic, at least with a carefully controlled leakage of information less than the loss of accuracy due to roundoff, all backed by rigorous mathematical proofs of worst-case bounds on information loss and numerical stability in finite-precision arithmetic. Numerical examples illustrate the high performance attained on commodity off-the-shelf hardware for generalized linear models, including ordinary linear least-squares regression, binary and multinomial logistic regression, probit regression, and Poisson regression.
