N-Parties Private Structure and Parameter Learning for Sum-Product Networks
Xenia Heilmann, Ernst Althaus, Mattia Cerrato, Nick Johannes Peter Rassau, Mohammad Sadeq Dousti, Stefan Kramer
TL;DR
The paper tackles privacy-preserving density estimation by introducing an N-party secret-sharing protocol for jointly learning SPN structures and parameters, followed by private inference. It builds a forest of RAT-SPNs, trains and weights structures privately on distributed data, and learns sum and leaf parameters without exposing private data. Private inference is supported via secure multiparty computation over shared parameters and inputs. Empirical results show log-likelihood remains robust under privacy constraints and that the approach scales favorably compared to SMPC baselines for neural networks, with practical runtimes on modest datasets and party counts.
Abstract
A sum-product network (SPN) is a graphical model that allows several types of probabilistic inference to be performed efficiently. In this paper, we propose a privacy-preserving protocol which tackles structure generation and parameter learning of SPNs. Additionally, we provide a protocol for private inference on SPNs, subsequent to training. To preserve the privacy of the participants, we derive our protocol based on secret sharing, which guarantees privacy in the honest-but-curious setting even when at most half of the parties cooperate to disclose the data. The protocol makes use of a forest of randomly generated SPNs, which is trained and weighted privately and can then be used for private inference on data points. Our experiments indicate that preserving the privacy of all participants does not decrease log-likelihood performance on both homogeneously and heterogeneously partitioned data. We furthermore show that our protocol's performance is comparable to current state-of-the-art SPN learners in homogeneously partitioned data settings. In terms of runtime and memory usage, we demonstrate that our implementation scales well when increasing the number of parties, comparing favorably to protocols for neural networks, when they are trained to reproduce the input-output behavior of SPNs.
