Distributed Differentially Private Data Analytics via Secure Sketching
Jakob Burkhardt, Hannah Keller, Claudio Orlandi, Chris Schwiegelshohn
TL;DR
This work introduces the linear-transformation model (LTM) for distributed differential privacy, enabling secure linear sketches to be computed across multiple servers via MPC while limiting per-client noise. By leveraging oblivious subspace embeddings and Johnson-Lindenstrauss transforms, the authors provide DP mechanisms for private low-rank approximation and ridge regression that approach central-DP utility with fewer privacy losses than local models. They formalize privacy guarantees under a multi-central model, develop dense and sparse JL-based sketching mechanisms, and connect the approach to cryptographic assumptions and the shuffle model. Empirical results, including MPC-based running-time experiments and real-world data, demonstrate that LTM can interpolate between local and central DP as the number of clients grows, offering practical, scalable DP with strong utility guarantees. Overall, LTM provides a viable middle ground for distributed DP by trading expressiveness for efficiency while preserving strong privacy and utility in linear-algebraic tasks.
Abstract
We introduce the linear-transformation model, a distributed model of differentially private data analysis. Clients have access to a trusted platform capable of applying a public matrix to their inputs. Such computations can be securely distributed across multiple servers using simple and efficient secure multiparty computation techniques. The linear-transformation model serves as an intermediate model between the highly expressive central model and the minimal local model. In the central model, clients have access to a trusted platform capable of applying any function to their inputs. However, this expressiveness comes at a cost, as it is often prohibitively expensive to distribute such computations, leading to the central model typically being implemented by a single trusted server. In contrast, the local model assumes no trusted platform, which forces clients to add significant noise to their data. The linear-transformation model avoids the single point of failure for privacy present in the central model, while also mitigating the high noise required in the local model. We demonstrate that linear transformations are very useful for differential privacy, allowing for the computation of linear sketches of input data. These sketches largely preserve utility for tasks such as private low-rank approximation and private ridge regression, while introducing only minimal error, critically independent of the number of clients.
