Differential privacy and Sublinear time are incompatible sometimes

Jeremiah Blocki; Hendrik Fichtenberger; Elena Grigorescu; Tamalika Mukherjee

Differential privacy and Sublinear time are incompatible sometimes

Jeremiah Blocki, Hendrik Fichtenberger, Elena Grigorescu, Tamalika Mukherjee

TL;DR

This paper investigates the compatibility of differential privacy with sublinear-time computation and shows, via fingerprinting-code-based lower bounds, that the two properties can be inherently incompatible in general. By constructing a problem based on one-way marginals and employing a PR-PPC fingerprinting-code framework augmented with a secret-sharing encoding, the authors establish a separation: there exist DP algorithms running in near-linear time, while sublinear-time DP algorithms cannot achieve nontrivial accuracy without violating DP, up to logarithmic factors. The main technical contribution is a two-pronged lower-bound approach (random-oracle warm-up and secret-sharing encoding) that demonstrates any $(1/3,1/3)$-accurate DP algorithm must read essentially all data, yielding a near-linear time requirement. Overall, the work clarifies fundamental limits on achieving DP under sublinear access models and provides a principled methodology for deriving such lower bounds in the presence of complex access patterns.

Abstract

Differential privacy and sublinear algorithms are both rapidly emerging algorithmic themes in times of big data analysis. Although recent works have shown the existence of differentially private sublinear algorithms for many problems including graph parameter estimation and clustering, little is known regarding hardness results on these algorithms. In this paper, we initiate the study of lower bounds for problems that aim for both differentially-private and sublinear-time algorithms. Our main result is the incompatibility of both the desiderata in the general case. In particular, we prove that a simple problem based on one-way marginals yields both a differentially-private algorithm, as well as a sublinear-time algorithm, but does not admit a ``strictly'' sublinear-time algorithm that is also differentially private.

Differential privacy and Sublinear time are incompatible sometimes

TL;DR

-accurate DP algorithm must read essentially all data, yielding a near-linear time requirement. Overall, the work clarifies fundamental limits on achieving DP under sublinear access models and provides a principled methodology for deriving such lower bounds in the presence of complex access patterns.

Abstract

Paper Structure (8 sections, 19 theorems, 4 equations, 2 algorithms)

This paper contains 8 sections, 19 theorems, 4 equations, 2 algorithms.

Introduction
Technical Overview
Related Work
Preliminaries
Permute Rows and Pad and Permute Columns Fingerprinting Codes (PR-PPC FPC)
Lower Bound
Warm Up: Using a Random Oracle
Using a Secret Sharing Encoding

Key Result

Theorem 1

There exists a problem ${\mathcal{P}}\xspace$ of size $m$ such that

Theorems & Definitions (44)

Theorem 1: Informal Theorem
Remark
Definition 1: Differential Privacy DworkMNS06
Definition 2: Accuracy
Definition 3: Fingerprinting Codes BunUV18
Remark
Theorem 2: Tardos Fingerprinting Code Tardos08
Theorem 3: Gaussian Mechanism, dwork2014algorithmic
Lemma 1
proof
...and 34 more

Differential privacy and Sublinear time are incompatible sometimes

TL;DR

Abstract

Differential privacy and Sublinear time are incompatible sometimes

Authors

TL;DR

Abstract

Table of Contents

Key Result

Theorems & Definitions (44)