The Fundamental Limits of Least-Privilege Learning

Theresa Stadler; Bogdan Kulynych; Michael C. Gastpar; Nicolas Papernot; Carmela Troncoso

The Fundamental Limits of Least-Privilege Learning

Theresa Stadler, Bogdan Kulynych, Michael C. Gastpar, Nicolas Papernot, Carmela Troncoso

TL;DR

The paper formalizes the least-privilege principle (LPP) for machine learning in MLaaS settings by bounding the maximal leakage of any sensitive attribute $S$ given the task-relevant representation $Z$ and the task label $Y$ through $I_\infty(S; Z \mid Y) \le \gamma$. It establishes a fundamental trade-off: as a representation's utility for predicting the target $Y$ (measured by $I_\alpha(Y; Z)$ with $\alpha \in {1,\infty}$) grows, there exists some $S \neq Y$ that can be inferred from $Z$ with risk not smaller than $\gamma$, under realistic assumptions such as strictly positive $P_{Y|X}$. The authors formalize LPP, contrast unconditional LPP with the conditional LPP, relate LPP to MNI/CEB and LDP, and prove imcompatibilities that imply LPP cannot, in general, outperform differential privacy in maintaining low leakage for all attributes. Through extensive empirical evaluation across image and tabular datasets, architectures, and learning methods, they demonstrate the persistent leakage (the “whack-a-mole” effect) and fundamental leakage risks, including leakage arising from task labels themselves. The results suggest that LPP, while conceptually appealing, does not by itself guarantee harmless representations in MLaaS and should be assessed alongside, or in combination with, other privacy-preserving approaches and contextual integrity considerations.

Abstract

The promise of least-privilege learning -- to find feature representations that are useful for a learning task but prevent inference of any sensitive information unrelated to this task -- is highly appealing. However, so far this concept has only been stated informally. It thus remains an open question whether and how we can achieve this goal. In this work, we provide the first formalisation of the least-privilege principle for machine learning and characterise its feasibility. We prove that there is a fundamental trade-off between a representation's utility for a given task and its leakage beyond the intended task: it is not possible to learn representations that have high utility for the intended task but, at the same time prevent inference of any attribute other than the task label itself. This trade-off holds under realistic assumptions on the data distribution and regardless of the technique used to learn the feature mappings that produce these representations. We empirically validate this result for a wide range of learning techniques, model architectures, and datasets.

The Fundamental Limits of Least-Privilege Learning

TL;DR

The paper formalizes the least-privilege principle (LPP) for machine learning in MLaaS settings by bounding the maximal leakage of any sensitive attribute

given the task-relevant representation

and the task label

through

. It establishes a fundamental trade-off: as a representation's utility for predicting the target

(measured by

with

) grows, there exists some

that can be inferred from

with risk not smaller than

, under realistic assumptions such as strictly positive

. The authors formalize LPP, contrast unconditional LPP with the conditional LPP, relate LPP to MNI/CEB and LDP, and prove imcompatibilities that imply LPP cannot, in general, outperform differential privacy in maintaining low leakage for all attributes. Through extensive empirical evaluation across image and tabular datasets, architectures, and learning methods, they demonstrate the persistent leakage (the “whack-a-mole” effect) and fundamental leakage risks, including leakage arising from task labels themselves. The results suggest that LPP, while conceptually appealing, does not by itself guarantee harmless representations in MLaaS and should be assessed alongside, or in combination with, other privacy-preserving approaches and contextual integrity considerations.

Abstract

Paper Structure (24 sections, 9 theorems, 22 equations, 10 figures)

This paper contains 24 sections, 9 theorems, 22 equations, 10 figures.

Introduction
Problem Setup
The Least-Privilege Principle in Machine Learning
Utility Measure
Leakage Measure
Strawman Approach: Unconditional Least-Privilege Principle
Formalisation of the Least-Privilege Principle
Related Formalisms.
LPP vs. LDP.
Empirical Evaluation
The Potential Harms of Fundamental Leakage
The Least-Privilege And Utility Trade-Off
Limitations and Future Work
Conclusions
Formal Details
...and 9 more sections

Key Result

Theorem 1

Suppose that $P_{Y \mid X}$ is strictly positive (ass:positivePosterior). Then, for $\alpha \in \{1, \infty\}$, the following two properties cannot hold at the same time:

Figures (10)

Figure 1: To prevent potential data misuse in a MLaaS setting, users share a representation of their data. These representations should be useful to achieve the intended purpose (verify users' identity) but prevent inference of other data attributes (users' gender) that might lead to harms, such as discrimination.
Figure 2: $\gamma$-LPP limits maximum utility $I_\alpha(Y;Z)$ of a representation $Z$ to the greyed-out region.
Figure 3: Fundamental leakage: the task label reveals information about other data attributes, which might not be obvious to data subjects. Attribute inference gain of the label-only adversary(left) and pairwise Pearson's correlation between attributes(right). In the LFWA+ dataset, the 'Attractive' label is highly correlated with the perceived gender. Thus, predicting the 'Attractive' label will reveal information about gender.
Figure 4: If the model-generated representations have utility for the task(right), there exists a sensitive attribute with an even higher inference gain for the adversary(left, red means more leakage). This holds for both standard ERM(top) and attribute censoring(bottom) where we censor the attribute with highest leakage in the respective ERM model (marked as $\textcolor{gray}{\tikzmarknode[strike out,draw]{2}{\footnotesize \faEye}}$). Censoring has a 'whack-a-mole' effect: as we censor one attribute, leakage of another attribute increases.
Figure 5: If the model-generated representations have utility for the task(right), there exists a sensitive attribute with an even higher inference gain for the adversary(left, red means more leakage). This holds for both standard ERM(top) and attribute censoring(bottom) where we censor the attribute with highest leakage in the respective ERM model (marked as $\textcolor{gray}{\tikzmarknode[strike out,draw]{2}{\footnotesize \faEye}}$). Censoring has a 'whack-a-mole' effect: as we censor one attribute, leakage of another attribute increases.
...and 5 more figures

Theorems & Definitions (20)

Definition 1: Unconditional LPP
Theorem 1: Unconditional LPP and Utility Trade-Off
Definition 2: LPP
Proposition 1
Corollary 1
Corollary 2
Theorem 2: LPP and Utility Trade-Off
Definition 3: LDP
Proposition 2
Lemma 1: IssaWK19
...and 10 more

The Fundamental Limits of Least-Privilege Learning

TL;DR

Abstract

The Fundamental Limits of Least-Privilege Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (20)