Table of Contents
Fetching ...

The Fundamental Limits of Least-Privilege Learning

Theresa Stadler, Bogdan Kulynych, Michael C. Gastpar, Nicolas Papernot, Carmela Troncoso

TL;DR

The paper formalizes the least-privilege principle (LPP) for machine learning in MLaaS settings by bounding the maximal leakage of any sensitive attribute $S$ given the task-relevant representation $Z$ and the task label $Y$ through $I_\infty(S; Z \mid Y) \le \gamma$. It establishes a fundamental trade-off: as a representation's utility for predicting the target $Y$ (measured by $I_\alpha(Y; Z)$ with $\alpha \in {1,\infty}$) grows, there exists some $S \neq Y$ that can be inferred from $Z$ with risk not smaller than $\gamma$, under realistic assumptions such as strictly positive $P_{Y|X}$. The authors formalize LPP, contrast unconditional LPP with the conditional LPP, relate LPP to MNI/CEB and LDP, and prove imcompatibilities that imply LPP cannot, in general, outperform differential privacy in maintaining low leakage for all attributes. Through extensive empirical evaluation across image and tabular datasets, architectures, and learning methods, they demonstrate the persistent leakage (the “whack-a-mole” effect) and fundamental leakage risks, including leakage arising from task labels themselves. The results suggest that LPP, while conceptually appealing, does not by itself guarantee harmless representations in MLaaS and should be assessed alongside, or in combination with, other privacy-preserving approaches and contextual integrity considerations.

Abstract

The promise of least-privilege learning -- to find feature representations that are useful for a learning task but prevent inference of any sensitive information unrelated to this task -- is highly appealing. However, so far this concept has only been stated informally. It thus remains an open question whether and how we can achieve this goal. In this work, we provide the first formalisation of the least-privilege principle for machine learning and characterise its feasibility. We prove that there is a fundamental trade-off between a representation's utility for a given task and its leakage beyond the intended task: it is not possible to learn representations that have high utility for the intended task but, at the same time prevent inference of any attribute other than the task label itself. This trade-off holds under realistic assumptions on the data distribution and regardless of the technique used to learn the feature mappings that produce these representations. We empirically validate this result for a wide range of learning techniques, model architectures, and datasets.

The Fundamental Limits of Least-Privilege Learning

TL;DR

The paper formalizes the least-privilege principle (LPP) for machine learning in MLaaS settings by bounding the maximal leakage of any sensitive attribute given the task-relevant representation and the task label through . It establishes a fundamental trade-off: as a representation's utility for predicting the target (measured by with ) grows, there exists some that can be inferred from with risk not smaller than , under realistic assumptions such as strictly positive . The authors formalize LPP, contrast unconditional LPP with the conditional LPP, relate LPP to MNI/CEB and LDP, and prove imcompatibilities that imply LPP cannot, in general, outperform differential privacy in maintaining low leakage for all attributes. Through extensive empirical evaluation across image and tabular datasets, architectures, and learning methods, they demonstrate the persistent leakage (the “whack-a-mole” effect) and fundamental leakage risks, including leakage arising from task labels themselves. The results suggest that LPP, while conceptually appealing, does not by itself guarantee harmless representations in MLaaS and should be assessed alongside, or in combination with, other privacy-preserving approaches and contextual integrity considerations.

Abstract

The promise of least-privilege learning -- to find feature representations that are useful for a learning task but prevent inference of any sensitive information unrelated to this task -- is highly appealing. However, so far this concept has only been stated informally. It thus remains an open question whether and how we can achieve this goal. In this work, we provide the first formalisation of the least-privilege principle for machine learning and characterise its feasibility. We prove that there is a fundamental trade-off between a representation's utility for a given task and its leakage beyond the intended task: it is not possible to learn representations that have high utility for the intended task but, at the same time prevent inference of any attribute other than the task label itself. This trade-off holds under realistic assumptions on the data distribution and regardless of the technique used to learn the feature mappings that produce these representations. We empirically validate this result for a wide range of learning techniques, model architectures, and datasets.
Paper Structure (24 sections, 9 theorems, 22 equations, 10 figures)

This paper contains 24 sections, 9 theorems, 22 equations, 10 figures.

Key Result

Theorem 1

Suppose that $P_{Y \mid X}$ is strictly positive (ass:positivePosterior). Then, for $\alpha \in \{1, \infty\}$, the following two properties cannot hold at the same time:

Figures (10)

  • Figure 1: To prevent potential data misuse in a MLaaS setting, users share a representation of their data. These representations should be useful to achieve the intended purpose (verify users' identity) but prevent inference of other data attributes (users' gender) that might lead to harms, such as discrimination.
  • Figure 2: $\gamma$-LPP limits maximum utility $I_\alpha(Y;Z)$ of a representation $Z$ to the greyed-out region.
  • Figure 3: Fundamental leakage: the task label reveals information about other data attributes, which might not be obvious to data subjects. Attribute inference gain of the label-only adversary(left) and pairwise Pearson's correlation between attributes(right). In the LFWA+ dataset, the 'Attractive' label is highly correlated with the perceived gender. Thus, predicting the 'Attractive' label will reveal information about gender.
  • Figure 4: If the model-generated representations have utility for the task(right), there exists a sensitive attribute with an even higher inference gain for the adversary(left, red means more leakage). This holds for both standard ERM(top) and attribute censoring(bottom) where we censor the attribute with highest leakage in the respective ERM model (marked as $\textcolor{gray}{\tikzmarknode[strike out,draw]{2}{\footnotesize \faEye}}$). Censoring has a 'whack-a-mole' effect: as we censor one attribute, leakage of another attribute increases.
  • Figure 5: If the model-generated representations have utility for the task(right), there exists a sensitive attribute with an even higher inference gain for the adversary(left, red means more leakage). This holds for both standard ERM(top) and attribute censoring(bottom) where we censor the attribute with highest leakage in the respective ERM model (marked as $\textcolor{gray}{\tikzmarknode[strike out,draw]{2}{\footnotesize \faEye}}$). Censoring has a 'whack-a-mole' effect: as we censor one attribute, leakage of another attribute increases.
  • ...and 5 more figures

Theorems & Definitions (20)

  • Definition 1: Unconditional LPP
  • Theorem 1: Unconditional LPP and Utility Trade-Off
  • Definition 2: LPP
  • Proposition 1
  • Corollary 1
  • Corollary 2
  • Theorem 2: LPP and Utility Trade-Off
  • Definition 3: LDP
  • Proposition 2
  • Lemma 1: IssaWK19
  • ...and 10 more