An Algorithm for Enhancing Privacy-Utility Tradeoff in the Privacy Funnel and Other Lift-based Measures

Mohammad Amin Zarrabian; Parastoo Sadeghi

An Algorithm for Enhancing Privacy-Utility Tradeoff in the Privacy Funnel and Other Lift-based Measures

Mohammad Amin Zarrabian, Parastoo Sadeghi

TL;DR

The paper tackles the privacy-utility tradeoff in the privacy funnel by replacing the standard MI-based privacy constraint with a semi-pointwise measure $L(y)$, enabling more tractable optimization and improved utility at a fixed privacy budget. It builds on max-lift ideas by leveraging the average information density per observation and introduces a heuristic algorithm that combines high-ε lift-polytope vertices to identify extreme privacy points, yielding higher utility $I(X;Y)$ under the same privacy constraint. The approach demonstrates superior performance to prior lift-based and subset-merging methods across lift-based measures including $\ell_{1}$-norm and $\chi^{2}$-divergence, and it aligns with theoretical benchmarks in the strong $\chi^{2}$ framework. These results suggest practical gains for privacy-preserving data sharing using lift-based privacy measures in diverse settings.

Abstract

This paper investigates the privacy funnel, a privacy-utility tradeoff problem in which mutual information quantifies both privacy and utility. The objective is to maximize utility while adhering to a specified privacy budget. However, the privacy funnel represents a non-convex optimization problem, making it challenging to achieve an optimal solution. An existing proposed approach to this problem involves substituting the mutual information with the lift (the exponent of information density) and then solving the optimization. Since mutual information is the expectation of the information density, this substitution overestimates the privacy loss and results in a final smaller bound on the privacy of mutual information than what is allowed in the budget. This significantly compromises the utility. To overcome this limitation, we propose using a privacy measure that is more relaxed than the lift but stricter than mutual information while still allowing the optimization to be efficiently solved. Instead of directly using information density, our proposed measure is the average of information density over the sensitive data distribution for each observed data realization. We then introduce a heuristic algorithm capable of achieving solutions that produce extreme privacy values, which enhances utility. The numerical results confirm improved utility at the same privacy budget compared to existing solutions in the literature. Additionally, we explore two other privacy measures, $\ell_{1}$-norm and strong $χ^2$-divergence, demonstrating the applicability of our algorithm to these lift-based measures. We evaluate the performance of our method by comparing its output with previous works. Finally, we validate our heuristic approach with a theoretical framework that estimates the optimal utility for strong $χ^2$-divergence, numerically showing a perfect match.

An Algorithm for Enhancing Privacy-Utility Tradeoff in the Privacy Funnel and Other Lift-based Measures

TL;DR

The paper tackles the privacy-utility tradeoff in the privacy funnel by replacing the standard MI-based privacy constraint with a semi-pointwise measure

, enabling more tractable optimization and improved utility at a fixed privacy budget. It builds on max-lift ideas by leveraging the average information density per observation and introduces a heuristic algorithm that combines high-ε lift-polytope vertices to identify extreme privacy points, yielding higher utility

under the same privacy constraint. The approach demonstrates superior performance to prior lift-based and subset-merging methods across lift-based measures including

-norm and

-divergence, and it aligns with theoretical benchmarks in the strong

framework. These results suggest practical gains for privacy-preserving data sharing using lift-based privacy measures in diverse settings.

Abstract

-norm and strong

-divergence, demonstrating the applicability of our algorithm to these lift-based measures. We evaluate the performance of our method by comparing its output with previous works. Finally, we validate our heuristic approach with a theoretical framework that estimates the optimal utility for strong

-divergence, numerically showing a perfect match.

Paper Structure (8 sections, 3 theorems, 20 equations, 3 figures, 1 algorithm)

This paper contains 8 sections, 3 theorems, 20 equations, 3 figures, 1 algorithm.

Introduction
Notation
System Models and Privacy Measures
Privacy-utility tradeoff
Optimal max-lift mechanism
Heuristic algorithm to enhance PUT
Numerical Results
Validation with theoretical framework in 2021StrongChi2

Key Result

Proposition 1

Given an $\varepsilon\!\in\!\mathbb{R}_{+}$, we have the following properties:

Figures (3)

Figure 1: Privacy-utility tradeoff comparison between Algorithm \ref{['alg:funnel']}, subset merging 2023Onthelift (modified for semi-pointwise measure $\mathfrak{L}(y)$) and max-lift mechanism 2021DataSanitize for privacy funnel.
Figure 2: Privacy-utility tradeoff comparison between Algorithm \ref{['alg:funnel']} and subset merging 2023Onthelift modified for $\ell_{1}$-norm and $\chi^2$-divergence.
Figure 3: Privacy-utility tradeoff comparison between Algorithm \ref{['alg:funnel']} and theoretical framework in 2021StrongChi2 for and $\chi^2$-divergence.

Theorems & Definitions (8)

Definition 1
Remark 1
Definition 2
Definition 3
Proposition 1
Proposition 2: 2019DataDsclsurPtfPriv[Proposition 1]
Corollary 1: 2019DataDsclsurPtfPriv[Corollary 1]
Example 1: 2021StrongChi2

An Algorithm for Enhancing Privacy-Utility Tradeoff in the Privacy Funnel and Other Lift-based Measures

TL;DR

Abstract

An Algorithm for Enhancing Privacy-Utility Tradeoff in the Privacy Funnel and Other Lift-based Measures

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (3)

Theorems & Definitions (8)