Table of Contents
Fetching ...

Disparate Privacy Vulnerability: Targeted Attribute Inference Attacks and Defenses

Ehsanul Kabir, Lucas Craig, Shagufta Mehnaz

TL;DR

This work reveals substantial privacy leakage in tabular-data ML models by exposing disparate vulnerability across data groups to attribute inference attacks. It introduces the disparity inference attack to identify high-risk groups and two targeted attribute inference attacks that exploit these groups with higher accuracy than untargeted variants. To counter this risk, the authors propose DAMIR and BCorr, with BCorr delivering robust disparity mitigation while preserving model performance, demonstrated on Census19, Texas-100X, and Adult datasets. The findings emphasize that correlation between sensitive attributes and model outputs drives group-level vulnerability and that targeted attacks can be highly effective, underscoring the need for practical privacy-preserving defenses in sensitive-domain ML deployments.

Abstract

As machine learning (ML) technologies become more prevalent in privacy-sensitive areas like healthcare and finance, eventually incorporating sensitive information in building data-driven algorithms, it is vital to scrutinize whether these data face any privacy leakage risks. One potential threat arises from an adversary querying trained models using the public, non-sensitive attributes of entities in the training data to infer their private, sensitive attributes, a technique known as the attribute inference attack. This attack is particularly deceptive because, while it may perform poorly in predicting sensitive attributes across the entire dataset, it excels at predicting the sensitive attributes of records from a few vulnerable groups, a phenomenon known as disparate vulnerability. This paper illustrates that an adversary can take advantage of this disparity to carry out a series of new attacks, showcasing a threat level beyond previous imagination. We first develop a novel inference attack called the disparity inference attack, which targets the identification of high-risk groups within the dataset. We then introduce two targeted variations of the attribute inference attack that can identify and exploit a vulnerable subset of the training data, marking the first instances of targeted attacks in this category, achieving significantly higher accuracy than untargeted versions. We are also the first to introduce a novel and effective disparity mitigation technique that simultaneously preserves model performance and prevents any risk of targeted attacks.

Disparate Privacy Vulnerability: Targeted Attribute Inference Attacks and Defenses

TL;DR

This work reveals substantial privacy leakage in tabular-data ML models by exposing disparate vulnerability across data groups to attribute inference attacks. It introduces the disparity inference attack to identify high-risk groups and two targeted attribute inference attacks that exploit these groups with higher accuracy than untargeted variants. To counter this risk, the authors propose DAMIR and BCorr, with BCorr delivering robust disparity mitigation while preserving model performance, demonstrated on Census19, Texas-100X, and Adult datasets. The findings emphasize that correlation between sensitive attributes and model outputs drives group-level vulnerability and that targeted attacks can be highly effective, underscoring the need for practical privacy-preserving defenses in sensitive-domain ML deployments.

Abstract

As machine learning (ML) technologies become more prevalent in privacy-sensitive areas like healthcare and finance, eventually incorporating sensitive information in building data-driven algorithms, it is vital to scrutinize whether these data face any privacy leakage risks. One potential threat arises from an adversary querying trained models using the public, non-sensitive attributes of entities in the training data to infer their private, sensitive attributes, a technique known as the attribute inference attack. This attack is particularly deceptive because, while it may perform poorly in predicting sensitive attributes across the entire dataset, it excels at predicting the sensitive attributes of records from a few vulnerable groups, a phenomenon known as disparate vulnerability. This paper illustrates that an adversary can take advantage of this disparity to carry out a series of new attacks, showcasing a threat level beyond previous imagination. We first develop a novel inference attack called the disparity inference attack, which targets the identification of high-risk groups within the dataset. We then introduce two targeted variations of the attribute inference attack that can identify and exploit a vulnerable subset of the training data, marking the first instances of targeted attacks in this category, achieving significantly higher accuracy than untargeted versions. We are also the first to introduce a novel and effective disparity mitigation technique that simultaneously preserves model performance and prevents any risk of targeted attacks.

Paper Structure

This paper contains 30 sections, 9 equations, 18 figures, 7 tables, 2 algorithms.

Figures (18)

  • Figure 2: Correlation vs. Attack performance for Male and Female group for 9 different scenarios using Census-19 Dataset.
  • Figure 5: Angular difference vs. attack performance (CSMIA - left, LOMIA - right) of 51 states from Census19 dataset. Accuracy is used as attack performance metric.
  • Figure 6: ASRD of Mutual Information Regularization defense (MIR - left, DAMIR - middle) under CSMIA and LOMIA and target model accuracy of MIR and DAMIR trained models (right).
  • Figure 7: Model utility vs. attack performance (CSMIA - left, LOMIA - right) of 51 states from Census19 dataset. Accuracy is used as a metric for both axes.
  • Figure 8: Correlation vs. angular difference of 51 states from Census19 Dataset
  • ...and 13 more figures

Theorems & Definitions (3)

  • Definition 5.1: Confidence Matrix
  • Definition 5.2: Angular Difference
  • Definition 7.1: ASRD