Dual-View Inference Attack: Machine Unlearning Amplifies Privacy Exposure

Lulu Xue; Shengshan Hu; Linqiang Qian; Peijin Guo; Yechao Zhang; Minghui Li; Yanjun Zhang; Dayong Ye; Leo Yu Zhang

Dual-View Inference Attack: Machine Unlearning Amplifies Privacy Exposure

Lulu Xue, Shengshan Hu, Linqiang Qian, Peijin Guo, Yechao Zhang, Minghui Li, Yanjun Zhang, Dayong Ye, Leo Yu Zhang

TL;DR

This work reveals a previously overlooked privacy risk of machine unlearning: allowing an attacker to query both the original and unlearned models (dual-view) amplifies privacy leakage for retained data. It introduces Privacy Knowledge Gain to formalize this risk and DVIA, a black-box membership inference attack that uses Unlearning Confidence Difference and a simple likelihood-ratio inference without attack-model training. Through experiments on CIFAR/SVHN/CIFAR-100 with varying unlearning methods, DVIA consistently outperforms prior MIAs, including when unlearning is approximate, and demonstrates strong leakage even for non-forgotten data. The study highlights practical privacy implications for unlearning APIs and proposes avenues for defense and future research in broader domains.

Abstract

Machine unlearning is a newly popularized technique for removing specific training data from a trained model, enabling it to comply with data deletion requests. While it protects the rights of users requesting unlearning, it also introduces new privacy risks. Prior works have primarily focused on the privacy of data that has been unlearned, while the risks to retained data remain largely unexplored. To address this gap, we focus on the privacy risks of retained data and, for the first time, reveal the vulnerabilities introduced by machine unlearning under the dual-view setting, where an adversary can query both the original and the unlearned models. From an information-theoretic perspective, we introduce the concept of {privacy knowledge gain} and demonstrate that the dual-view setting allows adversaries to obtain more information than querying either model alone, thereby amplifying privacy leakage. To effectively demonstrate this threat, we propose DVIA, a Dual-View Inference Attack, which extracts membership information on retained data using black-box queries to both models. DVIA eliminates the need to train an attack model and employs a lightweight likelihood ratio inference module for efficient inference. Experiments across different datasets and model architectures validate the effectiveness of DVIA and highlight the privacy risks inherent in the dual-view setting.

Dual-View Inference Attack: Machine Unlearning Amplifies Privacy Exposure

TL;DR

Abstract

Dual-View Inference Attack: Machine Unlearning Amplifies Privacy Exposure

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (10)

Theorems & Definitions (6)