Table of Contents
Fetching ...

SeqMIA: Sequential-Metric Based Membership Inference Attack

Hao Li, Zheng Li, Siyuan Wu, Chengrui Hu, Yutong Ye, Min Zhang, Dengguo Feng, Yang Zhang

TL;DR

SeqMIA addresses the high false-positive rates in membership inference attacks by exploiting a new integrated signal: the Pattern of Metric Sequence, which captures time-dependent patterns across multiple training stages. It uses knowledge distillation to generate distilled snapshots of the target model, constructs serialized multi-metric sequences from these snapshots, and feeds them to an attention-based RNN to infer membership. Empirical results across seven datasets and multiple architectures show SeqMIA substantially outperforms baselines, achieving large gains in TPR at $0.1\%$ FPR and robustness under several defenses. The work highlights the value of temporal patterns in training signals for MIAs and outlines practical defense considerations and future enhancements in metrics and serialization models.

Abstract

Most existing membership inference attacks (MIAs) utilize metrics (e.g., loss) calculated on the model's final state, while recent advanced attacks leverage metrics computed at various stages, including both intermediate and final stages, throughout the model training. Nevertheless, these attacks often process multiple intermediate states of the metric independently, ignoring their time-dependent patterns. Consequently, they struggle to effectively distinguish between members and non-members who exhibit similar metric values, particularly resulting in a high false-positive rate. In this study, we delve deeper into the new membership signals in the black-box scenario. We identify a new, more integrated membership signal: the Pattern of Metric Sequence, derived from the various stages of model training. We contend that current signals provide only partial perspectives of this new signal: the new one encompasses both the model's multiple intermediate and final states, with a greater emphasis on temporal patterns among them. Building upon this signal, we introduce a novel attack method called Sequential-metric based Membership Inference Attack (SeqMIA). Specifically, we utilize knowledge distillation to obtain a set of distilled models representing various stages of the target model's training. We then assess multiple metrics on these distilled models in chronological order, creating distilled metric sequence. We finally integrate distilled multi-metric sequences as a sequential multiformat and employ an attention-based RNN attack model for inference. Empirical results show SeqMIA outperforms all baselines, especially can achieve an order of magnitude improvement in terms of TPR @ 0.1% FPR. Furthermore, we delve into the reasons why this signal contributes to SeqMIA's high attack performance, and assess various defense mechanisms against SeqMIA.

SeqMIA: Sequential-Metric Based Membership Inference Attack

TL;DR

SeqMIA addresses the high false-positive rates in membership inference attacks by exploiting a new integrated signal: the Pattern of Metric Sequence, which captures time-dependent patterns across multiple training stages. It uses knowledge distillation to generate distilled snapshots of the target model, constructs serialized multi-metric sequences from these snapshots, and feeds them to an attention-based RNN to infer membership. Empirical results across seven datasets and multiple architectures show SeqMIA substantially outperforms baselines, achieving large gains in TPR at FPR and robustness under several defenses. The work highlights the value of temporal patterns in training signals for MIAs and outlines practical defense considerations and future enhancements in metrics and serialization models.

Abstract

Most existing membership inference attacks (MIAs) utilize metrics (e.g., loss) calculated on the model's final state, while recent advanced attacks leverage metrics computed at various stages, including both intermediate and final stages, throughout the model training. Nevertheless, these attacks often process multiple intermediate states of the metric independently, ignoring their time-dependent patterns. Consequently, they struggle to effectively distinguish between members and non-members who exhibit similar metric values, particularly resulting in a high false-positive rate. In this study, we delve deeper into the new membership signals in the black-box scenario. We identify a new, more integrated membership signal: the Pattern of Metric Sequence, derived from the various stages of model training. We contend that current signals provide only partial perspectives of this new signal: the new one encompasses both the model's multiple intermediate and final states, with a greater emphasis on temporal patterns among them. Building upon this signal, we introduce a novel attack method called Sequential-metric based Membership Inference Attack (SeqMIA). Specifically, we utilize knowledge distillation to obtain a set of distilled models representing various stages of the target model's training. We then assess multiple metrics on these distilled models in chronological order, creating distilled metric sequence. We finally integrate distilled multi-metric sequences as a sequential multiformat and employ an attention-based RNN attack model for inference. Empirical results show SeqMIA outperforms all baselines, especially can achieve an order of magnitude improvement in terms of TPR @ 0.1% FPR. Furthermore, we delve into the reasons why this signal contributes to SeqMIA's high attack performance, and assess various defense mechanisms against SeqMIA.
Paper Structure (30 sections, 2 equations, 15 figures, 22 tables)

This paper contains 30 sections, 2 equations, 15 figures, 22 tables.

Figures (15)

  • Figure 1: (a) the mean curves and fluctuation area of loss values for members and non-members during different training epochs; (b) the distribution of the cumulative loss fluctuation amplitude (CLFA) within 100 epochs.
  • Figure 2: Absolute correlation coefficients among multiple metrics calculated from MLPs trained on Location.
  • Figure 3: Overview of SeqMIA. Different from existing MIAs, SeqMIA focuses on the sequential membership information (multi-metric sequences) in a high-dimensional space.
  • Figure 4: Workflow of multi-metric sequence construction, which assembles the membership information of a sample into a sequence of a $k$-dimensional space.
  • Figure 5: Log-scale ROC curves for attacks on different model architectures and four image datasets (from top to bottom: CIFAR10, CIFAR100, CINIC10, and GTSRB).
  • ...and 10 more figures