Table of Contents
Fetching ...

Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features

Francisco Teixeira, Karla Pizzi, Raphael Olivier, Alberto Abad, Bhiksha Raj, Isabel Trancoso

TL;DR

This paper addresses privacy auditing in ASR by developing loss-based MI features computed on unprocessed model logits and augmenting them with Gaussian and adversarial perturbations. The approach outperforms traditional error-based features for sample-level MI, achieving high accuracy and AUC, while still offering improvements under realistic shadow-target and architecture-mismatch scenarios. For speaker-level MI, gains are more modest and sensitive to model architecture differences. The findings support using logit-based, perturbation-informed MI features to audit ASR systems while considering access level and privacy trade-offs in real-world deployments.

Abstract

Membership Inference (MI) poses a substantial privacy threat to the training data of Automatic Speech Recognition (ASR) systems, while also offering an opportunity to audit these models with regard to user data. This paper explores the effectiveness of loss-based features in combination with Gaussian and adversarial perturbations to perform MI in ASR models. To the best of our knowledge, this approach has not yet been investigated. We compare our proposed features with commonly used error-based features and find that the proposed features greatly enhance performance for sample-level MI. For speaker-level MI, these features improve results, though by a smaller margin, as error-based features already obtained a high performance for this task. Our findings emphasise the importance of considering different feature sets and levels of access to target models for effective MI in ASR systems, providing valuable insights for auditing such models.

Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features

TL;DR

This paper addresses privacy auditing in ASR by developing loss-based MI features computed on unprocessed model logits and augmenting them with Gaussian and adversarial perturbations. The approach outperforms traditional error-based features for sample-level MI, achieving high accuracy and AUC, while still offering improvements under realistic shadow-target and architecture-mismatch scenarios. For speaker-level MI, gains are more modest and sensitive to model architecture differences. The findings support using logit-based, perturbation-informed MI features to audit ASR systems while considering access level and privacy trade-offs in real-world deployments.

Abstract

Membership Inference (MI) poses a substantial privacy threat to the training data of Automatic Speech Recognition (ASR) systems, while also offering an opportunity to audit these models with regard to user data. This paper explores the effectiveness of loss-based features in combination with Gaussian and adversarial perturbations to perform MI in ASR models. To the best of our knowledge, this approach has not yet been investigated. We compare our proposed features with commonly used error-based features and find that the proposed features greatly enhance performance for sample-level MI. For speaker-level MI, these features improve results, though by a smaller margin, as error-based features already obtained a high performance for this task. Our findings emphasise the importance of considering different feature sets and levels of access to target models for effective MI in ASR systems, providing valuable insights for auditing such models.
Paper Structure (16 sections, 2 tables, 2 algorithms)