Table of Contents
Fetching ...

FT-PrivacyScore: Personalized Privacy Scoring Service for Machine Learning Participation

Yuechun Gu, Jiajie He, Keke Chen

TL;DR

The demo prototype FT-PrivacyScore is developed to show that it's possible to efficiently and quantitatively estimate the privacy risk of participating in a model fine-tuning task.

Abstract

Training data privacy has been a top concern in AI modeling. While methods like differentiated private learning allow data contributors to quantify acceptable privacy loss, model utility is often significantly damaged. In practice, controlled data access remains a mainstream method for protecting data privacy in many industrial and research environments. In controlled data access, authorized model builders work in a restricted environment to access sensitive data, which can fully preserve data utility with reduced risk of data leak. However, unlike differential privacy, there is no quantitative measure for individual data contributors to tell their privacy risk before participating in a machine learning task. We developed the demo prototype FT-PrivacyScore to show that it's possible to efficiently and quantitatively estimate the privacy risk of participating in a model fine-tuning task. The demo source code will be available at \url{https://github.com/RhincodonE/demo_privacy_scoring}.

FT-PrivacyScore: Personalized Privacy Scoring Service for Machine Learning Participation

TL;DR

The demo prototype FT-PrivacyScore is developed to show that it's possible to efficiently and quantitatively estimate the privacy risk of participating in a model fine-tuning task.

Abstract

Training data privacy has been a top concern in AI modeling. While methods like differentiated private learning allow data contributors to quantify acceptable privacy loss, model utility is often significantly damaged. In practice, controlled data access remains a mainstream method for protecting data privacy in many industrial and research environments. In controlled data access, authorized model builders work in a restricted environment to access sensitive data, which can fully preserve data utility with reduced risk of data leak. However, unlike differential privacy, there is no quantitative measure for individual data contributors to tell their privacy risk before participating in a machine learning task. We developed the demo prototype FT-PrivacyScore to show that it's possible to efficiently and quantitatively estimate the privacy risk of participating in a model fine-tuning task. The demo source code will be available at \url{https://github.com/RhincodonE/demo_privacy_scoring}.

Paper Structure

This paper contains 7 sections, 3 figures, 1 algorithm.

Figures (3)

  • Figure 1: Personalized privacy scoring service: evaluate the privacy risk for users participating in a model fine-tuning task
  • Figure 2: Quality of our efficient privacy scoring method.
  • Figure 3: Interactive live system