Table of Contents
Fetching ...

Network Inversion for Training-Like Data Reconstruction

Pirzada Suhail, Amit Sethi

TL;DR

This paper presents Training-Like Data Reconstruction (TLDR), a network inversion-based approach to reconstruct training-like data from trained models by exploiting key properties of the classifier with respect to the training data along with some prior knowledge about the images.

Abstract

Machine Learning models are often trained on proprietary and private data that cannot be shared, though the trained models themselves are distributed openly assuming that sharing model weights is privacy preserving, as training data is not expected to be inferred from the model weights. In this paper, we present Training-Like Data Reconstruction (TLDR), a network inversion-based approach to reconstruct training-like data from trained models. To begin with, we introduce a comprehensive network inversion technique that learns the input space corresponding to different classes in the classifier using a single conditioned generator. While inversion may typically return random and arbitrary input images for a given output label, we modify the inversion process to incentivize the generator to reconstruct training-like data by exploiting key properties of the classifier with respect to the training data along with some prior knowledge about the images. To validate our approach, we conduct empirical evaluations on multiple standard vision classification datasets, thereby highlighting the potential privacy risks involved in sharing machine learning models.

Network Inversion for Training-Like Data Reconstruction

TL;DR

This paper presents Training-Like Data Reconstruction (TLDR), a network inversion-based approach to reconstruct training-like data from trained models by exploiting key properties of the classifier with respect to the training data along with some prior knowledge about the images.

Abstract

Machine Learning models are often trained on proprietary and private data that cannot be shared, though the trained models themselves are distributed openly assuming that sharing model weights is privacy preserving, as training data is not expected to be inferred from the model weights. In this paper, we present Training-Like Data Reconstruction (TLDR), a network inversion-based approach to reconstruct training-like data from trained models. To begin with, we introduce a comprehensive network inversion technique that learns the input space corresponding to different classes in the classifier using a single conditioned generator. While inversion may typically return random and arbitrary input images for a given output label, we modify the inversion process to incentivize the generator to reconstruct training-like data by exploiting key properties of the classifier with respect to the training data along with some prior knowledge about the images. To validate our approach, we conduct empirical evaluations on multiple standard vision classification datasets, thereby highlighting the potential privacy risks involved in sharing machine learning models.

Paper Structure

This paper contains 20 sections, 11 equations, 5 figures.

Figures (5)

  • Figure 1: Proposed Approach to Network Inversion
  • Figure 2: Schematic Approach to Training-Like Data Reconstruction using Network Inversion
  • Figure 3: Inverted Images for all 10 classes in MNIST, FashionMNIST, SVHN & CIFAR-10.
  • Figure 4: Reconstructed Images for all 10 classes in MNIST and FashionMNIST respectively .
  • Figure 5: Reconstructed Images for all 10 classes in SVHN and CIFAR-10 respectively.