Table of Contents
Fetching ...

Differentially Private Kernel Inducing Points using features from ScatterNets (DP-KIP-ScatterNet) for Privacy Preserving Data Distillation

Margarita Vinaroz, Mi Jung Park

TL;DR

DP-KIP applies differential privacy to kernel inducing points within the KIP framework to produce privacy-preserving distilled datasets. By exploring kernel choices, the authors find ScatterNet features offer a favorable privacy-utility trade-off, achieving performance close to infinite-width conv-NTKs but at much lower computational cost, enabling single-GPU training. DP-KIP-ScatterNet outperforms prior DP data distillation methods on image benchmarks and remains effective under pixel corruption, while DP-KIP with FC-NTK shows strong results on tabular data with few distilled samples. Overall, the work broadens practical privacy-preserving data distillation by combining kernel methods with fixed, low-cost feature maps and a scalable DP-SGD training procedure.

Abstract

Data distillation aims to generate a small data set that closely mimics the performance of a given learning algorithm on the original data set. The distilled dataset is hence useful to simplify the training process thanks to its small data size. However, distilled data samples are not necessarily privacy-preserving, even if they are generally humanly indiscernible. To address this limitation, we introduce differentially private kernel inducing points (DP-KIP) for privacy-preserving data distillation. Unlike our original intention to simply apply DP-SGD to the framework of KIP, we find that KIP using infinitely-wide convolutional neural tangent kernels (conv-NTKs) performs better compared to KIP using fully-connected NTKs. However, KIP with conv-NTKs, due to its convolutional and pooling operations, introduces an unbearable computational complexity, requiring hundreds of V100 GPUs in parallel to train, which is impractical and more importantly, such computational resources are inaccessible to many. To overcome this issue, we propose an alternative that does not require pre-training (to avoid a privacy loss) and can well capture complex information on images, as those features from conv-NKTs do, while the computational cost is manageable by a single V100 GPU. To this end, we propose DP-KIP-ScatterNet, which uses the wavelet features from Scattering networks (ScatterNet) instead of those from conv-NTKs, to perform DP-KIP at a reasonable computational cost. We implement DP-KIP-ScatterNet in -- computationally efficient -- JAX and test on several popular image datasets to show its efficacy and its superior performance compared to state-of-the art methods in image data distillation with differential privacy guarantees.

Differentially Private Kernel Inducing Points using features from ScatterNets (DP-KIP-ScatterNet) for Privacy Preserving Data Distillation

TL;DR

DP-KIP applies differential privacy to kernel inducing points within the KIP framework to produce privacy-preserving distilled datasets. By exploring kernel choices, the authors find ScatterNet features offer a favorable privacy-utility trade-off, achieving performance close to infinite-width conv-NTKs but at much lower computational cost, enabling single-GPU training. DP-KIP-ScatterNet outperforms prior DP data distillation methods on image benchmarks and remains effective under pixel corruption, while DP-KIP with FC-NTK shows strong results on tabular data with few distilled samples. Overall, the work broadens practical privacy-preserving data distillation by combining kernel methods with fixed, low-cost feature maps and a scalable DP-SGD training procedure.

Abstract

Data distillation aims to generate a small data set that closely mimics the performance of a given learning algorithm on the original data set. The distilled dataset is hence useful to simplify the training process thanks to its small data size. However, distilled data samples are not necessarily privacy-preserving, even if they are generally humanly indiscernible. To address this limitation, we introduce differentially private kernel inducing points (DP-KIP) for privacy-preserving data distillation. Unlike our original intention to simply apply DP-SGD to the framework of KIP, we find that KIP using infinitely-wide convolutional neural tangent kernels (conv-NTKs) performs better compared to KIP using fully-connected NTKs. However, KIP with conv-NTKs, due to its convolutional and pooling operations, introduces an unbearable computational complexity, requiring hundreds of V100 GPUs in parallel to train, which is impractical and more importantly, such computational resources are inaccessible to many. To overcome this issue, we propose an alternative that does not require pre-training (to avoid a privacy loss) and can well capture complex information on images, as those features from conv-NKTs do, while the computational cost is manageable by a single V100 GPU. To this end, we propose DP-KIP-ScatterNet, which uses the wavelet features from Scattering networks (ScatterNet) instead of those from conv-NTKs, to perform DP-KIP at a reasonable computational cost. We implement DP-KIP-ScatterNet in -- computationally efficient -- JAX and test on several popular image datasets to show its efficacy and its superior performance compared to state-of-the art methods in image data distillation with differential privacy guarantees.
Paper Structure (22 sections, 1 theorem, 10 equations, 2 figures, 16 tables, 1 algorithm)

This paper contains 22 sections, 1 theorem, 10 equations, 2 figures, 16 tables, 1 algorithm.

Key Result

Proposition 1

The DP-KIP algorithm produces a ($\epsilon, \delta$)-DP distilled dataset.

Figures (2)

  • Figure 1: Generated image samples from KIP comparison models.
  • Figure 2: Generated image samples from DP-KIP-ScatterNet for different $\epsilon$ values.

Theorems & Definitions (6)

  • Definition 2.1
  • Definition 2.2
  • Definition 2.3
  • Proposition 1
  • proof
  • Definition B.1