Table of Contents
Fetching ...

Descent-to-Delete: Gradient-Based Methods for Machine Unlearning

Seth Neel, Aaron Roth, Saeed Sharifi-Malvajerdi

TL;DR

This work tackles efficient data deletion (unlearning) for convex models under adversarial, long update sequences by framing indistinguishability relative to full retraining. It introduces two gradient-based strategies: a simple Perturbed Gradient Descent (PGD) with Gaussian noise and a distributed, partitioned variant employing reservoir sampling and bootstrap sampling to maintain distributional properties. The authors provide rigorous ε-δ unlearning guarantees under strong convexity and smoothness, with weaker guarantees and secret-state alternatives for non-strongly convex cases, and demonstrate improved runtime-accuracy tradeoffs in high-dimensional settings. Through connections to differential privacy and distributed learning literature (e.g., SISA, ZDW12), the paper presents a principled framework for robust, privacy-preserving machine unlearning in convex ERM systems.

Abstract

We study the data deletion problem for convex models. By leveraging techniques from convex optimization and reservoir sampling, we give the first data deletion algorithms that are able to handle an arbitrarily long sequence of adversarial updates while promising both per-deletion run-time and steady-state error that do not grow with the length of the update sequence. We also introduce several new conceptual distinctions: for example, we can ask that after a deletion, the entire state maintained by the optimization algorithm is statistically indistinguishable from the state that would have resulted had we retrained, or we can ask for the weaker condition that only the observable output is statistically indistinguishable from the observable output that would have resulted from retraining. We are able to give more efficient deletion algorithms under this weaker deletion criterion.

Descent-to-Delete: Gradient-Based Methods for Machine Unlearning

TL;DR

This work tackles efficient data deletion (unlearning) for convex models under adversarial, long update sequences by framing indistinguishability relative to full retraining. It introduces two gradient-based strategies: a simple Perturbed Gradient Descent (PGD) with Gaussian noise and a distributed, partitioned variant employing reservoir sampling and bootstrap sampling to maintain distributional properties. The authors provide rigorous ε-δ unlearning guarantees under strong convexity and smoothness, with weaker guarantees and secret-state alternatives for non-strongly convex cases, and demonstrate improved runtime-accuracy tradeoffs in high-dimensional settings. Through connections to differential privacy and distributed learning literature (e.g., SISA, ZDW12), the paper presents a principled framework for robust, privacy-preserving machine unlearning in convex ERM systems.

Abstract

We study the data deletion problem for convex models. By leveraging techniques from convex optimization and reservoir sampling, we give the first data deletion algorithms that are able to handle an arbitrarily long sequence of adversarial updates while promising both per-deletion run-time and steady-state error that do not grow with the length of the update sequence. We also introduce several new conceptual distinctions: for example, we can ask that after a deletion, the entire state maintained by the optimization algorithm is statistically indistinguishable from the state that would have resulted had we retrained, or we can ask for the weaker condition that only the observable output is statistically indistinguishable from the observable output that would have resulted from retraining. We are able to give more efficient deletion algorithms under this weaker deletion criterion.

Paper Structure

This paper contains 19 sections, 16 theorems, 99 equations, 1 table, 7 algorithms.

Key Result

Lemma 2.12

Suppose for any $z \in \mathcal{Z}$, $f_z$ is $L$-Lipschitz and $m$-strongly convex. For any dataset $\mathcal{D}$, let $\theta^*_\mathcal{D} \triangleq \mathop{\mathrm{argmin}}\limits_{\theta \in \Theta} f_\mathcal{D} \left(\theta \right)$. We have that for any integer $n$, any data set $\mathcal{D

Theorems & Definitions (42)

  • Definition 2.1: Update
  • Definition 2.2: $\mathcal{D}_i, \theta_i, \hat{\theta}_i, \tilde{\theta}_i$
  • Definition 2.3: $(\epsilon,\delta)$-indistinguishability
  • Definition 2.4: $(\epsilon,\delta)$-unlearning
  • Remark 2.5
  • Definition 2.7: $(\alpha, \beta)$-accuracy
  • Definition 2.8: strong vs. weak unlearning
  • Definition 2.9: Strong Convexity
  • Definition 2.10: Lipschitzness
  • Definition 2.11: Smoothness
  • ...and 32 more