Certified Per-Instance Unlearning Using Individual Sensitivity Bounds

Hanna Benarroch; Jamal Atif; Olivier Cappé

Certified Per-Instance Unlearning Using Individual Sensitivity Bounds

Hanna Benarroch, Jamal Atif, Olivier Cappé

TL;DR

The paper tackles certified machine unlearning by moving from uniform worst-case noise to per-instance noise calibrated to the erased point’s influence. It builds a learn–then–unlearn framework using Langevin dynamics and Gaussian differential privacy, deriving high-probability, per-instance sensitivity bounds for ridge regression and showing how to tune unlearning noise accordingly. The main contributions are a trajectory-based GDP accounting method under contractive updates, and explicit per-instance unlearning guarantees for ridge regression, supported by experiments in MNIST and CIFAR-10 that reveal substantial heterogeneity across data points. The work demonstrates improved privacy–utility trade-offs through instance-aware calibration and motivates extending per-instance guarantees to more complex, non-linear models. Overall, it advances practical, provable unlearning by leveraging data-point specific influence, contraction properties, and GDP-based privacy accounting.

Abstract

Certified machine unlearning can be achieved via noise injection leading to differential privacy guarantees, where noise is calibrated to worst-case sensitivity. Such conservative calibration often results in performance degradation, limiting practical applicability. In this work, we investigate an alternative approach based on adaptive per-instance noise calibration tailored to the individual contribution of each data point to the learned solution. This raises the following challenge: how can one establish formal unlearning guarantees when the mechanism depends on the specific point to be removed? To define individual data point sensitivities in noisy gradient dynamics, we consider the use of per-instance differential privacy. For ridge regression trained via Langevin dynamics, we derive high-probability per-instance sensitivity bounds, yielding certified unlearning with substantially less noise injection. We corroborate our theoretical findings through experiments in linear settings and provide further empirical evidence on the relevance of the approach in deep learning settings.

Certified Per-Instance Unlearning Using Individual Sensitivity Bounds

TL;DR

Abstract

Paper Structure (58 sections, 7 theorems, 91 equations, 11 figures, 1 table, 1 algorithm)

This paper contains 58 sections, 7 theorems, 91 equations, 11 figures, 1 table, 1 algorithm.

Introduction
Related work
Exact and approximate unlearning.
Gradient-based unlearning with noisy optimization.
Per-instance unlearning mechanisms.
Gaussian differential privacy and privacy accounting.
Problem Setting
Regularized Empirical Risk Minimization
Certified per-instance unlearning
Algorithms: learn then targeted unlearn
Langevin learning $\mathcal{A}$
Targeted unlearning $\mathcal{U}$ via retain-set fine-tuning
Theory for Certified Per-instance Unlearning
Privacy loss tracking under contractive updates and per-instance sensitivity
Compared trajectories for privacy analysis.
...and 43 more sections

Key Result

Proposition 4.2

Consider the two parameter trajectories $(\theta_k)_{k\ge0}$ and $(\theta'_k)_{k\ge0}$ defined as above. Assume that Assumption ass:contraction holds with contraction factor $c<1$. For an unlearning request concerning $(x_i,y_i)$, define the per-instance sensitivity at time $k$ as and assume that there exists a deterministic sequence $\{s_{i,k}\}_{k=0}^{T}$ (which does not depend on the stochasti

Figures (11)

Figure 1: Intuition behind per-instance unlearning. Top: After training, the parameter distributions induced by $\mathcal{A}(D)$ and $\mathcal{A}(D^{-i})$ are sharply concentrated and well separated, resulting in a large privacy gap $\varepsilon_0$. Bottom: After applying the unlearning procedure $\mathcal{U}$, the resulting distributions corresponding to $\mathcal{U}(\mathcal{A}(D),D^{-i},(x_i,y_i))$ and $\mathcal{U}(\mathcal{A}(D^{-i}),D^{-i},\varnothing)$ become closer, yielding a much smaller privacy parameter $\varepsilon \ll \varepsilon_0$.
Figure 2: Learn--unlearn vs. retraining trajectories. The solid path $(\theta_k)$ corresponds to training on the full dataset $D$ followed by unlearning, while the dashed path $(\theta'_k)$ corresponds to (fictitiously) training and retraining on the retain dataset $D^{-i}$.
Figure 3: High-probability sensitivity bounds $s_{i,k}^{\delta_{\mathrm s}}$ for all MNIST training points in the linear ridge regression setting. Rows correspond to data points and columns to learning iterations. Points are sorted by increasing final sensitivity $s_{i,T}^{\delta_{\mathrm s}}$, highlighting substantial per-instance heterogeneity already at the level of analytical bounds.
Figure 4: Representative MNIST examples selected for unlearning, ordered from left to right by increasing difficulty, as measured by the per-sample gradient norm $\|\nabla_\theta \ell(\theta_T; x_i, y_i)\|$ at convergence for a certain seed.
Figure 5: Privacy--utility trade-off on MNIST. Final test accuracy as a function of the privacy budget $\varepsilon$ for per-instance unlearning (solid lines) and a uniform baseline (dashed lines). Each curve corresponds to a distinct removed training point, highlighting the heterogeneity of the trade-off across points.
...and 6 more figures

Theorems & Definitions (17)

Definition 3.1: Differential Privacy dwork2006tcc
Definition 3.2: $(\varepsilon,\delta)$-Reference Unlearning ginart2019
Definition 3.3: $(\varepsilon,\delta)$-Unlearning sekhari2021
Definition 3.4: $(\varepsilon,\delta)$-Per-Instance Unlearning
Proposition 4.2: Gaussian DP accounting of the unlearning mechanism $\mathcal{U}$ for bounded per-instance sensitivities
Theorem 4.3: Certified per-instance unlearning for ridge regression
Proposition 4.4: High-probability sensitivity bounds $s_{i,k}^{\delta_{\mathrm s}}$ for ridge regression
Remark 4.5: Exact unlearning for ridge regression
Definition A.1: Gaussian Differential Privacy dong2019gdp
Lemma B.1: Shifted step lemma (bok2024shiftedinterpolationdifferentialprivacy)
...and 7 more

Certified Per-Instance Unlearning Using Individual Sensitivity Bounds

TL;DR

Abstract

Certified Per-Instance Unlearning Using Individual Sensitivity Bounds

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (17)