SMS: Self-supervised Model Seeding for Verification of Machine Unlearning

Weiqi Wang; Chenhan Zhang; Zhiyi Tian; Shui Yu

TL;DR

SMS (Self-supervised Model Seeding) tackles the verification gap in machine unlearning by embedding user seeds into genuine data and learning a seed-aware latent representation through a self-supervised seeding task. A seed-embedded model $M_{\texttt{S}}$ is trained jointly on the primary task and a self-supervised objective, and a user-specific verifier $\boldsymbol{\text{V}}$ checks for seed presence. Verification is bounded by $\text{Pr}\big[\boldsymbol{\text{V}}(M_{\texttt{S}}, x_{i,s_i})=1\big] \ge 1 - \text{eps}^{-N}$, alongside unambiguity bounds, while functionality is preserved so that $d(M(x), M_{\texttt{S}}(x)) \le \dots$

Abstract

Many machine unlearning methods have been proposed recently to uphold users' right to be forgotten. However, offering users verification that their data has been removed after unlearning is an important yet under-explored problem. Current verification schemes typically rely on backdooring, i.e., adding backdoored samples to influence model performance. Backdoor methods, however, only establish a connection between backdoored samples and the model; they fail to connect the backdoor with genuine samples. Backdoor removal can therefore only confirm the unlearning of backdoored samples, not of users' genuine samples, because genuine samples are independent of backdoored ones. In this paper, we propose a Self-supervised Model Seeding (SMS) scheme to provide unlearning verification for genuine samples. Unlike backdooring, SMS links user-specific seeds (such as users' unique indices), original samples, and models, thereby facilitating the verification of unlearning genuine samples. Implementing SMS for unlearning verification presents two significant challenges. First, the seeds must be embedded into the service model while being kept secret from the server. We address this with a self-supervised model seeding task, which learns the entire sample, including the seeds, into the model's latent space. Second, the utility of the original service model must be maintained while ensuring the seeding effect. We design a joint-training structure that optimizes the self-supervised model seeding task and the primary service task simultaneously on the model, thereby maintaining model utility while achieving effective model seeding. Extensive experiments demonstrate that SMS provides effective verification for genuine sample unlearning, addressing the limitations of existing methods.
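The joint-training idea described in the abstract can be illustrated with a toy loss computation. This is a minimal sketch under stated assumptions, not the authors' implementation: the additive seeding rule in `embed_seed`, the use of a reconstruction error as the self-supervised objective, and the names `joint_loss`, `strength`, and `weight` are all hypothetical choices made for illustration.

```python
import math


def embed_seed(sample, seed, strength=0.1):
    """Hypothetical seeding rule: add a scaled seed pattern to a genuine sample."""
    return [x + strength * s for x, s in zip(sample, seed)]


def cross_entropy(probs, label):
    """Primary-task loss for one sample, given predicted class probabilities."""
    return -math.log(probs[label])


def mse(a, b):
    """Mean squared error, used here as an assumed self-supervised objective."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def joint_loss(task_probs, label, reconstruction, seeded_sample, weight=0.5):
    """Primary-task loss plus a weighted self-supervised seeding loss."""
    task_term = cross_entropy(task_probs, label)
    seeding_term = mse(reconstruction, seeded_sample)
    return task_term + weight * seeding_term


# A genuine sample with a user-specific seed embedded into it.
seeded = embed_seed([0.2, 0.4, 0.6], seed=[1.0, 0.0, 1.0])

# With a perfect reconstruction the seeding term vanishes,
# so only the primary-task term contributes to the loss.
loss = joint_loss([0.1, 0.9], 1, reconstruction=seeded, seeded_sample=seeded)
```

Summing the two terms with a single weight is one simple way to trade model utility against the seeding effect; the balance actually used in SMS is not specified in this section.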
