ESC: Erasing Space Concept for Knowledge Deletion
Tae-Young Lee, Sundong Park, Minwoo Jeon, Hyoseok Hwang, Gyeong-Moon Park
TL;DR
This work reframes machine unlearning as Knowledge Deletion (KD), addressing real-world demands for removing both utility-relevant and privacy-sensitive knowledge from models. It identifies a critical gap: prior MU methods preserve feature-level knowledge that can leak or be exploited, and thus proposes a feature-space deletion approach. The authors introduce Erasing Space Concept (ESC) and its training-enabled variant ESC-T to directly suppress forgetting information in embedding spaces, with Knowledge Retention Score (KR) as a principled metric for feature-level knowledge retention. Through extensive experiments across diverse datasets and models, ESC/ESC-T achieve fast, state-of-the-art KD performance and robust KR scores, including in facial-domain scenarios, suggesting practical utility for privacy-preserving model updates. The work also demonstrates efficiency, broad applicability, and provides a pathway for future extensions to larger models and generative domains.
Abstract
As concerns regarding privacy in deep learning continue to grow, individuals are increasingly apprehensive about the potential exploitation of their personal knowledge in trained models. Despite several research efforts to address this, they often fail to consider the real-world demand from users for complete knowledge erasure. Furthermore, our investigation reveals that existing methods have a risk of leaking personal knowledge through embedding features. To address these issues, we introduce a novel concept of Knowledge Deletion (KD), an advanced task that considers both concerns, and provides an appropriate metric, named Knowledge Retention score (KR), for assessing knowledge retention in feature space. To achieve this, we propose a novel training-free erasing approach named Erasing Space Concept (ESC), which restricts the important subspace for the forgetting knowledge by eliminating the relevant activations in the feature. In addition, we suggest ESC with Training (ESC-T), which uses a learnable mask to better balance the trade-off between forgetting and preserving knowledge in KD. Our extensive experiments on various datasets and models demonstrate that our proposed methods achieve the fastest and state-of-the-art performance. Notably, our methods are applicable to diverse forgetting scenarios, such as facial domain setting, demonstrating the generalizability of our methods. The code is available at http://github.com/KU-VGI/ESC .
