Table of Contents
Fetching ...

Targeted Therapy in Data Removal: Object Unlearning Based on Scene Graphs

Chenhan Zhang, Benjamin Zi Hao Zhao, Hassan Asghar, Dali Kaafar

TL;DR

This work tackles granular privacy by introducing a scene-graph–based object unlearning framework that can selectively remove specific objects from images generated by SG2I models while preserving remaining content. It formalizes object unlearning, leveraging scene graphs to map user requests to actionable model edits and proposes three unlearning methods (negative guidance, masking, and influence-function redaction) to avoid full retraining. Extensive experiments on Visual Genome across image reconstruction and synthesis demonstrate that object-masked fine-tuning offers the best trade-off between unlearning effectiveness and fidelity, with strong robustness to indirect leakage attacks. The approach significantly speeds up unlearning compared to retraining and provides a scalable path for privacy-compliant MLaaS without sacrificing overall data utility.

Abstract

Users may inadvertently upload personally identifiable information (PII) to Machine Learning as a Service (MLaaS) providers. When users no longer want their PII on these services, regulations like GDPR and COPPA mandate a right to forget for these users. As such, these services seek efficient methods to remove the influence of specific data points. Thus the introduction of machine unlearning. Traditionally, unlearning is performed with the removal of entire data samples (sample unlearning) or whole features across the dataset (feature unlearning). However, these approaches are not equipped to handle the more granular and challenging task of unlearning specific objects within a sample. To address this gap, we propose a scene graph-based object unlearning framework. This framework utilizes scene graphs, rich in semantic representation, transparently translate unlearning requests into actionable steps. The result, is the preservation of the overall semantic integrity of the generated image, bar the unlearned object. Further, we manage high computational overheads with influence functions to approximate the unlearning process. For validation, we evaluate the unlearned object's fidelity in outputs under the tasks of image reconstruction and image synthesis. Our proposed framework demonstrates improved object unlearning outcomes, with the preservation of unrequested samples in contrast to sample and feature learning methods. This work addresses critical privacy issues by increasing the granularity of targeted machine unlearning through forgetting specific object-level details without sacrificing the utility of the whole data sample or dataset feature.

Targeted Therapy in Data Removal: Object Unlearning Based on Scene Graphs

TL;DR

This work tackles granular privacy by introducing a scene-graph–based object unlearning framework that can selectively remove specific objects from images generated by SG2I models while preserving remaining content. It formalizes object unlearning, leveraging scene graphs to map user requests to actionable model edits and proposes three unlearning methods (negative guidance, masking, and influence-function redaction) to avoid full retraining. Extensive experiments on Visual Genome across image reconstruction and synthesis demonstrate that object-masked fine-tuning offers the best trade-off between unlearning effectiveness and fidelity, with strong robustness to indirect leakage attacks. The approach significantly speeds up unlearning compared to retraining and provides a scalable path for privacy-compliant MLaaS without sacrificing overall data utility.

Abstract

Users may inadvertently upload personally identifiable information (PII) to Machine Learning as a Service (MLaaS) providers. When users no longer want their PII on these services, regulations like GDPR and COPPA mandate a right to forget for these users. As such, these services seek efficient methods to remove the influence of specific data points. Thus the introduction of machine unlearning. Traditionally, unlearning is performed with the removal of entire data samples (sample unlearning) or whole features across the dataset (feature unlearning). However, these approaches are not equipped to handle the more granular and challenging task of unlearning specific objects within a sample. To address this gap, we propose a scene graph-based object unlearning framework. This framework utilizes scene graphs, rich in semantic representation, transparently translate unlearning requests into actionable steps. The result, is the preservation of the overall semantic integrity of the generated image, bar the unlearned object. Further, we manage high computational overheads with influence functions to approximate the unlearning process. For validation, we evaluate the unlearned object's fidelity in outputs under the tasks of image reconstruction and image synthesis. Our proposed framework demonstrates improved object unlearning outcomes, with the preservation of unrequested samples in contrast to sample and feature learning methods. This work addresses critical privacy issues by increasing the granularity of targeted machine unlearning through forgetting specific object-level details without sacrificing the utility of the whole data sample or dataset feature.

Paper Structure

This paper contains 17 sections, 11 equations, 12 figures, 4 tables.

Figures (12)

  • Figure 1: Scene graph-based object unlearning framework. Scene graphs can help both servers and users manage unlearning requests effectively by providing a structured way to understand the relationships between objects in an image. Scene graphs make it easier to identify and remove the requested object, such as a girl in an image. Moreover, this ensures that servers interpret and handle requests accurately, avoiding vague or incomplete unlearning actions. In this way, scene graphs act as a bridge, translating user intentions into actionable and transparent operations for the server.
  • Figure 2: Illustration of the unique identity of objects in the scene graph. In the image and the corresponding scene graph, even though two objects may belong to the same category, such as ‘girl,’ they are represented as distinct objects.
  • Figure 3: Schematic of scene-graph-to-image (SG2I) generator. Readers can refer to DBLP:conf/cvpr/DhamoFLNHT020 for a detailed view of the architecture.
  • Figure 4: Results of unlearning verification through metrics regarding A1, A2, and A3. To provide clarity for the reader, we have modify the distance metrics to follow a "larger is better" mantra. Specifically, we compute the complement values for A1_SSIM, A2_LPIPS, A3_LPIPS, A2_MAE, and A3_MAE for presentation within the plot. For SSIM and LPIPS, the complement transformation is 1 - value. For MAE, the complement transformation is 255 - value. The complement values are highlighted with a "$\sim$" prefix. For A3 metrics, since they involve multiple other samples, we calculate and report the average value among samples.
  • Figure 5: Visualization of unlearning verification. In the images, the 'green boxes' localize the unlearned object, while the 'red boxes' indicate objects of the same category as the unlearned object but in different samples. In the scene graph visualization, the 'green node' represents the unlearned object. GT represents the ground truth.
  • ...and 7 more figures

Theorems & Definitions (3)

  • Definition 1: Sample Unlearning
  • Definition 2: Feature Unlearning
  • Definition 3: Object Unlearning