Is there anything left? Measuring semantic residuals of objects removed from 3D Gaussian Splatting
Simona Kocour, Assia Benbihi, Aikaterini Adam, Torsten Sattler
TL;DR
This work tackles the privacy-oriented problem of residual information after removing objects from 3D Gaussian Splatting representations. It introduces a quantitative framework combining semantic, instance-level, and depth-based metrics, including $IoU_{drop}$, $acc_{seg}$, $sim_{SAM}$, and $acc_{ riangle depth}$, and couples them with a graph-cut based removal refinement that enforces spatial and semantic consistency. Through experiments on indoor/outdoor scenes with multiple removal methods, the authors show that methods like GaussianCut and GaussianGrouping yield strong removals, while a user study supports the alignment between metrics and perceived removal quality. The results advance privacy-preserving mapping by providing robust, multi-faceted evaluation tools and a practical refinement technique, along with open-source code and data to catalyze further research.
Abstract
Searching in and editing 3D scenes has become extremely intuitive with trainable scene representations that allow linking human concepts to elements in the scene. These operations are often evaluated on the basis of how accurately the searched element is segmented or extracted from the scene. In this paper, we address the inverse problem, that is, how much of the searched element remains in the scene after it is removed. This question is particularly important in the context of privacy-preserving mapping when a user reconstructs a 3D scene and wants to remove private elements before sharing the map. To the best of our knowledge, this is the first work to address this question. To answer this, we propose a quantitative evaluation that measures whether a removal operation leaves object residuals that can be reasoned over. The scene is not private when such residuals are present. Experiments on state-of-the-art scene representations show that the proposed metrics are meaningful and consistent with the user study that we also present. We also propose a method to refine the removal based on spatial and semantic consistency.
