A Unified Membership Inference Method for Visual Self-supervised Encoder via Part-aware Capability
Jie Zhu, Jirong Zha, Ding Li, Leye Wang
TL;DR
The paper tackles privacy risks in visual self-supervised learning by proposing PartCrop, a unified membership inference method that operates under a black-box setting where the training recipe is unknown. It exploits a shared part-aware capability across SSL paradigms by querying image parts and analyzing their distributional responses to form membership features, learned by a simple attacker. Comprehensive experiments across MAE, DINO, MoCo and multiple datasets demonstrate PartCrop's superior attack performance relative to baselines, including EncoderMI, and show its generalization to additional SSL paradigms; defenses such as early stop, differential privacy, and a novel shrinking crop scale range are evaluated, with SCSR often providing strong privacy gains with acceptable utility costs. Overall, PartCrop offers a practical, cross-paradigm MI framework for SSL models and highlights actionable defense strategies for real-world deployed systems.
Abstract
Self-supervised learning shows promise in harnessing extensive unlabeled data, but it also confronts significant privacy concerns, especially in vision. In this paper, we aim to perform membership inference on visual self-supervised models in a more realistic setting: self-supervised training method and details are unknown for an adversary when attacking as he usually faces a black-box system in practice. In this setting, considering that self-supervised model could be trained by completely different self-supervised paradigms, e.g., masked image modeling and contrastive learning, with complex training details, we propose a unified membership inference method called PartCrop. It is motivated by the shared part-aware capability among models and stronger part response on the training data. Specifically, PartCrop crops parts of objects in an image to query responses with the image in representation space. We conduct extensive attacks on self-supervised models with different training protocols and structures using three widely used image datasets. The results verify the effectiveness and generalization of PartCrop. Moreover, to defend against PartCrop, we evaluate two common approaches, i.e., early stop and differential privacy, and propose a tailored method called shrinking crop scale range. The defense experiments indicate that all of them are effective. Our code is available at https://github.com/JiePKU/PartCrop.
