MAP: MAsk-Pruning for Source-Free Model Intellectual Property Protection
Boyang Peng, Sanqing Qu, Yong Wu, Tianpei Zou, Lianghua He, Alois Knoll, Guang Chen, changjun jiang
TL;DR
This work tackles the practical problem of protecting pretrained model IP without access to the original training data by introducing MAsk Pruning (MAP). Grounded in the Inverse Transfer Parameter Hypothesis, MAP prunes target-domain–related parameters via a Learnable Binary Mask, constraining the model's generalization to unauthorized domains while preserving authorized-domain performance. It extends to two more capable variants, SF-MAP and DF-MAP, which synthesize pseudo-source data and diverse neighbor domains to guide pruning in source-free and data-free settings, respectively, and introduces the ST-D metric to balance source and target degradation. Across multiple benchmarks, MAP achieves state-of-the-art IP protection in source-available, source-free, and data-free scenarios, with practical implications for decentralized data privacy and ownership verification via watermarking.
Abstract
Deep learning has achieved remarkable progress in various applications, heightening the importance of safeguarding the intellectual property (IP) of well-trained models. It entails not only authorizing usage but also ensuring the deployment of models in authorized data domains, i.e., making models exclusive to certain target domains. Previous methods necessitate concurrent access to source training data and target unauthorized data when performing IP protection, making them risky and inefficient for decentralized private data. In this paper, we target a practical setting where only a well-trained source model is available and investigate how we can realize IP protection. To achieve this, we propose a novel MAsk Pruning (MAP) framework. MAP stems from an intuitive hypothesis, i.e., there are target-related parameters in a well-trained model, locating and pruning them is the key to IP protection. Technically, MAP freezes the source model and learns a target-specific binary mask to prevent unauthorized data usage while minimizing performance degradation on authorized data. Moreover, we introduce a new metric aimed at achieving a better balance between source and target performance degradation. To verify the effectiveness and versatility, we have evaluated MAP in a variety of scenarios, including vanilla source-available, practical source-free, and challenging data-free. Extensive experiments indicate that MAP yields new state-of-the-art performance.
