Catch-Only-One: Non-Transferable Examples for Model-Specific Authorization
Zihan Wang, Zhiyong Ma, Zhongkui Ma, Shuofeng Liu, Akide Liu, Derui Wang, Minhui Xue, Guangdong Bai
TL;DR
This work tackles how to preserve data utility for an authorized model while preventing misuse by unknown models at inference time. It introduces non-transferable examples (Nes), a training-free input-side recoding strategy that confines perturbations to the authorized model's insensitivity subspace, keeping $f^\star$'s predictions intact while degrading non-target models. The authors provide a formal framework and theoretical guarantees based on spectral perturbation and the Hoffman-Wielandt inequality, and validate across diverse architectures and modalities (including vision-language models) with strong cross-model non-transferability and robustness to common preprocessing and reconstruction attacks. Collectively, Nes offer a practical, scalable approach to model-level usage control without retraining or heavy cryptographic cost.
Abstract
Recent AI regulations call for data that remain useful for innovation while resistant to misuse, balancing utility with protection at the model level. Existing approaches either perturb data to make it unlearnable or retrain models to suppress transfer, but neither governs inference by unknown models, and both typically require control over training. We propose non-transferable examples (NEs), a training-free and data-agnostic input-side usage-control mechanism. We recode inputs within a model-specific low-sensitivity subspace, preserving outputs for the authorized model while reducing performance on unauthorized models through subspace misalignment. We establish formal bounds that guarantee utility for the authorized model and quantify deviation for unauthorized ones, with the Hoffman-Wielandt inequality linking degradation to spectral differences. Empirically, NEs retain performance on diverse vision backbones and state-of-the-art vision-language models under common preprocessing, whereas non-target models collapse even with reconstruction attempts. These results establish NEs as a practical means to preserve intended data utility while preventing unauthorized exploitation. Our project is available at https://trusted-system-lab.github.io/model-specificity
