Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Yubo Wang; Jianting Tang; Chaohu Liu; Linli Xu

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

Yubo Wang, Jianting Tang, Chaohu Liu, Linli Xu

TL;DR

Facing copyright and unauthorized fine-tuning of LVLMs, the authors propose Parameter Learning Attack (PLA), which generates trigger images by adversarially attacking the original model while updating parameters in the opposite direction to preserve the published model and enable tracking of fine-tuned derivatives. The method designs rare question–answer pairs and constructs triggers that cause both the original and derivative models to output a predetermined target, measured by Target Match Rate (TMR). Empirical results on LLaVA-1.5 across six downstream fine-tuning scenarios show PLA outperforms backdoor-based and ordinary adversarial baselines, with robustness to input transformations and parameter perturbations. The work provides a practical, post-release copyright-protection mechanism for LVLMs and demonstrates generalizability to multiple LVLM architectures and fine-tuning strategies.

Abstract

Large vision-language models (LVLMs) have demonstrated remarkable image understanding and dialogue capabilities, allowing them to handle a variety of visual question answering tasks. However, their widespread availability raises concerns about unauthorized usage and copyright infringement, where users or individuals can develop their own LVLMs by fine-tuning published models. In this paper, we propose a novel method called Parameter Learning Attack (PLA) for tracking the copyright of LVLMs without modifying the original model. Specifically, we construct adversarial images through targeted attacks against the original model, enabling it to generate specific outputs. To ensure these attacks remain effective on potential fine-tuned models to trigger copyright tracking, we allow the original model to learn the trigger images by updating parameters in the opposite direction during the adversarial attack process. Notably, the proposed method can be applied after the release of the original model, thus not affecting the model's performance and behavior. To simulate real-world applications, we fine-tune the original model using various strategies across diverse datasets, creating a range of models for copyright verification. Extensive experiments demonstrate that our method can more effectively identify the original copyright of fine-tuned models compared to baseline methods. Therefore, this work provides a powerful tool for tracking copyrights and detecting unlicensed usage of LVLMs.

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

TL;DR

Abstract

Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)