ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models

Linkang Du; Zheng Zhu; Min Chen; Zhou Su; Shouling Ji; Peng Cheng; Jiming Chen; Zhikun Zhang

ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models

Linkang Du, Zheng Zhu, Min Chen, Zhou Su, Shouling Ji, Peng Cheng, Jiming Chen, Zhikun Zhang

TL;DR

This work tackles copyright protection for artist styles in diffusion-based text-to-image generation by proposing ArtistAuditor, a data-use auditing framework that leverages multi-granularity CNN latent representations as intrinsic fingerprints of an artist's style. A discriminator is trained on public artworks, mimicked outputs, and augmented samples to produce confidence scores indicating whether a suspicious model has been finetuned on the target artist's work, with thresholding or one-sided t-testing to make decisions. The method demonstrates high discriminative performance (AUC > 0.937) across multiple models and datasets, robustness to some variations in captioning and data overlap, and practical viability demonstrated on an online platform, while maintaining data fidelity without modifying original artworks or retraining hidden models. The work provides actionable guidance on dataset preparation, augmentation, distortion calibration, and transferability, offering a scalable, black-box auditing solution for real-world art-style copyright protection in AI-generated content.

Abstract

Text-to-image models based on diffusion processes, such as DALL-E, Stable Diffusion, and Midjourney, are capable of transforming texts into detailed images and have widespread applications in art and design. As such, amateur users can easily imitate professional-level paintings by collecting an artist's work and fine-tuning the model, leading to concerns about artworks' copyright infringement. To tackle these issues, previous studies either add visually imperceptible perturbation to the artwork to change its underlying styles (perturbation-based methods) or embed post-training detectable watermarks in the artwork (watermark-based methods). However, when the artwork or the model has been published online, i.e., modification to the original artwork or model retraining is not feasible, these strategies might not be viable. To this end, we propose a novel method for data-use auditing in the text-to-image generation model. The general idea of ArtistAuditor is to identify if a suspicious model has been finetuned using the artworks of specific artists by analyzing the features related to the style. Concretely, ArtistAuditor employs a style extractor to obtain the multi-granularity style representations and treats artworks as samplings of an artist's style. Then, ArtistAuditor queries a trained discriminator to gain the auditing decisions. The experimental results on six combinations of models and datasets show that ArtistAuditor can achieve high AUC values (> 0.937). By studying ArtistAuditor's transferability and core modules, we provide valuable insights into the practical implementation. Finally, we demonstrate the effectiveness of ArtistAuditor in real-world cases by an online platform Scenario. ArtistAuditor is open-sourced at https://github.com/Jozenn/ArtistAuditor.

ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models

TL;DR

Abstract

ArtistAuditor: Auditing Artist Style Pirate in Text-to-Image Generation Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)