SSCL-BW: Sample-Specific Clean-Label Backdoor Watermarking for Dataset Ownership Verification
Yingjia Wang, Ting Qiao, Xing Liu, Chongzuo Li, Sixing Wu, Jianbin Li
TL;DR
SSCL-BW addresses dataset ownership verification by embedding sample specific clean-label backdoor watermarks using a U-Net based watermark generator. It introduces a three component loss $L = a L_t + b L_{nt} + c L_{lpips}$ to enforce target and non target behavior while preserving perceptual quality. The method embeds watermarks into a subset of target class samples and uses black box verification with a hypothesis test to detect unauthorized use, achieving high Delta P and low p values while remaining robust to watermark removal and model architecture changes. Experiments on CIFAR-10, Sub-ImageNet and MNIST demonstrate strong watermarking efficacy, verification reliability, and transferability, supporting practical dataset copyright protection.
Abstract
The rapid advancement of deep neural networks (DNNs) heavily relies on large-scale, high-quality datasets. However, unauthorized commercial use of these datasets severely violates the intellectual property rights of dataset owners. Existing backdoor-based dataset ownership verification methods suffer from inherent limitations: poison-label watermarks are easily detectable due to label inconsistencies, while clean-label watermarks face high technical complexity and failure on high-resolution images. Moreover, both approaches employ static watermark patterns that are vulnerable to detection and removal. To address these issues, this paper proposes a sample-specific clean-label backdoor watermarking (i.e., SSCL-BW). By training a U-Net-based watermarked sample generator, this method generates unique watermarks for each sample, fundamentally overcoming the vulnerability of static watermark patterns. The core innovation lies in designing a composite loss function with three components: target sample loss ensures watermark effectiveness, non-target sample loss guarantees trigger reliability, and perceptual similarity loss maintains visual imperceptibility. During ownership verification, black-box testing is employed to check whether suspicious models exhibit predefined backdoor behaviors. Extensive experiments on benchmark datasets demonstrate the effectiveness of the proposed method and its robustness against potential watermark removal attacks.
