CBW: Towards Dataset Ownership Verification for Speaker Verification via Clustering-based Backdoor Watermarking
Yiming Li, Kaiying Yan, Shuo Shao, Tongqing Zhai, Shu-Tao Xia, Zhan Qin, Dacheng Tao
TL;DR
The paper addresses copyright protection for public speaker verification datasets by proposing clustering-based backdoor watermarking (CBW) and a hypothesis-test–based ownership verification framework suitable for black-box model evaluation. CBW watermarks data by clustering speakers in feature space and implanting cluster-specific triggers, enabling distinctive backdoor behavior without exposing enrolled speakers, and it is paired with both similarity-available and decision-only verification modes. The authors provide theoretical analyses showing verification viability with practical watermarking rates and conduct extensive experiments across multiple models and datasets, demonstrating high watermark success rates and low impact on normal performance, along with robustness to adaptive attacks and transferability across models. The work offers a practical, scalable approach to trustworthy data sharing and licensing in biometric verification, with potential extensions to other verification tasks. Overall, CBW advances dataset copyright protection by marrying clustered trigger design with rigorous statistical verification in a black-box setting, supported by empirical and theoretical guarantees.
Abstract
With the increasing adoption of deep learning in speaker verification, large-scale speech datasets have become valuable intellectual property. To audit and prevent the unauthorized usage of these valuable released datasets, especially in commercial or open-source scenarios, we propose a novel dataset ownership verification method. Our approach introduces a clustering-based backdoor watermark (CBW), enabling dataset owners to determine whether a suspicious third-party model has been trained on a protected dataset under a black-box setting. The CBW method consists of two key stages: dataset watermarking and ownership verification. During watermarking, we implant multiple trigger patterns in the dataset to make similar samples (measured by their feature similarities) close to the same trigger while dissimilar samples are near different triggers. This ensures that any model trained on the watermarked dataset exhibits specific misclassification behaviors when exposed to trigger-embedded inputs. To verify dataset ownership, we design a hypothesis-test-based framework that statistically evaluates whether a suspicious model exhibits the expected backdoor behavior. We conduct extensive experiments on benchmark datasets, verifying the effectiveness and robustness of our method against potential adaptive attacks. The code for reproducing main experiments is available at https://github.com/Radiant0726/CBW
