Watermarking Decision Tree Ensembles
Stefano Calzavara, Lorenzo Cazzaro, Donald Gera, Salvatore Orlando
TL;DR
The paper tackles intellectual-property protection for decision-tree ensembles, notably random forests, by embedding a watermark via a trigger set. The method learns two sub-ensembles and then composes the final watermark by selecting a tree from each sub-ensemble according to the binary signature $\{0,1\}^m$, with a trigger set $\mathcal{D}_{trigger}$ of size $k$ used during training to enforce targeted behavior. Security analysis shows robustness to detection and suppression, and proves watermark forgery is NP-hard via a 3SAT reduction. Experiments on public datasets demonstrate negligible accuracy loss and strong resistance to detection/suppression/forgery threats, indicating practical viability and suggesting extension to gradient-boosted ensembles.
Abstract
Protecting the intellectual property of machine learning models is a hot topic and many watermarking schemes for deep neural networks have been proposed in the literature. Unfortunately, prior work largely neglected the investigation of watermarking techniques for other types of models, including decision tree ensembles, which are a state-of-the-art model for classification tasks on non-perceptual data. In this paper, we present the first watermarking scheme designed for decision tree ensembles, focusing in particular on random forest models. We discuss watermark creation and verification, presenting a thorough security analysis with respect to possible attacks. We finally perform an experimental evaluation of the proposed scheme, showing excellent results in terms of accuracy and security against the most relevant threats.
