Robust Watermarking on Gradient Boosting Decision Trees
Jun Woo Chung, Yingjie Lao, Weijie Zhao
TL;DR
This paper addresses robust ownership protection for gradient boosting decision trees (GBDT) by introducing an in-place watermarking framework that embeds watermarks through fine-tuning existing trees rather than adding new ones. It proposes four embedding strategies—Wrong Prediction Flip, Outlier Flip, Cluster Center Flip, and Confidence Flip—to minimize accuracy impact while ensuring watermark robustness. Across diverse datasets, the methods achieve high watermarking effectiveness with limited degradation to general performance and demonstrate resilience to further fine-tuning, enabling post-deployment ownership verification. The work advances practical IP protection for GBDT models in industry and academia, providing guidance on strategy selection depending on data context and offering a path toward robust, post-hoc watermarking of non-differentiable, sequential tree ensembles.
Abstract
Gradient Boosting Decision Trees (GBDTs) are widely used in industry and academia for their high accuracy and efficiency, particularly on structured data. However, watermarking GBDT models remains underexplored compared to neural networks. In this work, we present the first robust watermarking framework tailored to GBDT models, utilizing in-place fine-tuning to embed imperceptible and resilient watermarks. We propose four embedding strategies, each designed to minimize impact on model accuracy while ensuring watermark robustness. Through experiments across diverse datasets, we demonstrate that our methods achieve high watermark embedding rates, low accuracy degradation, and strong resistance to post-deployment fine-tuning.
