Timber! Poisoning Decision Trees
Stefano Calzavara, Lorenzo Cazzaro, Massimo Vettori
TL;DR
Timber advances adversarial research by introducing the first white-box poisoning attack for decision trees, leveraging a tree-annotation mechanism and subtree retraining to efficiently estimate and realize the impact of label flips. The approach scales to large datasets and extends to random forests, aided by an optional early-stopping variant to reduce runtime. Empirical results on four public datasets show Timber and its variant generally outperform baselines in both effectiveness and efficiency, with model-agnostic defenses providing only partial mitigation. This work highlights a critical vulnerability of tree-based methods and motivates developing defenses tailored to decision-tree ensembles and non-differentiable learners.
Abstract
We present Timber, the first white-box poisoning attack targeting decision trees. Timber is based on a greedy attack strategy that leverages sub-tree retraining to efficiently estimate the damage caused by poisoning a given training instance. The attack relies on a tree annotation procedure, which enables the sorting of training instances so that they are processed in increasing order of the computational cost of sub-tree retraining. This sorting yields a variant of Timber that supports an early stopping criterion, designed to make poisoning attacks more efficient and feasible on larger datasets. We also discuss an extension of Timber to traditional random forest models, which is valuable since decision trees are typically combined into ensembles to improve their predictive power. Our experimental evaluation on public datasets demonstrates that our attacks outperform existing baselines in terms of effectiveness, efficiency, or both. Moreover, we show that two representative defenses can mitigate the effect of our attacks, but fail to effectively thwart them.
