Faster Repeated Evasion Attacks in Tree Ensembles
Lorenzo Cascioli, Laurens Devos, Ondřej Kuželka, Jesse Davis
TL;DR
This work tackles the expensive task of generating adversarial examples for tree ensembles by exploiting regularities across sequential evasion tasks. It identifies a small subset of features that are frequently perturbed and introduces two strategies, pruned and mixed, to constrain searches to this subset while preserving correctness guarantees for the mixed approach. The authors provide a statistically grounded method to identify relevant features and demonstrate substantial runtime speedups (up to 35x) across diverse datasets and ensemble types, applicable to both exact (Kantchelian MILP) and approximate (Veritas) attacks. The approach enhances efficiency for robustness assessment, empirical robustness, and model hardening, enabling faster, scalable adversarial analysis in high-dimensional settings.
Abstract
Tree ensembles are one of the most widely used model classes. However, these models are susceptible to adversarial examples, i.e., slightly perturbed examples that elicit a misprediction. There has been significant research on designing approaches to construct such examples for tree ensembles. But this is a computationally challenging problem that often must be solved a large number of times (e.g., for all examples in a training set). This is compounded by the fact that current approaches attempt to find such examples from scratch. In contrast, we exploit the fact that multiple similar problems are being solved. Specifically, our approach exploits the insight that adversarial examples for tree ensembles tend to perturb a consistent but relatively small set of features. We show that we can quickly identify this set of features and use this knowledge to speedup constructing adversarial examples.
