TabAttackBench: A Benchmark for Adversarial Attacks on Tabular Data
Zhipeng He, Chun Ouyang, Lijie Wen, Cong Liu, Catarina Moreira
TL;DR
TabAttackBench introduces a unified benchmark for adversarial attacks on tabular data, evaluating five white-box attacks (FGSM, BIM, PGD, DeepFool, C&W) across four predictive models (LR, MLP, TabTransformer, FT-Transformer) on 11 datasets. It jointly measures effectiveness (attack success rate) and imperceptibility using four metrics (Proximity, Sparsity, Deviation, Sensitivity), revealing a clear trade-off: ℓ∞-based attacks tend to be more effective but less imperceptible, while ℓ2-based attacks produce more realistic perturbations. The framework exposes dataset- and model-dependent patterns, including pronounced numerical feature perturbation, proximity and deviation dynamics, and transformer-model robustness, providing actionable insights for designing more imperceptible attacks and for developing robust defenses. By offering standardized preprocessing, reproducible pipelines, and open resources, the paper establishes a practical reference for tabular adversarial robustness research and future benchmark development.
Abstract
Adversarial attacks pose a significant threat to machine learning models by inducing incorrect predictions through imperceptible perturbations to input data. While these attacks are well studied in unstructured domains such as images, their behaviour on tabular data remains underexplored due to mixed feature types and complex inter-feature dependencies. This study introduces a comprehensive benchmark that evaluates adversarial attacks on tabular datasets with respect to both effectiveness and imperceptibility. We assess five white-box attack algorithms (FGSM, BIM, PGD, DeepFool, and C\&W) across four representative models (LR, MLP, TabTransformer and FT-Transformer) using eleven datasets spanning finance, energy, and healthcare domains. The benchmark employs four quantitative imperceptibility metrics (proximity, sparsity, deviation, and sensitivity) to characterise perturbation realism. The analysis quantifies the trade-off between these two aspects and reveals consistent differences between attack types, with $\ell_\infty$-based attacks achieving higher success but lower subtlety, and $\ell_2$-based attacks offering more realistic perturbations. The benchmark findings offer actionable insights for designing more imperceptible adversarial attacks, advancing the understanding of adversarial vulnerability in tabular machine learning.
