Understanding the Vulnerability of Skeleton-based Human Activity Recognition via Black-box Attack
Yunfeng Diao, He Wang, Tianjia Shao, Yong-Liang Yang, Kun Zhou, David Hogg, Meng Wang
TL;DR
This work shows that skeleton-based HAR systems are vulnerable to true black-box adversarial attacks by introducing BASAR, a method that crafts on-manifold adversarial motions via Guided Manifold Walk. It demonstrates that on-manifold adversaries are common and perceptually plausible, challenging the assumption that perturbations must lie off the data manifold. To counter this threat, the authors propose Mixed On-manifold Adversarial Training (MMAT), which jointly leverages on- and off-manifold adversarial samples to improve both accuracy and robustness without sacrificing performance on clean data. The approach is validated across multiple models and large-scale datasets, supported by perceptual studies, and shows practical implications for the security of HAR systems as well as a general defense framework for time-series tasks.
Abstract
Human Activity Recognition (HAR) has been employed in a wide range of applications, e.g. self-driving cars, where safety and lives are at stake. Recently, the robustness of skeleton-based HAR methods have been questioned due to their vulnerability to adversarial attacks. However, the proposed attacks require the full-knowledge of the attacked classifier, which is overly restrictive. In this paper, we show such threats indeed exist, even when the attacker only has access to the input/output of the model. To this end, we propose the very first black-box adversarial attack approach in skeleton-based HAR called BASAR. BASAR explores the interplay between the classification boundary and the natural motion manifold. To our best knowledge, this is the first time data manifold is introduced in adversarial attacks on time series. Via BASAR, we find on-manifold adversarial samples are extremely deceitful and rather common in skeletal motions, in contrast to the common belief that adversarial samples only exist off-manifold. Through exhaustive evaluation, we show that BASAR can deliver successful attacks across classifiers, datasets, and attack modes. By attack, BASAR helps identify the potential causes of the model vulnerability and provides insights on possible improvements. Finally, to mitigate the newly identified threat, we propose a new adversarial training approach by leveraging the sophisticated distributions of on/off-manifold adversarial samples, called mixed manifold-based adversarial training (MMAT). MMAT can successfully help defend against adversarial attacks without compromising classification accuracy.
