Adversarial Artifact Detection in EEG-Based Brain-Computer Interfaces

Xiaoqing Chen; Dongrui Wu

Adversarial Artifact Detection in EEG-Based Brain-Computer Interfaces

Xiaoqing Chen, Dongrui Wu

TL;DR

Security concerns in EEG-based BCIs are highlighted and adversarial detection is motivated as the first systematic study in this domain, noting that small input perturbations can cause misclassification. The authors adapt three detection methods—Bayesian Uncertainty (BU), Local Intrinsic Dimensionality (LID), and Mahalanobis distance-based confidence (MD_max)—and evaluate them against FGSM, PGD, CW, and transferability-based black-box attacks on two EEG datasets using three CNNs, with features drawn from the network's last layer. Findings show that white-box attacks are easier to detect than black-box ones; LID generally provides the strongest detection, while CW attacks are harder to detect; overall detection achieves meaningful AUC values across settings. The results support reactive adversarial detection as a viable defense for EEG BCIs and guide future work on combining detectors, expanding attack coverage, and integrating proactive defenses to enhance safety.

Abstract

Machine learning has achieved great success in electroencephalogram (EEG) based brain-computer interfaces (BCIs). Most existing BCI research focused on improving its accuracy, but few had considered its security. Recent studies, however, have shown that EEG-based BCIs are vulnerable to adversarial attacks, where small perturbations added to the input can cause misclassification. Detection of adversarial examples is crucial to both the understanding of this phenomenon and the defense. This paper, for the first time, explores adversarial detection in EEG-based BCIs. Experiments on two EEG datasets using three convolutional neural networks were performed to verify the performances of multiple detection approaches. We showed that both white-box and black-box attacks can be detected, and the former are easier to detect.

Adversarial Artifact Detection in EEG-Based Brain-Computer Interfaces

TL;DR

Abstract

Paper Structure (23 sections, 12 equations, 3 figures, 6 tables)

This paper contains 23 sections, 12 equations, 3 figures, 6 tables.

Introduction
Adversarial Attacks
Fast Gradient Sign Method (FGSM)
Projected Gradient Descent (PGD)
Carlini-Wagner (CW) Attack
Transferability-based Black-box Attack
Adversarial Detection
Bayesian Uncertainty
Local Intrinsic Dimensionality
Mahalanobis Distance-based Confidence Score
Experimental Setup
Datasets
Evaluated Models
Evaluation Settings
Parameter Tuning
...and 8 more sections

Figures (3)

Figure 1: The flowchart of a BCI system. Manual feature extraction is necessary if traditional machine learning algorithms are employed.
Figure 2: Examples of a P300 EEG epoch before and after adversarial attack. (a) FGSM; (b) PGD; and (c) CW.
Figure 3: $t$-SNE visualization of benign and corresponding adversarial MI EEG epochs in the output space of EEGNet. (a) FGSM; (b) PGD; and, (c) CW.

Adversarial Artifact Detection in EEG-Based Brain-Computer Interfaces

TL;DR

Abstract

Adversarial Artifact Detection in EEG-Based Brain-Computer Interfaces

Authors

TL;DR

Abstract

Table of Contents

Figures (3)