User Identity Protection in EEG-based Brain-Computer Interfaces
L. Meng, X. Jiang, J. Huang, W. Li, H. Luo, D. Wu
TL;DR
The paper identifies a privacy risk in EEG-based BCIs: user identity information is learnable from EEG data, enabling cross-session linking. It introduces two perturbation-based schemes—sample-wise and per-user—to generate identity-unlearnable EEG data under an $\ell_∞$ bound, preserving task accuracy measured by $BCA$. Across seven datasets and five paradigms, the methods reduce user identification accuracy from $UIA\approx70\%$ to $UIA\leq21.36\%$, with minimal impact on the primary BCI task and strong cross-model transferability. The work also demonstrates perturbation imperceptibility, effectiveness in online scenarios, and discusses limitations and avenues for future work in privacy-preserving BCIs.
Abstract
A brain-computer interface (BCI) establishes a direct communication pathway between the brain and an external device. Electroencephalogram (EEG) is the most popular input signal in BCIs, due to its convenience and low cost. Most research on EEG-based BCIs focuses on the accurate decoding of EEG signals; however, EEG signals also contain rich private information, e.g., user identity, emotion, and so on, which should be protected. This paper first exposes a serious privacy problem in EEG-based BCIs, i.e., the user identity in EEG data can be easily learned so that different sessions of EEG data from the same user can be associated together to more reliably mine private information. To address this issue, we further propose two approaches to convert the original EEG data into identity-unlearnable EEG data, i.e., removing the user identity information while maintaining the good performance on the primary BCI task. Experiments on seven EEG datasets from five different BCI paradigms showed that on average the generated identity-unlearnable EEG data can reduce the user identification accuracy from 70.01\% to at most 21.36\%, greatly facilitating user privacy protection in EEG-based BCIs.
