Protecting Intellectual Property of EEG-based Neural Networks with Watermarking
Ahmed Abdelaziz, Ahmed Fathi, Ahmed Fares
TL;DR
The paper addresses IP protection for EEG-based neural networks by introducing a cryptographic wonder-filter watermarking framework that embeds a verifiable watermark during training with minimal impact on EEG task performance. It leverages out-of-bound input values and a dual normal/null embedding scheme to achieve persistence and deter piracy, underpinned by digital signatures and collision-resistant hashing for authentication. Evaluations on the DEAP dataset across CCNN, EEGNet, and TSception show strong watermark detectability (near $100\%$ for true embeddings) and robustness to fine-tuning, transfer learning, and pruning, while secondary watermarks suffer significant accuracy losses, discouraging ownership piracy. The approach provides a tamper-proof, auditable IP protection mechanism suitable for healthcare and biometric applications, addressing limitations of prior trigger-based methods and enabling cryptographic ownership verification.
Abstract
EEG-based neural networks, pivotal in medical diagnosis and brain-computer interfaces, face significant intellectual property (IP) risks due to their reliance on sensitive neurophysiological data and resource-intensive development. Current watermarking methods, particularly those using abstract trigger sets, lack robust authentication and fail to address the unique challenges of EEG models. This paper introduces a cryptographic wonder filter-based watermarking framework tailored for EEG-based neural networks. Leveraging collision-resistant hashing and public-key encryption, the wonder filter embeds the watermark during training, ensuring minimal distortion ($\leq 5\%$ drop in EEG task accuracy) and high reliability (100\% watermark detection). The framework is rigorously evaluated against adversarial attacks, including fine-tuning, transfer learning, and neuron pruning. Results demonstrate persistent watermark retention, with classification accuracy for watermarked states remaining above 90\% even after aggressive pruning, while primary task performance degrades faster, deterring removal attempts. Piracy resistance is validated by the inability to embed secondary watermarks without severe accuracy loss ( $>10\%$ in EEGNet and CCNN models). Cryptographic hashing ensures authentication, reducing brute-force attack success probabilities. Evaluated on the DEAP dataset across models (CCNN, EEGNet, TSception), the method achieves $>99.4\%$ null-embedding accuracy, effectively eliminating false positives. By integrating wonder filters with EEG-specific adaptations, this work bridges a critical gap in IP protection for neurophysiological models, offering a secure, tamper-proof solution for healthcare and biometric applications. The framework's robustness against adversarial modifications underscores its potential to safeguard sensitive EEG models while maintaining diagnostic utility.
