Clinical translation of machine learning algorithms for seizure detection in scalp electroencephalography: systematic review
Nina Moutonnet, Steven White, Benjamin P Campbell, Saeid Sanei, Toshihisa Tanaka, Hong Ji, Danilo Mandic, Gregory Scott
TL;DR
This paper addresses the gap between ML-based seizure-detection research and clinical deployment using scalp EEG by performing a structured, domain-informed review. It synthesizes evidence from public datasets, systematic search, and a broad range of ML approaches (feature-based and DL), emphasizing generalisability, data heterogeneity, and clinically relevant evaluation. The authors provide concrete guidelines for improving clinical translatability, including multi-domain representations, diverse training data, patient-independent validation, and transparent post-processing, while highlighting the need for robust ground-truth labeling and accounting for non-ictal activity. The work aims to accelerate real-world adoption of automated seizure detection by clarifying methodological best practices and identifying critical research gaps.
Abstract
Machine learning algorithms for seizure detection have shown considerable diagnostic potential, with recent reported accuracies reaching 100%. Yet, only few published algorithms have fully addressed the requirements for successful clinical translation. This is, for example, because the properties of training data may limit the generalisability of algorithms, algorithm performance may vary depending on which electroencephalogram (EEG) acquisition hardware was used, or run-time processing costs may be prohibitive to real-time clinical use cases. To address these issues in a critical manner, we systematically review machine learning algorithms for seizure detection with a focus on clinical translatability, assessed by criteria including generalisability, run-time costs, explainability, and clinically-relevant performance metrics. For non-specialists, the domain-specific knowledge necessary to contextualise model development and evaluation is provided. It is our hope that such critical evaluation of machine learning algorithms with respect to their potential real-world effectiveness can help accelerate clinical translation and identify gaps in the current seizure detection literature.
