An Explainable Deep Learning-Based Method For Schizophrenia Diagnosis Using Generative Data-Augmentation
Mehrshad Saadatinia, Armin Salimi-Badr
TL;DR
This work addresses the lack of objective diagnostic tools for schizophrenia by leveraging EEG-derived spectrograms and a CNN classifier trained with generative data augmentation. It compares VAEs and WGAN-GP for synthetic data, finding that a VAE-based augmentation (VAE-700) substantially improves accuracy to 99.0% on a 16-channel dataset, while the 19-channel dataset already achieves 99.6% accuracy without augmentation. The paper also employs LIME to provide explanations, revealing frequency-region patterns that differentiate schizophrenic and healthy EEG spectra, thereby enhancing trust in the model. Overall, the study demonstrates that targeted generative augmentation can generalize well on EEG-based schizophrenia diagnosis and that explainability methods help interpret neural decisions for clinical use.
Abstract
In this study, we leverage a deep learning-based method for the automatic diagnosis of schizophrenia using EEG brain recordings. This approach utilizes generative data augmentation, a powerful technique that enhances the accuracy of the diagnosis. To enable the utilization of time-frequency features, spectrograms were extracted from the raw signals. After exploring several neural network architectural setups, a proper convolutional neural network (CNN) was used for the initial diagnosis. Subsequently, using Wasserstein GAN with Gradient Penalty (WGAN-GP) and Variational Autoencoder (VAE), two different synthetic datasets were generated in order to augment the initial dataset and address the over-fitting issue. The augmented dataset using VAE achieved a 3.0\% improvement in accuracy reaching up to 99.0\% and yielded a lower loss value as well as a faster convergence. Finally, we addressed the lack of trust in black-box models using the Local Interpretable Model-agnostic Explanations (LIME) algorithm to determine the most important superpixels (frequencies) in the diagnosis process.
