Enhancing spatial auditory attention decoding with neuroscience-inspired prototype training
Zelin Qiu, Jianjun Gu, Dingding Yao, Junfeng Li
TL;DR
This work tackles the variability caused by trial-specific EEG fingerprints in spatial auditory attention decoding (Sp-AAD) by introducing Prototype Training, a neuroscience-inspired method that creates prototypes from multiple trials to emphasize energy-distribution features. Paired with EEGWaveNet, a wavelet-transformed time-frequency decoder, the approach improves cross-trial generalization and provides comprehensive benchmarking across three datasets and multiple data-partitioning schemes. Key findings show that prototype training yields gains in cross-trial scenarios (notably with $K$ around 25) and that time-frequency energy representations better capture auditory attention features than time-domain signals. The proposed framework offers a practical, architecture-agnostic training paradigm that reduces trial-specific bias and provides a rich benchmarking resource for Sp-AAD research.
Abstract
The spatial auditory attention decoding (Sp-AAD) technology aims to determine the direction of auditory attention in multi-talker scenarios via neural recordings. Despite the success of recent Sp-AAD algorithms, their performance is hindered by trial-specific features in EEG data. This study aims to improve decoding performance against these features. Studies in neuroscience indicate that spatial auditory attention can be reflected in the topological distribution of EEG energy across different frequency bands. This insight motivates us to propose Prototype Training, a neuroscience-inspired method for Sp-AAD. This method constructs prototypes with enhanced energy distribution representations and reduced trial-specific characteristics, enabling the model to better capture auditory attention features. To implement prototype training, an EEGWaveNet that employs the wavelet transform of EEG is further proposed. Detailed experiments indicate that the EEGWaveNet with prototype training outperforms other competitive models on various datasets, and the effectiveness of the proposed method is also validated. As a training method independent of model architecture, prototype training offers new insights into the field of Sp-AAD.
