Detecting gamma-band responses to the speech envelope for the ICASSP 2024 Auditory EEG Decoding Signal Processing Grand Challenge
Mike Thornton, Jonas Auernheimer, Constantin Jehn, Danilo Mandic, Tobias Reichenbach
TL;DR
This work tackles the match-mismatch EEG decoding problem in speech perception by extending prior approaches to use gamma-band responses to the speech envelope. It introduces a dual-branch architecture with a low-frequency envelope-tracking decoder and a gamma-band envelope decoder, both trained with randomized mismatched segments and sharing a common deep-learning backbone. The two logits are fused with linear discriminant analysis in a composite decoder, which yields the best performance. On the ICASSP SPGC dataset, gamma-band evidence of envelope tracking is demonstrated, and ensemble averaging further boosts accuracy, highlighting the practical potential for robust multi-band auditory EEG decoding.
Abstract
The 2024 ICASSP Auditory EEG Signal Processing Grand Challenge concerns the decoding of electroencephalography (EEG) measurements taken from participants who listened to speech material. This work details our solution to the match-mismatch sub-task: given a short temporal segment of EEG recordings and several candidate speech segments, the task is to classify which of the speech segments was time-aligned with the EEG signals. We show that high-frequency gamma-band responses to the speech envelope can be detected with a high accuracy. By jointly assessing gamma-band responses and low-frequency envelope tracking, we develop a match-mismatch decoder which placed first in this task.
