Table of Contents
Fetching ...

Aligning Brain Activity with Advanced Transformer Models: Exploring the Role of Punctuation in Semantic Processing

Zenon Lamprou, Frank Polick, Yashar Moshfeghi

TL;DR

The paper investigates how human brain activity aligns with state-of-the-art transformer models and whether punctuation affects semantic processing. Extending the Toneva & Wehbe framework, it evaluates RoBERTa, DistiliBERT, ALBERT, and ELECTRA against fMRI data collected as participants read a chapter from Harry Potter, while also testing four punctuation-removal scenarios. The results show RoBERTa (and to a lesser extent DistiliBERT) achieving the strongest brain alignment, whereas ALBERT and ELECTRA do not surpass BERT; notably, removing punctuation generally improves the brain-to-model mapping. These findings suggest the brain relies less on punctuation for semantic interpretation than the models do, and demonstrate the utility of brain-to-model alignment as a tool to probe and improve language representations in NLP systems.

Abstract

This research examines the congruence between neural activity and advanced transformer models, emphasizing the semantic significance of punctuation in text understanding. Utilizing an innovative approach originally proposed by Toneva and Wehbe, we evaluate four advanced transformer models RoBERTa, DistiliBERT, ALBERT, and ELECTRA against neural activity data. Our findings indicate that RoBERTa exhibits the closest alignment with neural activity, surpassing BERT in accuracy. Furthermore, we investigate the impact of punctuation removal on model performance and neural alignment, revealing that BERT's accuracy enhances in the absence of punctuation. This study contributes to the comprehension of how neural networks represent language and the influence of punctuation on semantic processing within the human brain.

Aligning Brain Activity with Advanced Transformer Models: Exploring the Role of Punctuation in Semantic Processing

TL;DR

The paper investigates how human brain activity aligns with state-of-the-art transformer models and whether punctuation affects semantic processing. Extending the Toneva & Wehbe framework, it evaluates RoBERTa, DistiliBERT, ALBERT, and ELECTRA against fMRI data collected as participants read a chapter from Harry Potter, while also testing four punctuation-removal scenarios. The results show RoBERTa (and to a lesser extent DistiliBERT) achieving the strongest brain alignment, whereas ALBERT and ELECTRA do not surpass BERT; notably, removing punctuation generally improves the brain-to-model mapping. These findings suggest the brain relies less on punctuation for semantic interpretation than the models do, and demonstrate the utility of brain-to-model alignment as a tool to probe and improve language representations in NLP systems.

Abstract

This research examines the congruence between neural activity and advanced transformer models, emphasizing the semantic significance of punctuation in text understanding. Utilizing an innovative approach originally proposed by Toneva and Wehbe, we evaluate four advanced transformer models RoBERTa, DistiliBERT, ALBERT, and ELECTRA against neural activity data. Our findings indicate that RoBERTa exhibits the closest alignment with neural activity, surpassing BERT in accuracy. Furthermore, we investigate the impact of punctuation removal on model performance and neural alignment, revealing that BERT's accuracy enhances in the absence of punctuation. This study contributes to the comprehension of how neural networks represent language and the influence of punctuation on semantic processing within the human brain.
Paper Structure (17 sections, 3 figures)

This paper contains 17 sections, 3 figures.

Figures (3)

  • Figure 1: This figure shows the overall accuracy across all subjects for the four different models we investigated.
  • Figure 2: On the left hand side are the original results reported in toneva_interpreting_2019. On the right hand side are the reproduced results we obtained when running our code for BERT. Note that the y-axis has a different minimum and maximum value for the two panels, original and reproduced
  • Figure 3: The figures presents the results of the 4 different punctuation scenarios.