Investigating the Contextualised Word Embedding Dimensions Specified for Contextual and Temporal Semantic Changes

Taichi Aida; Danushka Bollegala

TL;DR

Comparing pre-trained contextualised word embeddings (CWEs) and their fine-tuned versions on contextual and temporal semantic change benchmarks under Principal Component Analysis (PCA) and Independent Component Analysis (ICA) transformations shows that PCA represents semantic changes better than ICA within the top 10% of axes.

Abstract

Sense-aware contextualised word embeddings (SCWEs) encode semantic changes of words within the contextualised word embedding (CWE) space. Despite the superior performance of SCWEs on contextual and temporal semantic change detection (SCD) benchmarks, it remains unclear how meaning changes are encoded in the embedding space. To study this, we compare pre-trained CWEs and their fine-tuned versions on contextual and temporal semantic change benchmarks under Principal Component Analysis (PCA) and Independent Component Analysis (ICA) transformations. Our experimental results reveal that (a) although a small number of axes are specific to semantic changes of words in the pre-trained CWE space, this information becomes distributed across all dimensions after fine-tuning, and (b) in contrast to prior work studying the geometry of CWEs, PCA represents semantic changes better than ICA within the top 10% of axes. These findings encourage the development of more efficient SCD methods that use a small number of SCD-aware dimensions. Source code is available at https://github.com/LivNLP/svp-dims.
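To make the idea of keeping only the "top 10% of axes" concrete, the following is a minimal sketch of a PCA transformation followed by a distance computed on the leading axes only. It is not the authors' implementation (see the linked repository for that). The function names, the random stand-in data, and the use of cosine distance as the change score are all assumptions made for illustration.

```python
import numpy as np

def fit_pca(embeddings):
    """Return (mean, components); rows of components are principal axes
    ordered by decreasing explained variance."""
    mean = embeddings.mean(axis=0)
    # SVD of the centred data: rows of vt are the principal axes,
    # already sorted by decreasing singular value.
    _, _, vt = np.linalg.svd(embeddings - mean, full_matrices=False)
    return mean, vt

def project_top_k(embeddings, mean, components, ratio=0.10):
    """Project onto the top `ratio` fraction of principal axes (at least one)."""
    k = max(1, int(round(ratio * components.shape[0])))
    return (embeddings - mean) @ components[:k].T

def cosine_distance(a, b):
    """Row-wise cosine distance between two equally shaped matrices."""
    num = np.sum(a * b, axis=1)
    den = np.linalg.norm(a, axis=1) * np.linalg.norm(b, axis=1)
    return 1.0 - num / den

rng = np.random.default_rng(0)
# Hypothetical stand-ins for embeddings of target words in two contexts
# (or two time periods); real CWEs would come from a language model.
ctx1 = rng.normal(size=(200, 50))
ctx2 = ctx1 + rng.normal(scale=0.5, size=(200, 50))

# Fit the axes on all embeddings, then score each pair on the top 10% of axes.
mean, components = fit_pca(np.vstack([ctx1, ctx2]))
z1 = project_top_k(ctx1, mean, components)
z2 = project_top_k(ctx2, mean, components)
scores = cosine_distance(z1, z2)
print(z1.shape, scores.shape)
```

An ICA variant could follow the same pattern by swapping the fitting step for an ICA routine such as scikit-learn's FastICA. How the resulting axes should be ranked is a separate design choice that this sketch does not cover.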

Paper Structure (13 sections, 168 figures, 3 tables)

Introduction
Task Description
Contextual Semantic Change Detection Task
Temporal Semantic Change Detection Task
Models
Contextual Semantic Changes
RQ1: When do the contextual SCD-aware axes emerge?
RQ2: Can top-$k$ PCA/ICA-transformed axes capture contextual semantic changes?
Temporal Semantic Changes
RQ3: Can top-$k$ PCA/ICA-transformed axes capture temporal semantic changes?
Conclusion
Data Statistics
Full Results

Figures (168)

Figure 1: Pre-trained CWE, Raw
Figure 2: Pre-trained CWE, PCA
Figure 3: Pre-trained CWE, ICA
Figure 4: Fine-tuned SCWE, Raw
Figure 5: Fine-tuned SCWE, PCA
...and 163 more figures
