DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

Dominik Schlechtweg; Nina Tahmasebi; Simon Hengchen; Haim Dubossarsky; Barbara McGillivray

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

Dominik Schlechtweg, Nina Tahmasebi, Simon Hengchen, Haim Dubossarsky, Barbara McGillivray

TL;DR

The paper tackles the challenge of capturing diachronic, graded word meaning at scale by introducing DWUGs, a large multilingual resource built via two annotation paradigms that generate Usage Usage Graphs (UUGs) and Usage-Sense Graphs (USGs). It combines multi-round, edge-efficient annotation with a robust clustering objective $L(C)$ to infer sense structure across English, German, Swedish, and Latin, yielding around 100k judgments across two time periods per language. Key contributions include the largest diachronic graded-meaning dataset to date, a detailed annotation pipeline with robustness analyses, and public release of clusterings and visualizations to support evaluation of contextualized embeddings and semantic-change detection models. The work demonstrates practical impact for evaluating and training diachronic NLP systems and shows how Latin adaptation can be integrated when native annotators are limited. Overall, the resource enables nuanced, time-aware semantic evaluation across multiple languages and provides a foundation for future improvements in clustering methods and annotation strategies.

Abstract

Word meaning is notoriously difficult to capture, both synchronically and diachronically. In this paper, we describe the creation of the largest resource of graded contextualized, diachronic word meaning annotation in four different languages, based on 100,000 human semantic proximity judgments. We thoroughly describe the multi-round incremental annotation process, the choice for a clustering algorithm to group usages into senses, and possible - diachronic and synchronic - uses for this dataset.

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

TL;DR

to infer sense structure across English, German, Swedish, and Latin, yielding around 100k judgments across two time periods per language. Key contributions include the largest diachronic graded-meaning dataset to date, a detailed annotation pipeline with robustness analyses, and public release of clusterings and visualizations to support evaluation of contextualized embeddings and semantic-change detection models. The work demonstrates practical impact for evaluating and training diachronic NLP systems and shows how Latin adaptation can be integrated when native annotators are limited. Overall, the resource enables nuanced, time-aware semantic evaluation across multiple languages and provides a foundation for future improvements in clustering methods and annotation strategies.

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

TL;DR

Abstract

DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)