Table of Contents
Fetching ...

Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features

Li Zhou, Antonia Karamolegkou, Wenyu Chen, Daniel Hershcovich

TL;DR

The findings reveal that cultural value surveys indeed possess a predictive power for cross-cultural transfer learning success in OLD tasks and that it can be further improved using offensive word distance.

Abstract

The increasing ubiquity of language technology necessitates a shift towards considering cultural diversity in the machine learning realm, particularly for subjective tasks that rely heavily on cultural nuances, such as Offensive Language Detection (OLD). Current understanding underscores that these tasks are substantially influenced by cultural values, however, a notable gap exists in determining if cultural features can accurately predict the success of cross-cultural transfer learning for such subjective tasks. Addressing this, our study delves into the intersection of cultural features and transfer learning effectiveness. The findings reveal that cultural value surveys indeed possess a predictive power for cross-cultural transfer learning success in OLD tasks and that it can be further improved using offensive word distance. Based on these results, we advocate for the integration of cultural information into datasets. Additionally, we recommend leveraging data sources rich in cultural information, such as surveys, to enhance cultural adaptability. Our research signifies a step forward in the quest for more inclusive, culturally sensitive language technologies.

Cultural Compass: Predicting Transfer Learning Success in Offensive Language Detection with Cultural Features

TL;DR

The findings reveal that cultural value surveys indeed possess a predictive power for cross-cultural transfer learning success in OLD tasks and that it can be further improved using offensive word distance.

Abstract

The increasing ubiquity of language technology necessitates a shift towards considering cultural diversity in the machine learning realm, particularly for subjective tasks that rely heavily on cultural nuances, such as Offensive Language Detection (OLD). Current understanding underscores that these tasks are substantially influenced by cultural values, however, a notable gap exists in determining if cultural features can accurately predict the success of cross-cultural transfer learning for such subjective tasks. Addressing this, our study delves into the intersection of cultural features and transfer learning effectiveness. The findings reveal that cultural value surveys indeed possess a predictive power for cross-cultural transfer learning success in OLD tasks and that it can be further improved using offensive word distance. Based on these results, we advocate for the integration of cultural information into datasets. Additionally, we recommend leveraging data sources rich in cultural information, such as surveys, to enhance cultural adaptability. Our research signifies a step forward in the quest for more inclusive, culturally sensitive language technologies.
Paper Structure (42 sections, 3 equations, 5 figures, 10 tables)

This paper contains 42 sections, 3 equations, 5 figures, 10 tables.

Figures (5)

  • Figure 1: T-SNE Visualization of Cultural Values: Countries represented as data points, with color-coded regions highlighting similarities and differences in cultural dimensions.
  • Figure 2: Network based on Offensive Word Distance. Edges connect languages based on their top-2 closest rankings. Solid lines represent mutual ranking, while dashed lines represent exclusive ranking.
  • Figure 3: Relative losses (in percent) from zero-shot transfer learning models over the intra-cultural models.
  • Figure 4: A pairplot showcasing the relationship between culturally relevant features using kernel density estimation (KDE), accompanied by correlation coefficients (in red digit) annotated on the corresponding scatter plots. The diagonal plots display the distribution of each feature using KDE curves, while the lower triangle depicts the joint distributions and contour plots using a color gradient (Blues colormap).
  • Figure 5: Fine-grained correlation analysis of dimension features in CulDim.