Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks
Roksana Goworek, Haim Dubossarsky
TL;DR
The study interrogates whether multilingual exposure genuinely boosts zero-shot transfer for sense-aware NLP tasks, focusing on polysemy disambiguation (WiC) and lexical semantic change (LSCD) across 28 languages. By employing a controlled, large-scale framework with fixed fine-tuning sizes and held-out languages across multiple multilingual bases, it isolates the effect of multilingual fine-tuning. The findings indicate that multilinguality is not necessary nor consistently beneficial; pretraining size and data quality primarily drive transfer, with language similarity playing a secondary role. The work highlights data-centric factors as key determinants of cross-lingual performance and offers fine-tuned baselines to inform future research on transfer, especially for low-resource languages.
Abstract
Cross-lingual transfer is central to modern NLP, enabling models to perform tasks in languages different from those they were trained on. A common assumption is that training on more languages improves zero-shot transfer. We test this on sense-aware tasks-polysemy and lexical semantic change-and find that multilinguality is not necessary for effective transfer. Our large-scale analysis across 28 languages reveals that other factors, such as differences in pretraining and fine-tuning data and evaluation artifacts, better explain the perceived benefits of multilinguality. We also release fine-tuned models and provide empirical baselines to support future research. While focused on two sense-aware tasks, our findings offer broader insights into cross-lingual transfer, especially for low-resource languages.
