Learning Translations via Matrix Completion
Derry Wijaya, Brendan Callahan, John Hewitt, Jie Gao, Xiao Ling, Marianna Apidianaki, Chris Callison-Burch
TL;DR
The paper tackles bilingual lexicon induction under limited parallel data by framing translation learning as matrix completion using multiple noisy signals. It formalizes the translation task as $\hat{X} = P Q^T$, integrating bilingual signals (WIKI, WIKI+CROWD) with auxiliary monolingual and visual cues through a Bayesian Personalized Ranking objective to handle positive-only data and enable multilingual transfer. The approach includes a back-off strategy for cold-start words and explores both linear and nonlinear mappings for monolingual embeddings, demonstrating significant, consistent improvements across 27 languages with top-10 accuracy metrics. The results show state-of-the-art performance and strong generalization, with modular, extensible design and publicly released code and datasets, signaling practical impact for MT in low-resource settings.
Abstract
Bilingual Lexicon Induction is the task of learning word translations without bilingual parallel corpora. We model this task as a matrix completion problem, and present an effective and extendable framework for completing the matrix. This method harnesses diverse bilingual and monolingual signals, each of which may be incomplete or noisy. Our model achieves state-of-the-art performance for both high and low resource languages.
