LangBridge: Multilingual Reasoning Without Multilingual Supervision
Dongkeun Yoon, Joel Jang, Sungdong Kim, Seungone Kim, Sheikh Shafayat, Minjoon Seo
TL;DR
LangBridge presents a zero-shot method to extend reasoning capabilities of language models to multilingual tasks by aligning a multilingual encoder (mT5) with a reasoning LM (e.g., MetaMath or Orca) via a single trainable linear adapter, trained exclusively on English data. The approach relies on the language-agnostic properties of multilingual representations and uses a prefix-language-model objective, keeping the target LM frozen while optionally training the encoder. Across mathematical reasoning, code completion, logical reasoning, and commonsense reasoning, LangBridge yields substantial improvements in low-resource languages and, in several cases, matches or surpasses much larger multilingual baselines. This method reduces the need for multilingual supervision and demonstrates practical impact for multilingual reasoning, with public release of code and models. The findings suggest that language-neutral representations can be effectively transferred to target LMs, offering a scalable path to more inclusive AI systems for underrepresented languages.
Abstract
We introduce LangBridge, a zero-shot approach to adapt language models for multilingual reasoning tasks without multilingual supervision. LangBridge operates by bridging two models, each specialized in different aspects: (1) one specialized in understanding multiple languages (e.g., mT5 encoder) and (2) one specialized in reasoning (e.g., MetaMath). LangBridge connects the two models by introducing minimal trainable parameters between them. Despite utilizing only English data for training, LangBridge considerably enhances the performance of language models on low-resource languages across mathematical reasoning, code completion, logical reasoning, and commonsense reasoning. Our analysis suggests that the efficacy of LangBridge stems from the language-agnostic characteristics of multilingual representations. We publicly release our code and models.
