A Survey of Multilingual Reasoning in Language Models
Akash Ghosh, Debayan Datta, Sriparna Saha, Chirag Agarwal
TL;DR
This survey investigates how language models can perform logical reasoning across multiple languages, highlighting cross-lingual misalignment, data scarcity, and cultural considerations as key obstacles. It organizes the field through a taxonomy of methods (representation alignment, finetuning, prompting, and model editing), catalogues multilingual datasets and benchmarks, and analyzes evaluation metrics and benchmark performance. The authors identify gaps in domain coverage, language diversity, and efficient, scalable reasoning, and propose concrete directions including cross-lingual transfer, explainable reasoning, unified metrics, and multimodal multilingual tasks. Overall, the work provides a structured roadmap for advancing multilingual reasoning in LLMs with practical implications for inclusive, culturally aware AI systems.
Abstract
While reasoning and multilingual capabilities in language models (LMs) have achieved remarkable progress in recent years, their integration into a unified paradigm - multilingual reasoning - is at a nascent stage. Multilingual reasoning requires language models to handle logical reasoning across languages while addressing misalignment, biases, and challenges in low-resource settings. This survey provides the first in-depth review of multilingual reasoning in LMs. In this survey, we provide a systematic overview of existing methods that leverage LMs for multilingual reasoning, specifically outlining the challenges, motivations, and foundational aspects of applying language models to reason across diverse languages. We provide an overview of the standard data resources used for training multilingual reasoning in LMs and the evaluation benchmarks employed to assess their multilingual capabilities. Next, we analyze various state-of-the-art methods and their performance on these benchmarks. Finally, we explore future research opportunities to improve multilingual reasoning in LMs, focusing on enhancing their ability to handle diverse languages and complex reasoning tasks. Rapid growth of evolving developments in this field can be actively tracked on our project page: [https://github.com/AkashGhosh/Survey-of-Multilingual-Reasoning-in-Language-Models](https://github.com/AkashGhosh/Survey-of-Multilingual-Reasoning-in-Language-Models)
