Table of Contents
Fetching ...

Towards Robust Knowledge Representations in Multilingual LLMs for Equivalence and Inheritance based Consistent Reasoning

Gaurav Arora, Srujana Merugu, Shreya Jain, Vaibhav Saxena

TL;DR

The paper investigates whether multilingual LLMs possess language-agnostic representations for reasoning by equivalence and inheritance. It introduces parallel benchmarks across six languages to quantify cross-language conflicts and develops Compositional Representations (CoRe) to bridge distant language representations, improving consistency. Key findings show substantial cross-language conflicts (up to 57.5% for equivalence and up to ~37% for inheritance) and demonstrate that CoRe can reduce conflicts by up to 4.7% and improve some downstream tasks by ~14%. The work highlights systemic gaps in multilingual reasoning and provides a concrete, transferable method to enhance cross-language knowledge transfer in LLMs.

Abstract

Reasoning and linguistic skills form the cornerstone of human intelligence, facilitating problem-solving and decision-making. Recent advances in Large Language Models (LLMs) have led to impressive linguistic capabilities and emergent reasoning behaviors, fueling widespread adoption across application domains. However, LLMs still struggle with complex reasoning tasks, highlighting their systemic limitations. In this work, we focus on evaluating whether LLMs have the requisite representations to reason using two foundational relationships: "equivalence" and "inheritance". We introduce novel tasks and benchmarks spanning six languages and observe that current SOTA LLMs often produce conflicting answers to the same questions across languages in 17.3-57.5% of cases and violate inheritance constraints in up to 37.2% cases. To enhance consistency across languages, we propose novel "Compositional Representations" where tokens are represented as composition of equivalent tokens across languages, with resulting conflict reduction (up to -4.7%) indicating benefits of shared LLM representations.

Towards Robust Knowledge Representations in Multilingual LLMs for Equivalence and Inheritance based Consistent Reasoning

TL;DR

The paper investigates whether multilingual LLMs possess language-agnostic representations for reasoning by equivalence and inheritance. It introduces parallel benchmarks across six languages to quantify cross-language conflicts and develops Compositional Representations (CoRe) to bridge distant language representations, improving consistency. Key findings show substantial cross-language conflicts (up to 57.5% for equivalence and up to ~37% for inheritance) and demonstrate that CoRe can reduce conflicts by up to 4.7% and improve some downstream tasks by ~14%. The work highlights systemic gaps in multilingual reasoning and provides a concrete, transferable method to enhance cross-language knowledge transfer in LLMs.

Abstract

Reasoning and linguistic skills form the cornerstone of human intelligence, facilitating problem-solving and decision-making. Recent advances in Large Language Models (LLMs) have led to impressive linguistic capabilities and emergent reasoning behaviors, fueling widespread adoption across application domains. However, LLMs still struggle with complex reasoning tasks, highlighting their systemic limitations. In this work, we focus on evaluating whether LLMs have the requisite representations to reason using two foundational relationships: "equivalence" and "inheritance". We introduce novel tasks and benchmarks spanning six languages and observe that current SOTA LLMs often produce conflicting answers to the same questions across languages in 17.3-57.5% of cases and violate inheritance constraints in up to 37.2% cases. To enhance consistency across languages, we propose novel "Compositional Representations" where tokens are represented as composition of equivalent tokens across languages, with resulting conflict reduction (up to -4.7%) indicating benefits of shared LLM representations.

Paper Structure

This paper contains 32 sections, 2 equations, 20 figures, 5 tables.

Figures (20)

  • Figure 1: An example of Reasoning by "Equivalence" and Reasoning by "Inheritance" based on existence of equivalence/inheritance relationship between concepts.
  • Figure 2: An example of LLM (Claude-v1 Instant) lacking "equivalence relationship between equivalent concepts across languages" due to tightly coupling of knowledge representation and language expression unlike in Humans.
  • Figure 3: Avg. Rank and Fraction of tokens which had to be replaced in German and Hindi with parallel English token to achieve consistent answer as English.
  • Figure 4: Conflict rate for different language pair types.
  • Figure 5: An example of LLM (Claude v1 Instant) exhibiting reasoning by inheritance in $\mathrm{En}$ but not in $\mathrm{Hi}$.
  • ...and 15 more figures