Language Surgery in Multilingual Large Language Models

Joanito Agili Lopo; Muhammad Ravi Shulthan Habibi; Tack Hwa Wong; Muhammad Ilham Ghozali; Fajri Koto; Genta Indra Winata; Peerat Limkonchotiwat; Alham Fikri Aji; Samuel Cahyawijaya

TL;DR

This work investigates naturally emerging representation alignment in multilingual LLMs, showing that middle-layer representations retain language-specific cues while maintaining cross-lingual alignment. It introduces Inference-Time Language Control (ITLC), a latent-intervention technique that extracts language vectors via Linear Discriminant Analysis from middle-layer states and injects a shift vector during generation to steer decoding toward a target language with minimal semantic loss. ITLC yields strong cross-lingual control on language confusion benchmarks and competitive semantic retention, often matching or approaching the performance of more expensive test-time interventions while requiring only a single middle-layer intervention. The findings deepen our understanding of how representation alignment relates to language-specific information and offer a practical, efficient tool for improving the multilingual capabilities of LLMs in cross-lingual generation.
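The shift-vector idea can be illustrated with a toy sketch. This is not the paper's procedure: ITLC derives its language vectors with Linear Discriminant Analysis, whereas the sketch below swaps in a plain difference of per-language mean hidden states to keep the example dependency-free. The function names (`mean_vector`, `language_shift`, `inject`), the scaling parameter `alpha`, and the three-dimensional "hidden states" are all hypothetical and chosen only for illustration.

```python
from statistics import fmean


def mean_vector(vectors):
    """Component-wise mean of a list of equally sized vectors."""
    return [fmean(column) for column in zip(*vectors)]


def language_shift(source_states, target_states):
    """Difference between target- and source-language mean states.

    Assumption: a mean difference stands in for the LDA-derived
    language vectors described in the text.
    """
    source_mean = mean_vector(source_states)
    target_mean = mean_vector(target_states)
    return [t - s for s, t in zip(source_mean, target_mean)]


def inject(hidden_state, shift, alpha=1.0):
    """Add the scaled shift vector to a single hidden state."""
    return [h + alpha * d for h, d in zip(hidden_state, shift)]


# Toy middle-layer states: the two languages differ only in dimension 1.
english_states = [[1.0, 0.0, 2.0], [3.0, 0.0, 2.0]]
indonesian_states = [[1.0, 4.0, 2.0], [3.0, 4.0, 2.0]]

shift = language_shift(english_states, indonesian_states)
steered = inject([2.0, 0.0, 5.0], shift, alpha=0.5)

print(shift)    # [0.0, 4.0, 0.0]
print(steered)  # [2.0, 2.0, 5.0]
```

In this toy setting the shift changes only the dimension that separates the two languages and leaves the others untouched, which mirrors the stated goal of steering the output language with minimal semantic loss. In a real model the intervention would be applied to the activations of one middle layer during generation.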

Abstract

Large Language Models (LLMs) have demonstrated remarkable generalization capabilities across tasks and languages, revolutionizing natural language processing. This paper investigates the naturally emerging representation alignment in LLMs, particularly in the middle layers, and its implications for disentangling language-specific and language-agnostic information. We empirically confirm the existence of this alignment, analyze its behavior in comparison to explicitly designed alignment models, and demonstrate its potential for language-specific manipulation without semantic degradation. Building on these findings, we propose Inference-Time Language Control (ITLC), a novel method that leverages latent injection to enable precise cross-lingual language control and mitigate language confusion in LLMs. Our experiments highlight ITLC's strong cross-lingual control capabilities while preserving semantic integrity in target languages. Furthermore, we demonstrate its effectiveness in alleviating the cross-lingual language confusion problem, which persists even in current large-scale LLMs, leading to inconsistent language generation. This work advances our understanding of representation alignment in LLMs and introduces a practical solution for enhancing their monolingual and cross-lingual performance.
