On the Entity-Level Alignment in Crosslingual Consistency
Yihong Liu, Mingyang Wang, François Yvon, Hinrich Schütze
TL;DR
This work investigates why multilingual LLMs often recall facts inconsistently across languages, proposing that entity-level alignment in a shared conceptual space underpins crosslingual consistency. It introduces an entity translation probing task and the KLAR dataset to quantify alignment and recall, and shows that subject and object alignment correlate strongly with crosslingual recall. To address misalignment, two prompting strategies, SubSub and SubInj, respectively substitute or inject English subject information, yielding substantial gains in recall and crosslingual consistency across model families, especially for smaller or English-centric models. Mechanistic analysis via Logit Lens indicates that these prompts reinforce language-agnostic entity representations in the pivot-language space, which explains why such simple prompts improve multilingual factual prediction and suggests a practical path to more reliable crosslingual knowledge.
Abstract
Multilingual large language models (LLMs) are expected to recall factual knowledge consistently across languages. However, the factors that give rise to such crosslingual consistency, and to its frequent failure, remain poorly understood. In this work, we hypothesize that these inconsistencies may arise from failures in entity alignment, the process of mapping subject and object entities into a shared conceptual space across languages. To test this, we assess alignment through entity-level (subject and object) translation tasks, and find that consistency is strongly correlated with alignment across all studied models, with misalignment of subjects or objects frequently resulting in inconsistencies. Building on this insight, we propose SubSub and SubInj, two effective methods that integrate English translations of subjects into prompts across languages, leading to substantial gains in both factual recall accuracy and consistency. Finally, our mechanistic analysis reveals that these interventions reinforce the alignment of entity representations in the conceptual space through the model's internal pivot-language processing, offering effective and practical strategies for improving multilingual factual prediction.
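
The two prompting strategies named in the abstract can be illustrated with a minimal sketch. It assumes that SubSub replaces the subject in a non-English prompt with its English translation, and that SubInj keeps the original subject and adds the English translation next to it. The function names, the parenthetical injection format, and the German example prompt are hypothetical choices for illustration; the exact prompt templates are not specified here.

```python
def sub_sub(prompt: str, subject: str, subject_en: str) -> str:
    """Sketch of SubSub: substitute the subject with its English translation."""
    if subject not in prompt:
        raise ValueError("subject not found in prompt")
    # Replace only the first occurrence of the subject mention.
    return prompt.replace(subject, subject_en, 1)


def sub_inj(prompt: str, subject: str, subject_en: str) -> str:
    """Sketch of SubInj: keep the subject and inject its English translation.

    The parenthetical format is an assumption, not the paper's template.
    """
    if subject not in prompt:
        raise ValueError("subject not found in prompt")
    return prompt.replace(subject, f"{subject} ({subject_en})", 1)


# Hypothetical German prompt for the fact (France, capital, Paris).
prompt_de = "Die Hauptstadt von Frankreich ist"

print(sub_sub(prompt_de, "Frankreich", "France"))
# Die Hauptstadt von France ist
print(sub_inj(prompt_de, "Frankreich", "France"))
# Die Hauptstadt von Frankreich (France) ist
```

In both variants the rest of the prompt stays in the original language, so only the subject mention changes between the baseline and the intervention.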
