Language Anisotropic Cross-Lingual Model Editing
Yang Xu, Yutai Hou, Wanxiang Che, Min Zhang
TL;DR
This paper defines cross-lingual model editing for multilingual pre-trained language models, showing that edits in one language must propagate to others while preserving unrelated knowledge. It introduces a parallel-corpus–based framework to adapt existing monolingual editors to the cross-lingual setting and proposes language anisotropic editing via language-specific masks to target language-dependent parameter subsets. Empirical results on mLAMA and XNLI demonstrate that cross-lingual training significantly improves transferability and reduces language variability, while LiME further boosts performance and narrows language gaps, with analyses revealing learned masks capture language-specific anisotropy. The work highlights practical implications for maintaining consistent behavior across languages and provides code for reproducibility, while acknowledging limits related to parallel data availability and continual learning challenges.
Abstract
Multilingual pre-trained language models can learn task-specific abilities or memorize facts across multiple languages but inevitably make undesired predictions with specific inputs. Under similar observation, model editing aims to post-hoc calibrate a model targeted to specific inputs with keeping the model's raw behavior. However, existing work only studies the monolingual scenario, which lacks the cross-lingual transferability to perform editing simultaneously across languages. In this work, we focus on cross-lingual model editing. Firstly, we define the cross-lingual model editing task and corresponding metrics, where an edit in one language propagates to the others. Next, we propose a framework to naturally adapt monolingual model editing approaches to the cross-lingual scenario using parallel corpus. Further, we propose language anisotropic editing to improve cross-lingual editing by amplifying different subsets of parameters for each language. On the newly defined cross-lingual model editing task, we empirically demonstrate the failure of monolingual baselines in propagating the edit to multiple languages and the effectiveness of the proposed language anisotropic model editing. Our code is publicly available at https://github.com/franklear/LiME.
