An Exploration of Knowledge Editing for Arabic
Basel Mousi, Nadir Durrani, Fahim Dalvi
TL;DR
This work is the first systematic study of knowledge editing in Arabic, addressing the scarcity of Arabic KE benchmarks and the challenges posed by morphology and diglossia. It evaluates four KE methods—ROME, MEMIT, ICE, and LTE—on Arabic translations of ZsRE and Counterfact, demonstrating that parameter-based approaches struggle with cross-lingual transfer, while instruction-tuned, multilingual LTE achieves robust edits across Arabic and other languages. By extending LTE to monolingual Arabic and bilingual Arabic-English training, the authors show improved editability and cross-lingual generalization, with multilingual training outperforming single-language variants. They also release Arabic KE benchmarks and multilingual LTE data to spur future research in low-resource and morphologically rich languages.
Abstract
While Knowledge Editing (KE) has been widely explored in English, its behavior in morphologically rich languages like Arabic remains underexamined. In this work, we present the first study of Arabic KE. We evaluate four methods (ROME, MEMIT, ICE, and LTE) on Arabic translations of the ZsRE and Counterfact benchmarks, analyzing both multilingual and cross-lingual settings. Our experiments on Llama-2-7B-chat show that parameter-based methods struggle with cross-lingual generalization, while instruction-tuned methods perform more robustly. We extend Learning-To-Edit (LTE) to a multilingual setting and show that joint Arabic-English training improves both editability and transfer. We release Arabic KE benchmarks and multilingual training for LTE data to support future research.
