Graph-Assisted Culturally Adaptable Idiomatic Translation for Indic Languages
Pratik Rakesh Singh, Kritarth Prasad, Mohammadi Zaki, Pankaj Wasnik
TL;DR
This work tackles idiomatic translation across Indic languages, where cultural nuance and one-to-many mappings hinder accurate rendering. It introduces IdiomCE, an inductive graph neural network that uses cultural elements as features to model mappings between English idioms and Indic idioms, generalizing to unseen items and supporting pivot-based cross-language translation. The approach includes a data-creation pipeline for cultural features, an inductive GNN with link prediction, a node-duplication strategy to alleviate cold-start, and inter-Indic translation via English as a pivot. Empirical results show notable improvements over static knowledge-graph and prompting baselines across multiple language pairs, including human evaluations that corroborate the quality of idiomatic translations, with particular gains for smaller models in resource-constrained settings.
Abstract
Translating multi-word expressions (MWEs) and idioms requires a deep understanding of the cultural nuances of both the source and target languages. This challenge is further amplified by the one-to-many nature of idiomatic translations, where a single source idiom can have multiple target-language equivalents depending on cultural references and contextual variations. Traditional static knowledge graphs (KGs) and prompt-based approaches struggle to capture these complex relationships, often leading to suboptimal translations. To address this, we propose IdiomCE, an adaptive graph neural network (GNN) based methodology that learns intricate mappings between idiomatic expressions, effectively generalizing to both seen and unseen nodes during training. Our proposed method enhances translation quality even in resource-constrained settings, facilitating improved idiomatic translation in smaller models. We evaluate our approach on multiple idiomatic translation datasets using reference-less metrics, demonstrating significant improvements in translating idioms from English to various Indian languages.
