MetaMT,a MetaLearning Method Leveraging Multiple Domain Data for Low Resource Machine Translation
Rumeng Li, Xun Wang, Hong Yu
TL;DR
The paper tackles low-resource, multi-domain machine translation by learning a domain-invariant word representation and a meta-learning policy that yields fast adaptation to new domains. It introduces MetaMT, which maps words into a shared semantic space via a base embedding $E^G$ and domain-specific transforms $A^i$, while optimizing a dual-parameter system $\theta_0$ (model) and $\theta_1$ (meta) to facilitate cross-domain transfer. Evaluated on seven English–Spanish datasets with a Transformer backbone, MetaMT achieves BLEU gains of about $1$–$2$ points over strong baselines and excels on a very small EHR corpus (3,020 sentences). The approach is language-agnostic, data-efficient, and extensible to other neural models, offering a practical path for robust MT in resource-limited, domain-diverse settings.
Abstract
Manipulating training data leads to robust neural models for MT.
