Table of Contents
Fetching ...

HW-TSC's Submission to the CCMT 2024 Machine Translation Tasks

Zhanglin Wu, Yuanchang Luo, Daimeng Wei, Jiawei Zheng, Bin Wei, Zongyao Li, Hengchao Shang, Jiaxin Guo, Shaojun Li, Weidong Zhang, Ning Xie, Hao Yang

TL;DR

To explore whether large language model (LLM) can help improve the translation quality of NMT systems, supervised fine-tuning is used to train llama2-13b as an Automatic post-editing model to improve the translation results of the NMT model on the multi-domain machine translation task.

Abstract

This paper presents the submission of Huawei Translation Services Center (HW-TSC) to machine translation tasks of the 20th China Conference on Machine Translation (CCMT 2024). We participate in the bilingual machine translation task and multi-domain machine translation task. For these two translation tasks, we use training strategies such as regularized dropout, bidirectional training, data diversification, forward translation, back translation, alternated training, curriculum learning, and transductive ensemble learning to train neural machine translation (NMT) models based on the deep Transformer-big architecture. Furthermore, to explore whether large language model (LLM) can help improve the translation quality of NMT systems, we use supervised fine-tuning to train llama2-13b as an Automatic post-editing (APE) model to improve the translation results of the NMT model on the multi-domain machine translation task. By using these plyometric strategies, our submission achieves a competitive result in the final evaluation.

HW-TSC's Submission to the CCMT 2024 Machine Translation Tasks

TL;DR

To explore whether large language model (LLM) can help improve the translation quality of NMT systems, supervised fine-tuning is used to train llama2-13b as an Automatic post-editing model to improve the translation results of the NMT model on the multi-domain machine translation task.

Abstract

This paper presents the submission of Huawei Translation Services Center (HW-TSC) to machine translation tasks of the 20th China Conference on Machine Translation (CCMT 2024). We participate in the bilingual machine translation task and multi-domain machine translation task. For these two translation tasks, we use training strategies such as regularized dropout, bidirectional training, data diversification, forward translation, back translation, alternated training, curriculum learning, and transductive ensemble learning to train neural machine translation (NMT) models based on the deep Transformer-big architecture. Furthermore, to explore whether large language model (LLM) can help improve the translation quality of NMT systems, we use supervised fine-tuning to train llama2-13b as an Automatic post-editing (APE) model to improve the translation results of the NMT model on the multi-domain machine translation task. By using these plyometric strategies, our submission achieves a competitive result in the final evaluation.
Paper Structure (25 sections, 1 equation, 2 figures, 6 tables)

This paper contains 25 sections, 1 equation, 2 figures, 6 tables.

Figures (2)

  • Figure 1: The overall training flow chart of our NMT system.
  • Figure 2: APE System for the zh$\rightarrow$en multi-domain machine translation task.