Exploring the traditional NMT model and Large Language Model for chat translation

Jinlong Yang; Hengchao Shang; Daimeng Wei; Jiaxin Guo; Zongyao Li; Zhanglin Wu; Zhiqiang Rao; Shaojun Li; Yuhao Xie; Yuanchang Luo; Jiawei Zheng; Bin Wei; Hao Yang

Exploring the traditional NMT model and Large Language Model for chat translation

Jinlong Yang, Hengchao Shang, Daimeng Wei, Jiaxin Guo, Zongyao Li, Zhanglin Wu, Zhiqiang Rao, Shaojun Li, Yuhao Xie, Yuanchang Luo, Jiawei Zheng, Bin Wei, Hao Yang

TL;DR

This work analyzes English–German chat translation with both traditional NMT and Large Language Model approaches in the WMT24 context. It combines a Deep Transformer NMT baseline with techniques such as Minimum Bayesian Risk (MBR) decoding, Regularized Dropout, self-training, back-translation, and model averaging, and compares these against LLM-based strategies including few-shot prompting and LoRA-based supervised fine-tuning. The key finding is that MBR self-training delivers the best results on the chat task, while LLMs struggle with domain shift and require more chat-specific fine-tuning; document-level LLM results show some cross-domain potential but are not consistently superior. The study provides practical guidance on leveraging MBR and enlists clear evidence of the limitations and opportunities in applying LLMs to chat translation, outlining avenues for future work in domain-adaptive, context-aware translation systems.

Abstract

This paper describes the submissions of Huawei Translation Services Center(HW-TSC) to WMT24 chat translation shared task on English$\leftrightarrow$Germany (en-de) bidirection. The experiments involved fine-tuning models using chat data and exploring various strategies, including Minimum Bayesian Risk (MBR) decoding and self-training. The results show significant performance improvements in certain directions, with the MBR self-training method achieving the best results. The Large Language Model also discusses the challenges and potential avenues for further research in the field of chat translation.

Exploring the traditional NMT model and Large Language Model for chat translation

TL;DR

Abstract

This paper describes the submissions of Huawei Translation Services Center(HW-TSC) to WMT24 chat translation shared task on English

Germany (en-de) bidirection. The experiments involved fine-tuning models using chat data and exploring various strategies, including Minimum Bayesian Risk (MBR) decoding and self-training. The results show significant performance improvements in certain directions, with the MBR self-training method achieving the best results. The Large Language Model also discusses the challenges and potential avenues for further research in the field of chat translation.

Paper Structure (18 sections, 6 tables, 1 algorithm)

This paper contains 18 sections, 6 tables, 1 algorithm.

Introduction
Data
Data Size
Data pre-processing
System Overview
Model
MBR Decoding
Regularized Dropout
Self-Training
Back Translation
Model Averaging
LLM Few-shot Prompting
LLM SFT with LoRA
Experiment Setting
Result and Analysis
...and 3 more sections

Exploring the traditional NMT model and Large Language Model for chat translation

TL;DR

Abstract

Exploring the traditional NMT model and Large Language Model for chat translation

Authors

TL;DR

Abstract

Table of Contents