Table of Contents
Fetching ...

Evaluation of Google Translate for Mandarin Chinese translation using sentiment and semantic analysis

Xuechun Wang, Rodney Beard, Rohitash Chandra

TL;DR

An automated assessment of translation quality of Google Translate with human experts using sentiment and semantic analysis indicates that the precision of Google Translate differs both in terms of semantic and sentiment analysis when compared to human expert translations.

Abstract

Machine translation using large language models (LLMs) is having a significant global impact, making communication easier. Mandarin Chinese is the official language used for communication by the government and media in China. In this study, we provide an automated assessment of translation quality of Google Translate with human experts using sentiment and semantic analysis. In order to demonstrate our framework, we select the classic early twentieth-century novel 'The True Story of Ah Q' with selected Mandarin Chinese to English translations. We use Google Translate to translate the given text into English and then conduct a chapter-wise sentiment analysis and semantic analysis to compare the extracted sentiments across the different translations. Our results indicate that the precision of Google Translate differs both in terms of semantic and sentiment analysis when compared to human expert translations. We find that Google Translate is unable to translate some of the specific words or phrases in Chinese, such as Chinese traditional allusions. The mistranslations may be due to lack of contextual significance and historical knowledge of China.

Evaluation of Google Translate for Mandarin Chinese translation using sentiment and semantic analysis

TL;DR

An automated assessment of translation quality of Google Translate with human experts using sentiment and semantic analysis indicates that the precision of Google Translate differs both in terms of semantic and sentiment analysis when compared to human expert translations.

Abstract

Machine translation using large language models (LLMs) is having a significant global impact, making communication easier. Mandarin Chinese is the official language used for communication by the government and media in China. In this study, we provide an automated assessment of translation quality of Google Translate with human experts using sentiment and semantic analysis. In order to demonstrate our framework, we select the classic early twentieth-century novel 'The True Story of Ah Q' with selected Mandarin Chinese to English translations. We use Google Translate to translate the given text into English and then conduct a chapter-wise sentiment analysis and semantic analysis to compare the extracted sentiments across the different translations. Our results indicate that the precision of Google Translate differs both in terms of semantic and sentiment analysis when compared to human expert translations. We find that Google Translate is unable to translate some of the specific words or phrases in Chinese, such as Chinese traditional allusions. The mistranslations may be due to lack of contextual significance and historical knowledge of China.
Paper Structure (23 sections, 11 figures, 7 tables)

This paper contains 23 sections, 11 figures, 7 tables.

Figures (11)

  • Figure 1: A framework representing stages of processes of sentiment and semantic analysis for Mandarin to English, comparing Google Translate and human experts.
  • Figure 2: Framework of top 10 bi-grams and top 10 trigrams for different translations.
  • Figure 3: Top 10 optimistic and pessimistic bigrams and trigrams from Xianyi Yang and Gladys Yang's translation
  • Figure 4: Top 10 optimistic and pessimistic bigrams and trigrams from Julia Lovell's translation
  • Figure 5: Top 10 joking and annoyed bigrams and trigrams from Xianyi Yang and Gladys Yang's translation
  • ...and 6 more figures