On the Hallucination in Simultaneous Machine Translation
Meizhi Zhong, Kehai Chen, Zhengshan Xue, Lemao Liu, Mingming Yang, Min Zhang
TL;DR
This paper investigates hallucination in Simultaneous Machine Translation (SiMT) by analyzing hallucination words through distributional and predictive lenses and by quantifying target-side context usage with a new metric, TSSR. Using Wait-$k$ policies on IWSLT14 De-En and MuST-C Zh-En, the study finds hallucination words exhibit high distribution entropy and high predictive uncertainty, partly due to limited source context. It shows a strong link between target-context reliance and hallucination, and demonstrates that reducing target-context usage via scheduled sampling can modestly improve BLEU and reduce hallucination rates at low latency. The findings offer a practical avenue for mitigating hallucination in SiMT by balancing context usage, while noting limitations related to policy scope and potential alignment biases.
Abstract
It is widely known that hallucination is a critical issue in Simultaneous Machine Translation (SiMT) due to the absence of source-side information. While many efforts have been made to enhance performance for SiMT, few of them attempt to understand and analyze hallucination in SiMT. Therefore, we conduct a comprehensive analysis of hallucination in SiMT from two perspectives: understanding the distribution of hallucination words and the target-side context usage of them. Intensive experiments demonstrate some valuable findings and particularly show that it is possible to alleviate hallucination by decreasing the over usage of target-side information for SiMT.
