An Empirical Study of Generation Order for Machine Translation

William Chan; Mitchell Stern; Jamie Kiros; Jakob Uszkoreit

An Empirical Study of Generation Order for Machine Translation

William Chan, Mitchell Stern, Jamie Kiros, Jakob Uszkoreit

TL;DR

The paper investigates whether generation order matters in machine translation by leveraging an insertion-based Insertion Transformer and a soft order-reward framework that enables training under arbitrary oracle orders. It systematically evaluatesuninformed, location-based, frequency-based, content-based, and model-based orders on WMT'14 En-De and WMT'18 En-Zh, comparing against a standard Transformer baseline. For English-German, most orders yield BLEU scores close to the Transformer, indicating order is largely non-essential; English-Chinese results vary more, showing sensitivity to generation order and opportunity for tailored ordering. The work demonstrates flexible, partially autoregressive generation and provides a principled training approach to explore diverse generation schemas, opening avenues for non-monotonic generation in MT and related sequence tasks.

Abstract

In this work, we present an empirical study of generation order for machine translation. Building on recent advances in insertion-based modeling, we first introduce a soft order-reward framework that enables us to train models to follow arbitrary oracle generation policies. We then make use of this framework to explore a large variety of generation orders, including uninformed orders, location-based orders, frequency-based orders, content-based orders, and model-based orders. Curiously, we find that for the WMT'14 English $\to$ German translation task, order does not have a substantial impact on output quality, with unintuitive orderings such as alphabetical and shortest-first matching the performance of a standard Transformer. This demonstrates that traditional left-to-right generation is not strictly necessary to achieve high performance. On the other hand, results on the WMT'18 English $\to$ Chinese task tend to vary more widely, suggesting that translation for less well-aligned language pairs may be more sensitive to generation order.

An Empirical Study of Generation Order for Machine Translation

TL;DR

Abstract

An Empirical Study of Generation Order for Machine Translation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)